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For My Grandson Nicky — 
without young people to continue 
to wonder and care and study and learn, 
it’s all over. 


With all humility, I think “Whatsoever thy hand findeth to do, do it with thy might” infi¬ 
nitely more important than the vain attempt to love one’s neighbor as oneself. If you want 
to hit a bird on the wing, you must have all your will in a focus; you must not be think¬ 
ing about yourself, and, equally, you must not be thinking about your neighbor; you must 
be living in your eye on that bird. Every achievement is a bird on the wing. 

Oliver Wendell Holmes 

If you bring forth what is within you, what you bring forth will save you. If you do not 
bring forth what is within you, what you do not bring forth will destroy you. 

Jesus, The Gospel of Thomas 
in the Nag Hammadi manuscripts 

The more I work and practice, the luckier I seem to get. 

Gary Player 
(professional golfer) 

A witty chess master once said that the difference between a master and a beginning chess 
player is that the beginner has everything clearly fixed in mind, while to the master every¬ 
thing is a mystery. 

N. la. Vilenkin 

Marshall’s Generalized Iceberg Theorem: Seven-eighths of everything can’t be seen. 
Everything should be made as simple as possible, but not simpler. 


Albert Einstein 
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PREFACE TO 
THE INSTRUCTOR 


It is a curious fact that people who write thousand-page textbooks still seem to 
find it necessary to write prefaces to explain their purposes. Enough is enough, 
one would think. However, every textbook — and this one is no exception—is 
both an expression of dissatisfaction with existing books and a statement by the 
author of what he thinks such a book ought to contain, and a preface offers one 
last chance to be heard and understood. Furthermore, anyone who adds to the 
glut of introductory calculus books should be called upon to justify his action (or 
perhaps apologize for it) to his colleagues in the mathematics community. 


I borrow this phrase from my old friend Paul Halmos as a handy label for the 
noise and confusion that have agitated the calculus community for the past dozen 
years or so. Regardless of one’s attitude toward these debates and manifestoes, 
it seems reasonably clear that two opinions lie at the center of it all: first, too 
many students fail calculus; and second, our calculus textbooks are so bad that 
it’s natural for these students to fail. 

About the books, I completely—or almost completely — disagree. By and 
large, our calculus textbooks are written by excellent teachers who love their sub¬ 
ject and write clear expository English. Naturally, each author has a personal 
agenda, and this is what separates their books from one another and provides di¬ 
versity and choice for a healthy marketplace. Some writers prefer to emphasize 
the theoretical parts of calculus. Others are technology buffs. Yet others (like my¬ 
self) want a modest amount of biography and history, and believe that interest¬ 
ing and substantial applications from other parts of mathematics and other sci¬ 
ences are highly desirable. 

But let there be no misunderstanding: textbooks are servants of teachers, and 
not their masters. Any group of ten calculus teachers gathered together in a room 
will have ten very different views of what should be in their courses and how it 
should be taught. They will differ on the proper amount of theory; on how much 
numerical calculation is desirable; on whether or not to make regular use of graph¬ 
ing calculators or computer software; on whether some of the more elaborate ap¬ 
plications to science are too difficult; on whether biography and history are in¬ 
teresting or boring for their students; and so on. But the bottom line is that only 
the teachers themselves are in a position to decide what goes on in their own 
classrooms—and certainly not textbook writers who are completely ignorant of 
local conditions. 
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Those of us who write these books try to provide everything we can think of 
that a teacher might want or need, in full awareness that some parts of what we 
offer have no place in the course plans of many teachers. Every teacher omits 
some sections (and even some chapters) and amplifies others, in accordance with 
individual judgment and personal taste. It is my hope that this book will be use¬ 
ful and agreeable for many diverse tastes and interests. I want it to be a conve¬ 
nient tool for teachers that offers help when help is wanted, and gets out of the 
way when it is not wanted. 

As for the fact that too many of our students fail—if indeed it is a fact—what 
are the reasons for this? To understand these reasons, let us consider for a mo¬ 
ment what is needed for success in calculus. There are clearly three main re¬ 
quirements: a decent background in high school algebra and geometry, some of 
which is remembered and understood; the ability to read closely and carefully; 
and tenacity of purpose. 

In the matter of preparation in algebra and geometry, our students are in deep 
trouble. This is suggested by the fact that a few years ago the United States ranked 
last among the thirteen industrialized nations for the mathematics achievement 
of its high school graduates. As for reading skills and tenacity of purpose, some 
of our young people have these qualities, but the great majority do not. Unfor¬ 
tunately, tenacity of purpose is especially important for genuine success in cal¬ 
culus, because this is a subject in which almost every stage depends on having 
a reasonable command of all that went before, and which therefore requires steady 
application day after day, week after week, for many months. 

We know from our own experience as teachers that calculus is very difficult 
for most students, and we fully understand the reasons why this is so. But im¬ 
proving our high school mathematics education, and arresting the decline of se¬ 
rious reading and instilling tenacity of purpose among the majority of our young 
people, are only remote possibilities. Obviously help from outside is not com¬ 
ing, so we must look within ourselves for better ways of doing our jobs. 

Most of these ways are familiar to us. Regular class meetings over periods of 
many months, with frequent quizzes, are intended to encourage steady applica¬ 
tion to the task of learning. We praise (whenever possible), plead, cajole, and 
warn. We constantly review the elementary mathematics our students either never 
learned or have forgotten. We do today’s homework problems for them in class, 
continually thinking out loud and welcoming questions, in the hope that some of 
the useful ways of thought will rub off to smooth the path for their efforts on to¬ 
morrow’s homework. However, there is one big thing we can do but rarely do. 

Most calculus courses concentrate on the technical details, on developing in 
students the ability to differentiate and integrate lots of functions. We turn out 
many students who can perform these somewhat routine tasks. However, if we 
regularly pause to ask these successful differentiators and integrators just what 
derivatives and integrals actually are, and what they are for, we rarely get a sat¬ 
isfactory answer—by which I mean an answer that reveals genuine understand¬ 
ing on the part of the student. Many can give the standard limit definitions, but 
we should expect more than parroted formal definitions. I believe we ought to 
do a better job of conveying a solid sense of what calculus is really about, what 
its purpose is, why we need the elaborate machinery of methods for computing 
derivatives and integrals, and why the Fundamental Theorem of Calculus is truly 
“fundamental.” In a word, we need to communicate what calculus is for More 


PREFACE TO THE INSTRUCTOR 


XV 


generally, we ought to do more toward encouraging students to learn why things 
are true, rather than merely memorizing ways of solving a few problems to pass 
examinations. It is clear to us, but not to them, that the only way to learn calcu¬ 
lus is to understand it—it is much too massive and complex for mere memoriz¬ 
ing to be more than a temporary stopgap—and we have an obligation to help 
students get this message. 

If we can give more attention to these matters, we have a good chance of mak¬ 
ing calculus less frightening and more relevant for many more students than we have 
in the past. One of the main purposes of this book is to help us move our teach¬ 
ing in this direction, to convey more light to our students—and less mystery. 


1. Early Trig. In the First Edition, I thought it preferable to place trigonometry CHANGES FROM THE 
just before methods of integration. I still agree with myself, but most users think FIRST EDITION 
otherwise. I have therefore inserted an account of sines and cosines in Chapter 1, 
with the calculus of these functions at appropriate places in the following chap¬ 
ters. Since a solid command of trigonometry is so essential for methods of inte¬ 
gration, a full review is still given just before the chapter on these methods (Chap¬ 
ter 10). 

2. Homework Problems. I have added many new problems, mostly of the 
routine drill type, raising the total to well over 7,000. This is an increase of more 
than 15 percent and provides about four times as many as most instructors will 
want to use for their class assignments. 

3. Chapter Summaries. It seems to help students in their efforts to review 
and pull things together if they have the ideas and methods of each chapter boiled 
down to a few pregnant phrases. I have tried to provide this assistance in the sum¬ 
maries at the ends of the chapters. 

4. Appendices. The first edition had several massive appendices totaling hun¬ 
dreds of pages and containing enrichment material that I thought was so inter¬ 
esting that others would be interested, too. Many were, but I failed to realize that 
students barely keeping their heads above water in the regular work of the course 
would take a dim view of any unnecessary burdens. The first two of these long 
appendices were a collection of material that I thought of as “miscellaneous fun 
stuff,” and a biographical history of calculus. These have been removed, aug¬ 
mented, and published separately in a little paperback book called Calculus Gems: 

Brief Lives and Memorable Mathematics (McGraw-Hill, 1992). However, I have 
retained some of this material in greatly abbreviated form and placed it in un¬ 
obtrusive locations throughout the present book. 

5. Theory. The third of the long appendices in the first edition was on the the¬ 
ory of calculus. I have retained this appendix with a few additions because many 
colleges and universities offer honors sections that use this material to provide 
greater theoretical depth than is appropriate for regular sections. Most instruc¬ 
tors seem to agree with me in my desire to avoid cluttering our regular courses 
with any more theory than is absolutely necessary. This approach says: Do not 
try to prove what no one doubts. However, a number of people have asked me 
to expand my very condensed discussion of limits and continuous functions and 
also to give an informal descriptive treatment of the Mean Value Theorem, point¬ 
ing out its practical uses as they arise. This new material can be found at the end 
of Chapter 2. 
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6. Infinite Series. My idea for handling this subject in the first edition was 
not a good one. Most students moving from the first chapter of informal overview 
into the second of detailed systematic treatment were impatient because they 
thought they were wasting their time by studying the same concepts all over 
again. I have therefore completely reorganized these two chapters into a tradi¬ 
tional treatment, with series of constants developed first, and then power series. 

7. Vector Analysis. In the first edition I closed my discussion of vector analy¬ 
sis with Green’s Theorem. However, there seems to be general agreement these 
days that multivariable calculus should go a bit further, and include Gauss’s The¬ 
orem (the divergence theorem) and Stokes’ Theorem. I have rewritten Chapter 
21 accordingly. 

8. The Workman Logo. I thought it would be useful for students if there were 
some way to signal passages in the text that always cause trouble, because most 
students are not accustomed to the very slow and careful reading these passages 
require. The logo I chose for this purpose is copied from a European road sign: 


It suggests that hard work is necessary to get through the adjoining passage. I 
have tried to use it sparingly. 

9. Simplify, Simplify! When writing this book the first time, I thought I was 
aiming at the middle of my target, but many users thought I aimed too high. Dur¬ 
ing the preparation of this revision, I kept a poster with these words on it directly 
in my line of sight as I sat at my work, and of course I looked at this message 
thousands of times. I hope it worked. 


GRAPHING These marvelous tools are great fun to use and can make many contributions to 
CALCULATO RS the teaching and learning of calculus. But like all tools they should be used wisely, 
and this means very different things to different people. A scythe can harvest 
grain or cut off a foot, depending on the skill and judgment of the user. 

Some of those in the calculus reform movement believe that the role of num¬ 
bers and numerical computations should be greatly increased to reach a parity 
with symbolic (algebraic) and geometric ways of thinking. But I believe we should 
stop far short of this. In my opinion, there are five subject areas of calculus in 
which calculators are clearly of great value: 

graphing; 

calculation of limits; 

Newton’s method; 

numerical integration; 

computations using Taylor’s formula. 

In the last four of these areas, our calculators do heavy computational labor for 
us, and we are all grateful. But there are dangers, and one of these is an increasing 
tendency to replace mathematical thinking and learning by button-pushing. 
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The most surprising examples of this that I’ve seen involve teachers whose stu¬ 
dents use graphing calculators— instead of factoring or the quadratic formula— 
to solve quadratic equations as simple as x 2 — 2x — 3 = 0. The procedure is to 
“plot” the function y = x 2 — 2x — 3 on the calculator by pushing suitable buttons 
and then look at the graph the calculator produces to see where it crosses the 
x-axis. These students are enthusiastic about their calculators and enjoy experi¬ 
menting with them, and I applaud the teachers who take advantage of this natural 
interest. But unfortunately, in many cases these students do not know how to sketch 
simple graphs, or how to factor or use the quadratic formula, and are not learning 
these basic methods of elementary algebra. More generally, sketching the graphs 
of functions by thinking is a fundamental part of learning mathematics. Let us use 
calculators in our classes to supplement this thinking—but not to replace it. Let 
us remember that the action that matters takes place in the mind of the student. 

These wonderful graphing calculators are superb instruments when used in the 
right way. It is sobering to reflect that Leibniz himself would perhaps have given 
a year of his life to possess one—Leibniz who not only (along with Newton) 
created calculus, but also invented the first calculating machine that could mul¬ 
tiply and divide as well as add and subtract. 

The many problems in this book that require the use of a calculator are sig¬ 
naled by the standard symbol H. 


This book is intended to be a mainstream calculus text that is suitable for every 
kind of course at every level. It is designed particularly for the standard course 
of three semesters for students of science, engineering, or mathematics. Students 
are expected to have a background of high school algebra and geometry, and 
hopefully, some trigonometry as well. 

The text itself—that is, the 21 chapters without considering Appendix A—is 
traditional in subject matter and organization. I have placed great emphasis on 
motivation and intuitive understanding, and the refinements of theory are down¬ 
played. Most students are impatient with the theory of the subject, and justifi¬ 
ably so, because the essence of calculus does not lie in theorems and how to 
prove them, but rather in tools and how to use them. My overriding purpose has 
been to present calculus as a problem-solving art of immense power that is in¬ 
dispensable in all the quantitative sciences. Naturally, I wish to convince students 
that the standard tools of calculus are reasonable and legitimate, but not at the 
expense of turning the subject into a stuffy logical discipline dominated by ex¬ 
tra-careful definitions, formal statements of theorems, and meticulous proofs. It 
is my hope that every mathematical explanation in these chapters will seem to 
the thoughtful student to be as natural and inevitable as the fact that water flows 
downhill (rather than uphill) along a canyon floor. The main theme of our work 
is what calculus is good for—what it enables us to do and understand—and not 
what its logical nature is as seen from the specialized (and limited) point of view 
of the modern pure mathematician. 

There are several additional features of the book that it might be useful for me 
to comment on. 

Precalculus Material Because of the great amount of calculus that must be cov¬ 
ered, it is desirable to get off to a fast start and introduce the derivative quickly. 
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and to spend as little time as possible reviewing precalculus material. However, 
college freshmen constitute a very diverse group, with widely different levels of 
mathematical preparation. For this reason I have included a first chapter on pre¬ 
calculus material, which I urge teachers to skim over as lightly as they think ad¬ 
visable for their particular students. This chapter is written in enough detail so 
that individual students who need to spend more time on the preliminaries should 
be able to absorb most of it on their own with a little extra effort.* 
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Problems For students, the most important parts of their calculus book may well 
be the problem sets, because this is where they spend most of their time and en¬ 
ergy. There are more than 7,000 problems in this book, including many old stand¬ 
bys familiar to all calculus teachers and dating back to the time of Euler and even 
earlier. I have tried to repay my debt to the past by inventing new problems when¬ 
ever possible. The problem sets are carefully constructed, beginning with routine 
drill exercises and building up to more complex problems requiring higher lev¬ 
els of thought and skill. The most challenging problems are marked with an as¬ 
terisk (*). In general, each set contains approximately twice as many problems 
as most teachers will want to assign for homework, so that a large number will 
be left over for students to use as review material. 

Most of the chapters conclude with long lists of additional problems. Many of 
these are intended only to provide further scope and variety to the problems sets 
at the ends of the sections. However, teachers and students alike should treat these 
additional problems with special care, because a few are quite subtle and diffi¬ 
cult and should be attacked only by students with ample reserves of drive and 
tenacity. 

I should also mention that there are several sections scattered throughout the 
book with no corresponding problems at all. Sometimes these sections occur in 
small groups and are merely convenient subdivisions of what I consider a single 
topic and intend as a single assignment, as with Sections 6.1, 6.2, 6.3, and 6.4, 
6.5. In other cases (e.g.. Sections 15.5 and 19.4), the absence of problems is a 
tacit suggestion that the subject matter of these sections should be touched upon 
only lightly and briefly. 

There are a great many so-called “story problems” spread through the entire 
book. All teachers know that students shudder at these problems, because they 
usually require nonroutine thinking. However, the usefulness of mathematics in 
the various sciences demands that we try to teach our students how to penetrate 
into the meaning of a story problem, how to judge what is relevant to it, and how 
to translate it from words into sketches and equations. Without these skills— 
which are equally valuable for students who will become doctors, lawyers, fi¬ 
nancial analysts, or thinkers of any kind—there is no mathematics education 
worthy of the name. 1 " 

*A more complete exposition of high school mathematics that is still respectably concise can be found 
in my little book, Precalculus Mathematics in a Nutshell (Janson Publications, Dedham, MA, 1981), 
119 pages. 

f l cannot let the opportunity pass without quoting a classic story problem that appeared in The New 
Yorker magazine many years ago. “You know those terrible arithmetic problems about how many 
peaches some people buy, and so forth? Well, here’s one we like, made up by a third-grader who was 
asked to think up a problem similar to the ones in his book: “My father is forty-four years old. My 
dog is eight. If my dog was a human being, he would be fifty-six years old. How old would my fa¬ 
ther plus my dog be if they were both human beings?” 
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Differential Equations and Vector Analysis Each of these subjects is an impor¬ 
tant branch of mathematics in its own right. They should be taught in separate 
courses, after calculus, with ample time to explore their distinctive methods and 
applications. One of the main responsibilities of a calculus course is to prepare 
the way for these more advanced subjects and take a few preliminary steps in 
their direction, but just how far one should go is a debatable question. Some 
writers on calculus try to include mini-courses on these subjects in large chap¬ 
ters at the ends of their books. I disagree with this practice and believe that 
few teachers make much use of these chapters. Instead, in the case of differen¬ 
tial equations I prefer to introduce the subject as early as possible (Section 5.4) 
and return to it in a low-key way whenever the opportunity arises (Sections 5.5, 
7.7, 8.5, 9.6, 17.7, 19.9); and in vector analysis I have responded to review¬ 
ers by including a discussion of Gauss’s Theorem and Stokes’ Theorem in 
Chapter 21. 

Appendix A One of the major ways in which this book is unique and different 
from all its competitors can be understood by examining Appendix A, which I 
will now comment on very briefly. Before doing so, I emphasize that this mate¬ 
rial is entirely separate from the main text and can be carefully studied, dipped 
into occasionally, or completely ignored, as each individual student or instructor 
desires. 

In the main text, the level of mathematical rigor rises and falls in accordance 
with the nature of the subject under discussion. It is rather low in the geometri¬ 
cal chapters, where for the most part I rely on common sense together with in¬ 
tuition aided by illustrations; and it is rather high in the chapters on infinite se¬ 
ries, where the substance of the subject cannot really be understood without 
careful thought. I have constantly kept in mind the fact that most students have 
very little interest in purely mathematical reasoning for its own sake, and I have 
tried to prevent this type of material from intruding any more than is absolutely 
necessary. Some students, however, have a natural taste for theory, and some in¬ 
structors feel as a matter of principle that all students should be exposed to a cer¬ 
tain amount of theory for the good of their souls. This appendix contains virtu¬ 
ally all of the theoretical material that by any stretch of the imagination might 
be considered appropriate for the study of calculus. From the purely mathemat¬ 
ical point of view, it is possible for instructors to teach courses at many differ¬ 
ent levels of sophistication by using—or not using—material selected from this 
appendix. 

Supplements The following supplements have been developed to accompany 
this Second Edition of Calculus with Analytic Geometry. 

A Student Solutions Manual is available for students and contains detailed so¬ 
lutions to the odd-numbered problems. An Instructor s Solutions Manual is avail¬ 
able for instructors and contains detailed solutions to the even-numbered prob¬ 
lems. Also available to instructors adopting the text are a Print Test Bank and an 
algorithmic Computerized Test Bank. 

There are a variety of texts available from McGraw-Hill that support the use 
of specific graphing calculators and mathematical software programs for calcu¬ 
lus. Please contact your local McGraw-Hill representative for more information 
on these titles. 
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As to the flaws and errors that undoubtedly remain—for there are always a 
pesky few that manage to hide no matter how fervently we try to find them— 
there is no one to blame but myself. I will consider it a great kindness if col¬ 
leagues and student users will take the trouble to inform me of any blemishes 
they detect, for correction in future printings and editions. As Confucius said, “A 
man who makes a mistake and doesn’t correct it is making two mistakes.” 


George F. Simmons 



TO THE STUDENT 


Appearances to the contrary, no writer deliberately sets out to produce an un¬ 
readable book; we all do what we can and hope for the best. Naturally, I hope 
that my language will be clear and helpful to students, and in the end only they 
are qualified to judge. However, it would be a great advantage to all of us — 
teachers and students alike—if student users of mathematics textbooks could 
somehow be given a few hints on the art of reading mathematics, which is a very 
different thing from reading novels or magazines or newspapers. 

In high school mathematics courses, most students are accustomed to tackling 
their homework problems first, out of impatience to have the whole burdensome 
task over and done with as soon as possible. These students read the explana¬ 
tions in the text only as a last resort, if at all. This is a grotesque reversal of rea¬ 
sonable procedure, and makes about as much sense as trying to put on one’s 
shoes before one’s socks. I suggest that students should read the text first, and 
when this has been thoroughly assimilated, then and only then turn to the home¬ 
work problems. After all, the purpose of these problems is to nail down the ideas 
and methods described and illustrated in the text. 

How should a student read the text in a book like this? Slowly and carefully, 
and in full awareness that a great many details have been deliberately omitted. 
If this book contained every detail of every discussion, it would be five times as 
long, which God forbid! There is a saying of Voltaire: “The secret of being a 
bore is to tell everything.” Every writer of a book of this kind tries to walk a nar¬ 
row path between saying too much and saying too little. 

The words “clearly,” “it is easy to see,” and similar expressions are not in¬ 
tended to be taken literally, and should never be interpreted by any student as a 
putdown on his or her abilities. These are code-phrases that have been used in 
mathematical writing for hundreds of years. Their purpose is to give a signal to 
the careful reader that in this particular place, the exposition is somewhat con¬ 
densed, and perhaps a few details of calculations have been omitted. Any phrase 
like this amounts to a friendly hint to the student that it might be a good idea to 
read even more carefully and thoughtfully in order to fill in omissions in the ex¬ 
position, or perhaps get out a piece of scratch paper to verify omitted details of 
calculations. Or better yet, make full use of the margins of this book to empha¬ 
size points, raise questions, perform little computations, and correct misprints. 


George R Simmons 
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Everyone knows that the world in which we live is dominated by motion and 
change. The earth moves in its orbit around the sun; a rock thrown upward slows 
and stops, and then falls back to earth with increasing speed; the population of 
India grows each year at an increasing rate; and radioactive elements decay. These 
are merely a few items in the endless array of phenomena for which mathemat¬ 
ics is the most natural medium of communication and understanding. As Galileo 
said more than 300 years ago, “The Great Book of Nature is written in math¬ 
ematical symbols.” 

Calculus is that branch of mathematics whose primary purpose is the study of 
motion and change. It is an indispensable tool of thought in almost every field 
of pure and applied science—in physics, chemistry, biology, astronomy, geol¬ 
ogy, engineering, and even some of the social sciences. It also has many impor¬ 
tant uses in other parts of mathematics, especially geometry. By any standard, 
the methods and applications of calculus constitute one of the greatest intellec¬ 
tual achievements of civilization, and to become acquainted with these ideas is 
to open many doors that lead to a broader and richer life of the mind. 

The main objects of study in calculus are functions. But what is a function? 
Roughly speaking, it is a rule or law that tells us how one variable quantity de¬ 
pends upon another. This is the master concept of the exact sciences. It offers us 
the prospect of understanding and correlating natural phenomena by means of 
mathematical machinery of great and sometimes mysterious power. The concept 
of a function is so vitally important for all our work that we must strive to clar¬ 
ify it beyond any possibility of confusion. This purpose is the theme of the 
present chapter. 

The following sections contain a good deal of material that many readers have 
studied before. Some will welcome the opportunity to review and refresh their 
ideas. Those who already understand this material and find it irksome to tread 
the same path over again may discover some interesting sidelights and stimulat¬ 
ing challenges among the Additional Problems at the end of the chapter. This 
chapter is intended solely for purposes of review. It can be studied carefully, or 
lightly, or even skipped altogether, depending on the reader’s level of prepara¬ 
tion. The actual subject matter of this course begins in Chapter 2, and it would 
be very unfortunate if even a single student should come to feel that this pre¬ 
liminary chapter is more of an obstacle than a source of assistance—for its only 
purpose is to smooth the way. 


1.1 

INTRODUCTION 


l 
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1.2 

THE REAL LINE AND 
COORDINATE PLANE. 
PYTHAGORAS 


Most of the variable quantities we study—such as length, area, volume, posi¬ 
tion, time, and velocity—are measured by means of real numbers, and in this 
sense calculus is based on the real number system. It is true that there are other 
important and useful number systems—for instance, the complex numbers. It is 
also true that two- and three-dimensional treatments of position and velocity re¬ 
quire the use of vectors. These ideas will be examined in due course, but for a 
long time to come the only numbers we shall be working with are the real num¬ 
bers.* 

It is assumed in this book that students are familiar with the elementary alge¬ 
bra of the real number system. Nevertheless, in this section we give a brief de¬ 
scriptive survey that may be helpful. For our purposes this is sufficient, but any 
reader who wishes to probe more deeply into the nature of real numbers will find 
a more precise discussion in Appendix A.l at the back of the book. 

The real number system contains several types of numbers that deserve spe¬ 
cial mention: the positive integers (or natural numbers) 


1, 2, 3, 4, 5, ; 


the integers 

. . . , -3, -2, -1,0, 1, 2, 3, ... ; 

and the rational numbers , which are those real numbers that can be represented 
as fractions (or quotients of integers), such as 

f, 4, 0, -5, 3.87, 2 

A real number that is not rational is said to be irrational; for example, 

V2, V3, V2 + V3, V5, VE, and ir 
are irrational numbers. 

We take this opportunity to remind the reader that for any positive number a, 
the symbol Va always means its positive square root. Thus, V4 is equal to 2 
and not —2, even though ( — 2) 2 = 4. If we wish to designate both square roots 
of 4, we must write ±V4. Similarly, Xfa always means the positive nth root of 
a. 


THE REAL LINE 

The use of the real numbers for measurement is reflected in the very convenient 
custom of representing these numbers graphically by points on a horizontal 
straight line (Fig. 1.1). 

This representation begins with the choice of an arbitrary point as the origin 
or zero point, and another arbitrary point to the right of it as the point 1. The dis- 


*The adjective “real” was originally used to distinguish these numbers from numbers like V— 1, 
which were once thought to be “unreal” or “imaginary.” 

f Our aims in the present book are almost entirely practical. Nevertheless, our discussions often give 
rise to certain “impractical” questions that some readers may find interesting and appealing. As an 
example, how do we know that the number V2 is irrational? For readers with the time and inclina¬ 
tion to pursue such questions—and also because we consider the answers to be worth knowing about 
for their own sake—we offer food for further thought in a little paperback book entitled Calculus 
Gems: Brief Lives and Memorable Mathematics (McGraw-Hill, 1992). Some of the facts about irra¬ 
tional numbers, with proofs, are discussed in Section B.2 of this book. 



1.2 THE REAL LINE AND COORDINATE PLANE. PYTHAGORAS 


3 


-l-f 2 y/2 y/3 2 3 7T 

_ 2 o_1 2 3 

Unit ^ 

distance Figure 1. 


tance between these two points (the unit distance) then serves as a scale by means 
of which we can assign a point on the line to every positive and negative inte¬ 
ger, as illustrated in the figure, and also to every rational number. Notice that all 
positive numbers lie to the right of 0, in the “positive direction,” and all negative 
numbers lie to the left.* The method of assigning a point to a rational number is 
shown in the figure for the number \ = 2y: the segment between 2 and 3 is sub¬ 
divided by two points into three equal segments, and the first of these points is 
labeled 2\. This procedure of using equal subdivisions clearly serves to deter¬ 
mine the point on the line which corresponds to any rational number whatever. 
Furthermore, this correspondence between rational numbers and points can be 
extended to irrational numbers; for the decimal expansion of an irrational num¬ 
ber, such as 

V2= 1.414..., V3 = 1.732..., 77- = 3.14159- 

can be interpreted as a set of instructions specifying the exact position of the cor¬ 
responding point. For example, by looking at the expansion we see that the point 
corresponding to V2 lies between 1 and 2, between 1.4 and 1.5, between 1.41 
and 1.42, and so on, and these requirements uniquely determine the position of 
the corresponding point. 

We have described a one-to-one correspondence between all real numbers and 
all points on the line which establishes these numbers as a coordinate system for 
the line. This coordinatized line is called the real line. It is convenient and cus¬ 
tomary to merge the logically distinct concepts of the real number system and 
the real line, and we shall freely speak of points on the line as if they were num¬ 
bers and of numbers as if they were points on the line. Thus, such mixed ex¬ 
pressions as “irrational point” and “the segment between 2 and 3” are quite nat¬ 
ural and will be used without further comment. 

INEQUALITIES 

The left-to-right linear succession of points on the real line corresponds to an im¬ 
portant part of the algebra of the real number system, that dealing with inequal¬ 
ities. These ideas play a larger role in calculus than in earlier mathematics courses, 
so we briefly recall the essential points. 

The geometric meaning of the inequality a < b (read “a is less than b”) is sim¬ 
ply that a lies to the left of b\ the equivalent inequality b > a (“£ is greater than 
a') means that b lies to the right of a. A number a is positive or negative ac¬ 
cording as a > 0 or a < 0. The main rules used in working with-inequalities are 
the following: 


The real line. 


*The arrowhead on the right end of the real line indicates the positive direction and nothing more. 
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1. If a > 0 and b < c, then ab < ac. 

2. If a < 0 and b < c, then ab > ac. 

3. If a < b, then a + c < b + c for any number c. 

Rules 1 and 2 are usually expressed by saying that an inequality is preserved on 
multiplication by a positive number, and reversed on multiplication by a nega¬ 
tive number; and rule 3 says that an inequality is preserved when any number 

(positive or negative) is added to both sides. It is often desirable to replace an 

inequality a > b by the equivalent inequality a — b > 0, with rule 3 being used 
to establish the equivalence. 

If we wish to say that a is positive or equal to 0, we write a > 0 and read this 
“a is greater than or equal to zero.” Similarly, a > b means that a > b or a = b. 
Thus, 3 > 2 and 3 > 3 are both true inequalities. 

We also recall that a product of two or more numbers is zero if and only if 
one of its factors is zero. If none of its factors are zero, it is positive or negative 
according as it has an even or an odd number of negative factors. 


ABSOLUTE VALUES 

The absolute value of a number a is denoted by \a\ and defined by 

ii a if a > 0, 

|a| = l-a if a < 0. 

For example, |3| = 3, \—2\ = —( — 2) = 2, and |0| = 0. It is clear that the opera¬ 
tion of forming the absolute value leaves positive numbers unchanged and re¬ 
places each negative number by the corresponding positive number. The main 
properties of this operation are 

\ab\ = \a\\b\ and \a + b\ < \a\ + \b\. 


In geometric language, the absolute value of a number a is simply the distance 
from the point a to the origin. Similarly, the distance from a to b is \a — b\. 

To solve an equation such as \x + 2\ = 3, we can write it in the form 
\x — (— 2)| = 3 and think of it as saying that “the distance from jc to — 2 is 3 ” 
With Fig. 1 . 1 in mind, it is evident that the solutions are jc = 1 and jc = —5. We 
can also solve this equation by using the fact that \x + 2\ = 3 means that jc + 
2 = 3 or jc + 2 = —3; the solutions are jc = 1 and jc = —5, as before. 


INTERVALS 

The sets of real numbers we shall be dealing with most frequently are intervals. 
An interval is simply a segment on the real line. If its endpoints are the numbers 
a and b, then the interval consists of all numbers that lie between a and b. How¬ 
ever, we may or may not want to include the endpoints themselves as part of the 
interval. 

To be more precise, suppose that a and b are numbers, with a < b. The closed 
interval from a to b, denoted by [a, b ]—using brackets—includes its endpoints, 
and therefore consists of all real numbers jc such that a < jc < b. Parentheses are 
used to indicate excluded endpoints. The interval (a, b), with both endpoints 
excluded, is called the open interval from a to b; it consists of all jc such that 
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a < x < b. Sometimes we wish to include only one endpoint in an interval. Thus, 
the intervals denoted by [a, b) and ( a , b] are defined by the inequalities a < jc < 
b and a < x < b, respectively. In each of these cases, any number c such that 
a < c < b is called an interior point of the interval (Fig. 1.2). 

Strictly speaking, the notations a<x<b and [a, b] have different meanings 
—the first represents a restriction imposed on x, while the second denotes a set 
—but both designate the same interval. We will therefore consider them to be 
equivalent and use them interchangeably, and the reader should become familiar 
with both. However, the geometric meaning of the notation a < x < b is more 
easily grasped by the eye, and for this reason we usually prefer it to the other. 

A half-line is often considered to be an interval extending to infinity in one 
direction. The symbol (read “infinity”) is frequently used in designating such 
an interval. Thus, for any real number a the intervals defined by the inequalities 
a < x and jc < a can be written as a < x < ©o and — < jc < a, or equivalently 

as ( a , oo) and (—<*>, a\ Remember, however, that the symbols °o and —°o do not 
denote real numbers; they are used in this manner only as a convenient way of 
emphasizing that x is allowed to be arbitrarily large (in either the positive or neg¬ 
ative direction). As an aid in keeping the notation clear in one’s mind, it may be 
helpful to think of —oo and as “fictitious numbers” located at the left and right 
“ends” of the real line, as suggested in Fig. 1.3. Also, it is sometimes convenient 
to think of the entire real line itself as an interval, — ©o < * < oo or ( — oo, oo). 

Sets of numbers described by means of inequalities and absolute values are 
often intervals. It is clear, for instance, that the set of all jc such that |jc| <2 is 
the interval -2 < jc < 2 or (-2, 2). 


Interior point 
/ 

a c b 


Open: a < x < b or (a, b) 
Figure 1.2 Intervals. 
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a 

b 

Closed: 

a < x < b or (a, b ] 

a 
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0 

—OO - % ; oo 

Figure 1.3 


Example 1 Solve the inequality x 2 — 2 < x. 

Solution To “solve” an inequality like this means to find all numbers jc for which 
the inequality is true. We begin by writing it as jc 2 - x — 2 < 0, and then we 
write it in the factored form 


(jc + 1 )(jc - 2) < 0. 

For this to be true, the two factors must have opposite signs: jc + 1 > 0 and jc — 
2 < 0, or jc + 1 < 0 and jc - 2 > 0. These conditions are equivalent to jc > — 1 
and jc < 2, or jc < — 1 and jc > 2. The second pair of conditions is easily seen to 
be impossible. The first pair of conditions means that jc lies in the open interval 
— 1 < jc < 2, and these jc’s constitute the solution of the given inequality. 


THE COORDINATE PLANE 

Just as real numbers are used as coordinates for points on a line, pairs of real 
numbers can be used as coordinates for points in a plane. For this purpose we 
establish a rectangular coordinate system in the plane, as follows. 

Draw two perpendicular straight lines in the plane, one horizontal and the other 
vertical, as shown in Fig. 1.4. These lines are called the x-axis and y-axis, re¬ 
spectively, and their point of intersection is called the origin. Coordinates are as¬ 
signed to these axes in the manner described earlier, with the origin as the zero 
point on both and the same unit of distance measurement on both. The positive 
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Figure 1.4 The coordinate plane or 
xy-plane. 
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jc-axis is to the right of the origin and the negative jc-axis to the left, as before; 
and the positive y-axis is above the origin and the negative y-axis below. 

Now consider a point P anywhere in the plane. Draw a line through P paral¬ 
lel to the y-axis, and let jc be the coordinate of the point where this line crosses 
the jc-axis. Similarly, draw a line through P parallel to the jc-axis, and let y be the 
coordinate of the point where this line crosses the y-axis. The numbers jc and y 
determined in this way are called the jc -coordinate and y-coordinate of P. In re¬ 
ferring to the coordinates of P, it is customary to write them as an ordered pair 
(jc, y) with the jc-coordinate written first; we say that P has coordinates (jc, y).* 
This correspondence between P and its coordinates establishes a one-to-one cor¬ 
respondence between all points in the plane and all ordered pairs of real num¬ 
bers; for P determines its coordinates uniquely, and by reversing the process we 
see that each ordered pair of real numbers uniquely determines a point P with 
these numbers as its coordinates. As in the case of the real line, it is customary 
to drop the distinction between a point and its coordinates, and to speak of “the 
point (jc, y)” instead of “the point with coordinates (jc, y).” The coordinates jc and 
y of the point P are sometimes called the abscissa and ordinate of P. Notice par¬ 
ticularly that points (jc, 0) lie on the jc-axis, that points (0, y) lie on the y-axis, 
and that (0, 0) is the origin. Also, the axes divide the plane into four quadrants, 
as shown in Fig. 1.4, and these quadrants are characterized as follows by the 
signs of jc and y: first quadrant, jc > 0 and y > 0; second quadrant, x < 0 and y > 
0; third quadrant, jc < 0 and y < 0; fourth quadrant, jc > 0 and y < 0. 

When the plane is equipped with the coordinate system described here, it is 
usually called the coordinate plane or the xy-plane. 

THE DISTANCE FORMULA 

Much of our work involves geometric ideas — right triangles, similar triangles, 
circles, spheres, cones, etc. — and we assume that students have acquired a rea¬ 
sonable grasp of elementary geometry from earlier mathematics courses. A ma- 


*In practice, the use of the same notation for ordered pairs as for open intervals never leads to con¬ 
fusion, because in any specific context it is always clear which is meant. 
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a 2 + b 2 = c 2 


a 



a b 



Figure 1.5 

and a proof. 


jor fact of particular importance is the Pythagorean theorem: In any right trian¬ 
gle, the sum of the squares of the legs equals the square of the hypotenuse (Fig. 

1.5). There are many proofs of this theorem, but the following is probably sim¬ 
pler than most. Let the legs be a and b and the hypotenuse c, and arrange four 
replicas of the triangle in the corners of a square of side a + b, as shown on the 
right in Fig. 1.5. Then the area of the large square equals 4 times the area of the 
triangle plus the area of the small square; that is, 

(a 4- b) 2 = 4 (\ab) + c 2 . 

This simplifies at once to a 2 + b 2 = c 2 , which is the Pythagorean theorem. 

As the first of many applications of this fact, we obtain the formula for the 
distance d between any two points in the coordinate plane. If the points are 
P i = (jci, y i) and P 2 = (x 2 , y 2 )> then the segment joining them is the hypotenuse 
of a right triangle (Fig. 1.6) with legs \x\ — * 2 ! and |yi — y 2 |- By the Pythago- y 
rean theorem, 

y 2( , - 

d 2 = \x 1 - x 2 \ 2 + |yi - yi\ 2 

= (*1 - X2) 2 + (y\ ~ yi ) 2 , 

so ,_ 

d = V(x 1 - x 2 ) 2 + (y\ ~ y 2 ) 2 - (1) 

This is the distance formula. ****—— 

Example 2 The distance d between the points (—4, 3) and (3, —2) in Fig. 1.4 Figure 1.6 
is 

d = V(—4 - 3) 2 + (3 + 2) 2 = V74. 

Notice that in applying formula (1) it does not matter in which order the points 
are taken. 


Example 3 Find the lengths of the sides of the triangle whose vertices are P\ = 
(-1,-3), P 2 = (5, — 1), P 3 = ( — 2, 10). 

By (1), these lengths are 

P\P 2 = V(— 1 - 5) 2 + (-3 + l) 2 = vTo = 2 v7o, 

P 1 P 3 = V(-l + 2 ) 2 + (-3 - 10 ) 2 = Vno, 

P 2 P 3 = V(5 + 2) 2 + (-1 - 10) 2 = VT70. 

These calculations reveal that the triangle is isosceles, with P\P 3 and P 2 P?> as the 
equal sides. 


The Pythagorean theorem 


p 2 = (*2> T 2 ) 
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THE MIDPOINT FORMULAS 

It is often useful to know the coordinates of the midpoint of the segment joining 
two given distinct points. If the given points are P\ = (x\, yO and P 2 = (*2> J 2 X 
and if P = (x, y) is the midpoint, then it is clear from Fig. 1.7 that x is the mid¬ 
point of the projection of the segment on the x-axis, and similarly for y. This tells 
us (examine the figure—and think!) that x = x\ + {(*2 - x\) and y - y 1 + 
ky2 -^i). so 

X = kx\ + X 2 ) and y = ^(yi + y 2 ). 

Another way of obtaining these formulas is to notice from Fig. 1.7 that* — x\ = 
X 2 ~ x, so 2x — x\ + X 2 or x = \{x\ + * 2 ), with the same argument applying to 
y. Similarly, if P is a trisection point of the segment joining P\ and P 2 , its coor¬ 
dinates can be found from the fact that x and y are trisection points of the cor¬ 
responding segments on the x-axis and y-axis. 



Example 4 In any triangle, the segment joining the midpoints of two sides is 
parallel to the third side and half its length. We know this from elementary geom¬ 
etry; but to prove it by our methods, we begin by noticing that the triangle can 
always be placed in the position shown in Fig. 1.8, with its third side along the 
positive x-axis and the left endpoint of this side at the origin. We then insert the 
midpoints of the other two sides, as shown, and observe that since they have the 
same y-coordinate, the segment joining them is parallel to the third side lying 
on the x-axis. The length of this segment is simply the difference between the 
x-coordinates of its endpoints, 

a + b _ b_ _ a_ 

~2 2 ~ 2 ’ 

which is half the length of the third side. 


This example illustrates the way in which coordinates can often be used to 
give algebraic proofs of geometric theorems. The device employed here, of plac¬ 
ing the figure in a convenient position relative to the coordinate system, has the 
purpose of simplifying the algebra. 


NOTE ON PYTHAGORAS 

Who was this Pythagoras, whose name is 
attached to the great theorem of geometry we have just been 
using? And why should we care? 

The pre-Socratic philosophers of ancient Greece—that is, 
those who lived before the time of Socrates (470?-399 B.C.) 
—were one of the most remarkable and influential groups 
of people in human history. The best known of these was 
Pythagoras of Samos (580?-500? B.C.), a mathematician, 


scientist, and mystic whose ideas live on today as part of 
the bone and flesh of our modem civilization. 

Greek geometry was certainly one of the half-dozen 
supreme intellectual achievements of all time. Pythagoras’ 
master Thales (625?-547? B.C.) had created geometry as 
the contemplation of abstract patterns of lines and figures 
and constructed the first proofs of the first theorems. But 
Pythagoras was the first person to see geometry as an orga- 
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nized system of thought held together by deductive proof, 
with one theorem depending on another in a tightly woven 
fabric of logic. Also, tradition tells us that he himself dis¬ 
covered many theorems, most notably, the fact that the sum 
of the angles in any triangle equals two right angles, and the 
famous Pythagorean theorem discussed above. 

Pythagoras was bom on the beautiful island of Samos, a 
mile or two off the Aegean coast of Turkey and a good day’s 
walk along the shore from Thales’ home town of Miletus. 
At the age of about 50, he migrated from Samos to the Greek 
colony of Crotona in southern Italy, where he established 
the famous Pythagorean school, a quasi-religious society 
with a solid claim to the honor of being the world’s first uni¬ 
versity. The Pythagoreans were best known for two teach¬ 
ings: the doctrine of transmigration of souls at death from 
one body into another, and the theory that numbers consti¬ 
tute the true essence of all things. Believers performed rites 
of purification and followed strict moral and dietary rules 
(no sex, no meat) to enable their souls to rise to higher lev¬ 
els of spirituality in subsequent lives. Their beliefs also led 
them to consider the sexes as equal and to treat animals and 
slaves humanely. For who knows? In a subsequent life one 
might return as a slave, or one’s soul might take up resi¬ 
dence in an animal’s body, or even—alas! — an insect’s. 

As a way of achieving purification of the mind, the 
Pythagoreans studied geometry, arithmetic, music, and as¬ 
tronomy—arithmetic not in the sense of useful computa¬ 
tional skills but rather as the abstract theory of numbers. 
They were particularly fond of the “figurate numbers,” 
which arise by arranging dots or points in regular geomet¬ 
ric patterns. For example, there are the square numbers 1, 
4,9, 16, ... : 


• • • • 


• • • • • • • 


• • ••• •••• 



As indicated, each square number can be obtained from its 
predecessor by adding an L-shaped border called a gnomon, 
meaning a carpenter’s square. Since the successive gnomons 
are the successive odd numbers, it is immediately clear from 
the square arrays that the sum of the first n odd numbers 
equals n 2 : 

1 + 3 + 5 + ■ • • + (In - 1) = n 2 . 

Who would have believed that the common odd numbers 
and the relatively rare perfect squares are related in such a 
simple yet remarkable way? The Pythagoreans were fasci¬ 
nated, and rightly so, by the grave and beautiful games that 
numbers play with each other—games that seemed to them 
to take place outside of space and time and to be quite in¬ 
dependent of the human mind itself. 

Further, Pythagoras performed the first deliberate scien¬ 
tific experiment, on the relation between positive whole 
numbers and the musical notes emitted by a plucked lyre 
string. Also, he was the first person to conceive the 
supremely daring conjecture that the world is an ordered, 
understandable whole, and he applied the word kosmos — 
which previously meant order or harmony—to this whole. 

In these and other ways Pythagoras was one of the prime 
creators of the Western civilization that sustains us all—as 
fish are sustained by the water in which they swim. 


PROBLEMS 


1 Among the words “integer,” “rational,” and “irrational,” 


state the ones that apply to 

(a) -f; 

(b) 0; 

(c) 9; 

(d) 0.75; 

(e) -V49; 

(f) 1 /tt; 

(g) 9.000 . . . ; 

(h) 3 I/2 ; 

(i) 

(j) T- 


2 Every integer is either even or odd. The even integers are 
those that are divisible by 2, so n is even if and only if 
it has the form n = 2k for some integer k. The odd inte¬ 
gers are those that have the form n = 2k + 1 for some 
integer k. 


(a) If n is even, prove that n 2 is also even. 

(b) If n is odd, prove that n 2 is also odd. 


In Problems 3-12, rewrite the given expression without using 
the absolute value symbol. 


3 

5 

7 

9 

11 

13 


|7 - 18|. 
k- 3|. 

\x - 5| if x < 5. 
\x z + 10|. 

|l - 3x 2 \ ifx> 1. 


4 |7| — | 18|. 

6 |3 - 7r|. 

8 \x - 5| if x > 5. 
10 |-ll| - | — 10|. 

12 |VTo-io|. 


Solve the following inequalities: 

(a) x(x - 1) > 0; 

(b) (x - l)(;t + 2) < 0; 
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(c) x 2 + 4x - 21 > 0; 

(d) lx 2 + x < 3; 

(e) 4x 2 + 10* - 6 < 0; 

(f) x 2 + 2x + 4> 0. 

14 Recall that Va is a real number if and only if a > 0, and 
find the values of jc for which each of the following is a 
real number: 

(a) V4 - .r 2 ; (b) V* 2 - 9; 

(c) '— ; (d) ' 

V4 — 3* Vjc 2 — jc — 12 

15 Find the values of x for which each of the following is 
positive: 


JC 

(b) 

JC 

x 2 + 4’ 

x 2 - 4’ 

x+\ 

(d) 

* 2 - 1 

x — 3 ’ 

* 2 - 3x 


16 State the values of a for which the following inequali¬ 
ties are valid: 

(a) a < a\ (b) a < a. 

17 If a < b and b < a, what conclusion can be drawn about 
a and bl 

18 (a) If a < b is true, is it also necessarily true that 

a<bl 

(b) If a < b is true, is it also necessarily true that 
a < bl 

19 State whether each pair of points lies on a horizontal or 
a vertical line: 

(a) (-2, -5), (-2, 3); (b) (-2, -5), (7, -5); 

(c) (-3, 4), (6, 4); (d) (2, -11), (2, 5); 

(e) (2, 2), (-13, 2); (f) (-7, -7), (-7, 7); 

(g) (3, 5), (3,-2); (h)(-l,-2), (2,-2). 

20 Three vertices of a rectangle are (-1, 2), (3, -5), 
( — 1, —5). What is the fourth vertex? 

21 Find the distance between each pair of points: 

(a) (1,2), (6, 7); (b) (2, 5), (-1, 3); 

(c) (-7, 3), (1, -2); (d) (a, b), ( b , a). 

22 In Problem 21 find the midpoint of the segment joining 
each pair of points. 

23 Draw a sketch indicating the points (x, y) in the plane for 
which 

(a) jc < 2; 

(b) - 1 < y < 2; 

(c) 0 < jc < 1 and 0 < y < 1; 

(d) x = — 1 ; 

(e) y = 3; 

(f) * = y. 

24 Use the distance formula to show that the points (—2, 1), 
(2, 2), and (10, 4) lie on a straight line. 

25 Show that the point (6, 5) lies on the perpendicular bi¬ 
sector of the segment joining the points (-2, I) and 
(2, -3). 


26 Show that the triangle whose vertices are (3, -3), 
(-3, 3), and (3V3, 3\/3) is equilateral. 

27 The two points (2, —2) and ( — 6 , 5) are the endpoints of 
a diameter of a circle. Find the center and radius of the 
circle. 

28 Find every point whose distance from each of the two 
coordinate axes equals its distance from the point (4, 2). 

29 Find the point equidistant from the three points ( — 9, 0), 
( 6 , 3), and (-5, 6 ). 

30 If a and b are any two numbers, convince yourself that: 

(a) the points (a, b) and {a, ~b) are symmetric with re¬ 
spect to the jc-axis; 

(b) {a, b) and ( — a, b) are symmetric with respect to the 
y-axis; 

(c) {a, b) and {—a, —b) are symmetric with respect to 
the origin. 

31 What symmetry statement can be made about the points 
(a, b) and ( b, a)l 

32 In each case, place the figure in a convenient position 
relative to the coordinate system and prove the statement 
algebraically: 

(a) The diagonals of a parallelogram bisect each other. 

(b) The sum of the squares of the diagonals of a paral¬ 
lelogram equals the sum of the squares of the sides. 

(c) The midpoint of the hypotenuse of a right triangle is 
equidistant from the three vertices. 

Use the fact stated in (c)to show that when the acute an¬ 
gles of a right triangle are 30° and 60°, the side opposite 
the 30° angle is half the hypotenuse. 

33 In an isosceles right triangle, both acute angles are 45°. 
If the hypotenuse is h , what is the length of each of the 
other sides? 

34 Let P\ = (jci , v j) and P 2 = (x 2 , V 2 ) be distinct points. If 
P = U, y) is on the segment joining P\ and P 2 and one- 
third of the way from P\ to P 2 , show that 

x = y( 2 *i + x 2 ) and y = y( 2 vi + y 2 )■ 

Find the corresponding formulas if P is two-thirds of the 
way from P\ to P 2 . 

35 Consider an arbitrary triangle with vertices (jcj, Vi), (x 2 , 
y 2 ), and (JC 3 , ya). Find the point on each median which is 
two-thirds of the way from the vertex to the midpoint of 
the opposite side.* Perform the calculations separately 
for each median and verify that these three points are all 
the same, with coordinates 

}(*i + x 2 + x 3 ) and 7 O 1 + y 2 + y 3 ). 

This proves that the medians of any triangle intersect at 
a point which is two-thirds of the way from each vertex 
to the midpoint of the opposite side. 


*A median of a triangle is a segment joining a vertex to the midpoint 
of the opposite side. 
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In this section we use the language of algebra to describe the set of all points 
that lie on a given straight line. This algebraic description is called the equation 
of the line. First, however, it is necessary to discuss an important preliminary 
concept: the slope of a line. 

THE SLOPE OF A LINE 

Any nonvertical straight line has a number associated with it that specifies its di¬ 
rection, called its slope. This number is defined as follows (Fig. 1.9 illustrates 
the definition). Choose any two distinct points on the line, say Pi = (x\ 9 yi) and 
P 2 = (;c 2 , yi)- Then the slope is denoted by m and defined to be the ratio 


If we reverse the order of subtraction in both numerator and denominator, then 
the sign of each is changed, so m is unchanged: 

m = T 2 -y\ = yi ~ yi 

X 2 -X] X\-X2 

This shows that the slope can be computed as the difference of the y-coordinates 
divided by the difference of the ^-coordinates — in either order, as long as both 
differences are formed in the same order. In Fig. 1.9, where P 2 is placed to the 
right of P\ and the line rises to the right, it is clear that the slope as defined by 
(1) is simply the ratio of the height to the base in the indicated right triangle. It 
is necessary to know that the value of m depends only on the line itself and is 
the same no matter where the points Pi and P 2 happen to be located on the line. 
This is easy to see by visualizing the effect of moving P\ and P 2 to different po¬ 
sitions on the line; this change gives rise to a similar right triangle and therefore 
leaves the ratio in (1) unaltered. 

If we choose the position of P 2 so that x 2 — x\ — 1, that is, if we place P 2 1 
unit to the right of Pi, then m = y 2 - y\. This tells us that the slope is simply 
the change in y as a point (x, y) moves along the line in such a way that x in¬ 
creases by 1 unit. This change in y can be positive, negative, or zero, depending 
on the direction of the line. We therefore have the following important correla¬ 
tions between the sign of m and the indicated directions: 



1.3 

SLOPES AND 
EQUATIONS OF 
STRAIGHT LINES 


Figure 1.9 
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Jt 

Figure 1.10 A variety of slopes. 


m > 0, 

line rises to the right; 

m < 0, 

line falls to the right; 

m = 0, 

line horizontal. 

Further, the absolute value of m is a measure of the steepness of the line (Fig. 
1.10). It is evident from (1) why a vertical line has no slope, for in this case the 


two points have equal jc-coordinates and the denominator in (1) is 0—and we 
know that division by 0 is undefined. 

If the line under discussion crosses the jc-axis, then the angle a from the pos¬ 
itive jc-direction to the line, measured counterclockwise, is called the inclination 
—or sometimes the angle of inclination —of the line. Students who have stud¬ 
ied trigonometry will see from Fig. 1.9 that the slope is the tangent of this an¬ 
gle, m — tan a. 


EQUATIONS OF A LINE 

A vertical line is characterized by the fact that all points on it have the same 
jc-coordinate. If the line crosses the jc-axis at the point (a, 0), then a point (jc, y ) 
lies on the line if and only if 


x = 


a , 


( 2 ) 


y 


(x, y) lies on line 
if and only if x - a 


( a , 0 ) 


x 


Figure 1.11 


as illustrated in Fig. 1.11. The statement that (2) is the equation of the line means 
precisely this: A point (jc, y) lies on the line if and only if condition (2) is satis¬ 
fied. 

Next consider a nonvertical line, and let it be “given” in the sense that we know 
a point (jco, yo) on it and its slope m (Fig. 1.12). If (jc, y) is a point in the plane 
that does not lie on the vertical line through (jco, yo), then it is easy to see that 
this point lies on the given line if and only if the line determined by (jco, yo) and 
(jc, y) has the same slope as the given line: 



This would be the equation of our line except for the minor flaw that the coor¬ 
dinates of the point (jco, yo)—which is certainly on the line—do not satisfy the 
equation (they reduce the left side to the meaningless expression 0/0). This flaw 
is easily removed by writing equation (3) in the form 

y “ yo = m(x - *o). (4) 

Nevertheless, we usually prefer the form (3), because its direct connection with 
the geometric idea illustrated in Fig. 1.12 makes it easy to remember. Either equa¬ 
tion (or both) is called the point-slope equation of a line, since the line is ini¬ 
tially specified by means of a known point on it and its known slope. To grasp 
more firmly the meaning of equation (4), imagine a point (jc, y) moving along 
the given line. As this point moves, its coordinates jc and y change; but even 
though they change, they are bound together by the fixed relationship expressed 
by equation (4). 
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If the known point on the line happens to be the point where the line crosses 
the y-axis, and if this point is denoted by (0, b\ then equation (4) becomes y — 
b = mx or 

y = mx + b. (5) 

The number b is called the y-intercept of the line, and (5) is called the slope- 
intercept equation of a line. This form is especially convenient because it tells at 
a glance the location and direction of a line. For example, if the equation 

6x — 2y — 4 = 0 ( 6 ) 

is solved for y, we see that 

y = 3x - 2. (7) 

Comparing (7) with (5) shows at once that m = 3 and b = —2, and so (6) and 
(7) both represent the line that passes through (0, —2) with slope 3. This infor¬ 
mation makes it very easy to sketch the line. It may seem that (6) and (7) are 
different equations, so that (6) should be referred to as “an” equation of the line 
and (7) as “another” equation of the line, but we prefer to regard them as merely 
different forms of a single equation. Many other forms are possible, for instance, 

y + 2 = 3x, x = yy + f, 3x - y = 2 . 

It is reasonable to cut through appearances and speak of any one of these as “the” 
equation of the line. 

More generally, every equation of the form 

Ax + By + C = 0, ( 8 ) 

where the constants A and B are not both zero, represents a straight line. For if 
B = 0, then A =£ 0, and the equation can be written as 

C 
A ’ 

which is clearly the equation of a vertical line. On the other hand, if B =£ 0, then 



and this equation has the form (5) with m = —A/B and b = —C/B. Equation (8) 
is rather inconvenient for most purposes because its constants are not directly re¬ 
lated to the geometry of the line. Its main merit is that it is capable of repre¬ 
senting all lines, without any need for distinguishing between the vertical and 
nonvertical cases. For this reason it is called the general linear equation. 

PARALLEL AND PERPENDICULAR LINES 

Two distinct nonvertical straight lines with slopes m\ and m 2 are evidently par¬ 
allel if and only if their slopes are equal: 

m \ = m2. 

The criterion for perpendicularity is the relation 


mi m 2 = —1. 


(9) 
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This is not obvious, but can be established quite easily by using similar trian¬ 
gles, as follows (Fig. 1.13). Suppose that the lines are perpendicular, as shown 
in Fig. 1.13. Draw a segment of length 1 to the right from their point of inter¬ 
section, and from its right endpoint draw vertical segments up and down to the 
two lines. From the meaning of the slopes, the two right triangles formed in this 
way have sides of the indicated lengths. Since the lines are perpendicular, the in¬ 
dicated angles are equal and the triangles are similar. This similarity implies that 
the following ratios of corresponding sides are equal: 

ni\ _ 1 

1 —m 2 

This is equivalent to (9), so (9) is true when the lines are perpendicular. The rea¬ 
soning given here is easily reversed, telling us that if (9) is true, then the lines 
are perpendicular. Since equation (9) is equivalent to 


m\ — -and 

m2 


m2 — — 


m\ ’ 



we see that two nonvertical lines are perpendicular if and only if their slopes are 
negative reciprocals of one another. 

The ideas of this section enlarge our supply of tools for proving geometric the¬ 
orems by algebraic methods. 

Example If the diagonals of a rectangle are perpendicular, then the rectangle is 
a square. To establish this, we place the rectangle in the convenient position shown 
in Fig. 1.14. The slopes of the diagonals are clearly b/a and — b/a. If these di¬ 
agonals are perpendicular, then 

— = 7 ", a 2 = b 2 > a 2 — b 2 — 0, and (a + b)(a — b) = 0. 

a b 

The last equation implies that a = b, so the rectangle is a square. 


PROBLEMS 

1 Plot each pair of points, draw the line they determine, 
and compute the slope of this line: 

(a) (-3, 1),(4, -1); (b) (2, 7), (-1, -1); 

(c) (-4,0), (2, 1); (d) (-4, 3), (5, -6); 

(e) (-5, 2), (7, 2); (f) (0,-4), (1, 6). 

2 Plot each of the following sets of three points, and use 
slopes to determine in each case whether all three points 
lie on a single straight line: 

(a) (5, — 1), (2, 2), ( — 4, 6); 

(b) (1, 1), (-5, -2), (5, 3); 

(c) (4, 3), (10, 14), (-2, -8); 

(d) (-1,3), (6, -1), (-9,7). 

3 Plot the points (-1, -1), (9, 1), (8, 6), and (-2, 4), and 
show that they are the vertices of a rectangle. 

4 Plot the points (-3, 8), (3, 5), (0, -1), and (-6, 2), and 
show that they are the vertices of a square. 


5 Plot each of the following sets of three points, and use 
slopes to determine in each case whether the points form 
a right triangle: 

(a) (2, -3), (5, 2), (0, 5); 

(b) (10, -5), (5, 4), (-7, -2); 

(c) (8,2), (-1, -1), (2, -7); 

(d) (-2, 6), (3, -4), (8, 11). 

6 Write the equation of each line in Problem 1 using the 
point-slope form; then rewrite each of these equations in 
the form y = mx + b and find the y-intercept. 

7 Find the equation of the line: 

(a) through (2, —3) with slope —4; 

(b) through (-4, 2) and (3, -1); 

(c) with slope f and y-intercept —4; 

(d) through (2, —4) and parallel to the Jt-axis; 

(e) through (1,6) and parallel to the y-axis; 
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(f) through (4, -2) and parallel to x + 3y = 7; 

(g) through (5,3) and perpendicular to y 4- 7 = 2x; 

(h) through (—4, 3) and parallel to the line determined 
by (-2, -2) and (1, 0); 

(i) that is the perpendicular bisector of the segment join¬ 
ing (1, -1) and (5, 7); 

(j) through (-2, 3) with inclination 135°. 

8 If a line crosses the x-axis at the point ( a , 0), the num¬ 
ber a is called the x-intercept of the line. If a line has 
x-intercept a =£ 0 and y-intercept b =£ 0, show that its 
equation can be written as 



This is called the intercept form of the equation of a line. 
Notice that it is easy to put y = 0 and see that the line 
crosses the x-axis at x = a, and to putx = 0 and see that 
the line crosses the y-axis at y = b. 

9 Put each equation in intercept form and sketch the cor¬ 
responding line: 

(a) 5x + 3y + 15 = 0; (b) 3x = 8y - 24; 

(c) y = 6 - 6x; (d) 2x - 3y = 9. 

10 The set of all points (x, y) that are equally distant from 
the points P\ = (-1, -3) and Pi = (5, -1) is the per¬ 
pendicular bisector of the segment joining these points. 
Find its equation 

(a) by equating the distances from (x, y) to P\ and Pi, 
and simplifying the resulting equation; 

(b) by finding the midpoint of the given segment and us¬ 
ing a suitable slope. 


11 Sketch the lines 3x + 4y = 7 and x - 2y = 6, and find 
their point of intersection. Hint: Their point of intersec¬ 
tion is that point (x, y) whose coordinates satisfy both 
equations simultaneously. 

12 Find the point of intersection of each of the following 
pairs of lines: 

(a) 2x + 2y = 2, y = x - 1; 

(b) lOx + 7y = 24, 15x-4y = 7; 

(c) 3x — 5y = 7, 15y + 25 = 9x. 

13 Let F and C denote temperature in degrees Fahrenheit 
and degrees Celsius. Find the equation connecting F and 
C, given that it is linear and that F = 32 when C = 0, 
F = 212 when C = 100. 

14 Find the values of the constant k for which the line 
(* - 3)* - (4 - k 2 )y + £ 2 - 7* + 6 = 0 is 

(a) parallel to the x-axis; 

(b) parallel to the y-axis; 

(c) through the origin. 

15 Show that the segments joining the midpoints of adja¬ 
cent sides of any quadrilateral form a parallelogram. 

16 Show that the lines from any vertex of a parallelogram 
to the midpoints of the opposite sides trisect a diagonal. 

17 Let (0, 0), {a, 0), and ( b, c) be the vertices of an arbi¬ 
trary triangle placed so that one side lies along the pos¬ 
itive x-axis with its left endpoint at the origin. If the 
square of this side equals the sum of the squares of the 
other two sides, use slopes to show that the triangle is a 
right triangle. Thus, the converse of the Pythagorean the¬ 
orem is also true. 


The coordinate plane or xy-plane is often called the Cartesian plane, and x and 
y are frequently referred to as the Cartesian coordinates of the point P = (x, y). 
The word “Cartesian” comes from Cartesius, the Latinized name of the French 
philosopher-mathematician Descartes, who is considered one of the two princi¬ 
pal founders of analytic geometry.* The basic idea of this subject is quite sim¬ 
ple: Exploit the correspondence between points and their coordinates to study 
geometric problems—especially the properties of curves—with the tools of al¬ 
gebra. The reader will see this idea in action throughout this book. Generally 
speaking, geometry is visual and intuitive, while algebra is rich in computational 
machinery, and each can serve the other in many fruitful ways. 

Most people who have had a course in algebra have learned that an equation 
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F(x, y) = 0 


( 1 ) 


usually determines a curve (its graph ) which consists of all points P = (x, y) 
whose coordinates satisfy the given equation. Conversely, a curve defined by 
some geometric condition can usually be described algebraically by an equation 


*The other (also French) was Fermat, a less well known figure than Descartes but a much greater 
mathematician. The names of these two men are pronounced “Fair-MA” and “Day-CART.” 
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of the form (1). It is intuitively clear that straight lines are the simplest curves, 
and our work in Section 1.3 demonstrated that straight lines in the coordinate 
plane correspond to linear equations in x and y. We now develop algebraic de¬ 
scriptions of several other curves that will be useful as illustrative examples in 
the next few chapters. 


CIRCLES 



The distance formula of Section 1.2 is often useful in finding the equation of a 
curve whose geometric definition depends on one or more distances. 

One of the simplest curves of this kind is a circle , which can be defined as the 
set of all points at a given distance (the radius) from a given point (the center). 
If the center is the point ( h, k) and the radius is the positive number r (Fig. 1.15), 
and if (x, y ) is an arbitrary point on the circle, then the defining condition says 
that 

V(* — h ) 2 + (y — k) 2 = r. 

It is convenient to eliminate the radical sign by squaring, which yields 

(x - h) 2 + (y - k) 2 = r 2 . (2) 

This is therefore the equation of the circle with center (/z, k) and radius r. In par¬ 
ticular, if the center happens to be the origin, so that h = k = 0, then 

Jt 2 + y 2 = r 2 

is the equation of the circle. 


Example 1 If the radius of a circle is VTo and its center is ( — 3, 4), then its 
equation is 

(x + 3) 2 + (y - 4) 2 = 10. 

Notice that the coordinates of the center are the numbers subtracted from jc and 
y in the parentheses. 



Example 2 An angle inscribed in a semicircle is necessarily a right angle.* To 
prove this algebraically, let the semicircle have radius r and center at the origin 
(Fig. 1.16), so that its equation is x 2 + y 2 = r 2 with y > 0. The inscribed angle 
is a right angle if and only if the product of the slopes of its sides is — 1, that is, 


This is easily seen to be equivalent to x 2 + y 2 = r 2 , which is certainly true for 
any point (x, y) on the semicircle, so (3) is true and the angle is a right angle. 


It is clear that any equation of the form (2) is easy to interpret geometrically. 
For instance, 


"According to tradition, this is one of the theorems discovered and proved by Thales. 
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(jc - 5) 2 + (y + 2) 2 = 16 (4) 

is immediately recognizable as the equation of the circle with center (5, —2) and 
radius 4, and this information enables us to sketch the graph without difficulty. 
However, if the equation has been roughly treated by someone who likes to “sim¬ 
plify” things algebraically, then it might have the form 

jc 2 + y 2 - 1 Ojc + 4y + 13 = 0. (5) 

This is an equivalent but scrambled version of (4), and its constants tell us noth¬ 
ing directly about the nature of the graph. To find out what the graph is, we must 
“unscramble” by completing the square .* To do this, we begin by rewriting equa¬ 
tion (5) as 

(jc 2 — 1 Ojc -h ) + (y? + 4y + ) = -13, 

with the constant term moved to the right and blank spaces provided for the in¬ 
sertion of suitable constants. When the square of half the coefficient of x is added 
in the first blank space and the square of half the coefficient of y in the second, 
and the same constants are added to the right side to maintain the balance of the 
equation, we get 

(jc 2 - 1 Ojc + 25) + (y 2 + 4y + 4) = -13 + 25 4- 4 


or 

(jc - 5) 2 + (y + 2) 2 = 16. (6) 

Exactly the same process can be applied to the general equation of the form (5), 
namely, 

jc 2 + y 2 + Ax + By + C = 0, (7) 

but there is little to be gained by writing out the details in this general case. How¬ 
ever, it is important to notice that if the constant term 13 in (5) is replaced by 
29, then (6) becomes 

(jc - 5)2 + (y + 2)2 = 0, 

whose graph is the single point (5, —2). Similarly, if this constant term is re¬ 
placed by any number greater than 29, then the right-hand side of (6) becomes 
negative and the graph is empty, in the sense that there are no points (jc, y) in the 
plane whose coordinates satisfy the equation. We therefore see that the graph of 
(7) is sometimes a circle, sometimes a single point, and sometimes empty—de¬ 
pending entirely on the constants A, B, and C. 

PARABOLAS 

The definition we use for a parabola is the following (Fig. 1 Ala): It is the curve 
consisting of all points that are equally distant from a fixed point F (called the 
focus) and a fixed line d (called the directrix). The distance from a point to a line 
is always understood to mean the perpendicular distance. 


*The form of the equation (jc + a) 2 = jc 2 + lax + a 2 is the key to the process of completing the 
square. Notice that the right side is a perfect square — the square of jc + a — precisely because its 
constant term is the square of half the coefficient of jc. 
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Figure 1.17 Parabola. 




To find a simple equation for a parabola, we place it in the coordinate system 
as shown in Fig. 1.17b, with the focus and directrix equally far above and below 
the x-axis. The line through the focus perpendicular to the directrix is called the 
axis of the parabola; this is the axis of symmetry of the curve, and is the y-axis 
in the figure. The point on the axis halfway between the focus and the directrix 
is called the vertex of the parabola; in the figure this point is the origin. If ( x , y ) 
is an arbitrary point on the parabola, the condition expressed in the definition is 
stated algebraically by the equation 

Vjt? + (y-p? = y + p. (8) 

On squaring both sides and simplifying, we obtain 

X 2 + y 2 - 2 py +p 2 = y 2 + 2 py + p 2 


or 

X 2 = Apy. (9) 

These steps are reversible, so (8) and (9) are equivalent and (9) is the equation 
of the parabola whose focus and directrix are located as shown in Fig. 1.17b. No¬ 
tice particularly that the positive constant p in (9) is the distance f rom the focus 
to the vertex, and also from the vertex to the directrix. 

If we change the position of the parabola relative to the coordinate axes, we 
naturally change its equation. Three other positions are shown in Fig. 1.18, each 
with its corresponding equation and with p > 0 in each case. Students should 





Figure 1.18 Various parabolas. 


x 2 = —4 py 


y 2 - 4px 


y 2 = —4 px 
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verify the correctness of all three equations. We also point out that each of these 
four equations can be put in the form 

y = ax 2 (10) 


These forms conceal the constant p, with its geometric significance, but as com¬ 
pensation they are more useful in visualizing the overall appearance of the graph. 
For instance, in (10) the variable jc is squared but y is not. This tells us that as a 
point ( x , y) moves out along the curve, y increases much faster than x, and so the 
curve opens in the y- direction—upward or downward, according as a is positive 
or negative. It also tells us that the graph is symmetric with respect to the y-axis, 
because x is squared, and therefore we get the same number y for any number x 
and its negative. 

Example 3 What is the graph of the equation 1 2x + y 2 = 0? If this is put in the 
form y 2 = —I2x and compared with the equation on the right in Fig. 1.18, it is 
clear that the graph is a parabola with vertex at the origin and opening to the left. 
Since \p = 12 and therefore p — 3, the point (—3, 0) is the focus and x = 3 is 
the directrix. 


Example 4 The graph of y = lx 2 is evidently a parabola with vertex at the ori¬ 
gin and opening upward. To find its focus and directrix, the equation must be 
rewritten as x 2 = and compared with equation (9). This yields Ap = so p = 
The focus is therefore (0, {), and the directrix is y = — 


We illustrate one last point about parabolas by examining the equation 

y = x 2 — Ax + 5. (11) 

If this is written as 

y — 5 = x 2 — Ax, 

and if we complete the square on the terms involving x, then the result is 

y - 1 = (x - 2) 2 . (12) 

If we now introduce the new variables 

X = x - 2, 

Y= y- 1, 

then equation (12) becomes 

Y= X 2 . 


(13) 


The graph of this equation is clearly a parabola opening upward with vertex at 
the origin of the XY coordinate system. By equations (13), the origin in the XY 
system is the point (2, 1) in the ry system, as shown in Fig. 1.19. What has hap¬ 
pened here is that the coordinate system has been shifted or translated to a new 
position in the plane, and the axes renamed, and equations (13) express the re- 


Figure 
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lation between the coordinates of an arbitrary point with respect to each of the 
two coordinate systems. In exactly the same way, any equation of the form 

y = ax 2 + bx + c, a =£ 0, 

represents a parabola with vertical axis which is congruent to y = ax 2 and opens 
up or down according as the number a is positive or negative. Similarly, the equa¬ 
tion 

x = ay 2 + by + c, a 0, 

represents a parabola with horizontal axis which opens to the right or left ac¬ 
cording as a > 0 or a < 0. 

In our work up to this stage we have used the static concept of a curve as a 
certain set of points or geometric figure. It is often possible to adopt the dynamic 
point of view, in which a curve is thought of as the path of a moving point. For 
instance, a circle is the path of a point that moves in such a way that it main¬ 
tains a fixed distance from a given point. When this mode of thought is used— 
with its advantage of greater intuitive vividness—a curve is often called a locus. 
Thus, a parabola is the locus of a point that moves in such a way that it main¬ 
tains equal distances f rom a given point and a given line. 


JSgggJ NOTE ON DESCARTES AND 

— Are there some people among us who feel 
that what passes for “knowledge” in our time is an uncriti¬ 
cal mishmash of sense and nonsense, fact and guesswork, 
gossip and hearsay and clumsy propaganda—mostly ac¬ 
quired from wishful thinking, lazy reasoning, inadequate 
senses, credulous parents, overworked teachers, and self- 
serving institutions? This was also the opinion of the 23- 
year-old Frenchman Rene Descartes (1596-1650) on Nov. 
10, 1619. For this was the day above all others when the 
modem world began, our world of victorious rationality and 
triumphant science. 

On this day — a famous day in the history of thought— 
in a state of exhaustion and feverish excitement, Descartes 
found the method he sought for extending the certainty of 
mathematics to all other fields of knowledge: 

The long chains of simple reasoning which geometers use 
to arrive at their most difficult conclusions made me be¬ 
lieve that all things which are the objects of human knowl¬ 
edge are similarly interdependent; and that if we will only 
abstain from assuming something to be true which is not, 


FERMAT 

and always follow the necessary order in deducing one 
thing from another, there is nothing so remote that we 
cannot reach it, nor so hidden that we cannot discover it. 

This is a quotation from Part 2 of his Discourse on Method, 
a short and highly readable book published in 1637 which 
is commonly considered to mark the birth of modern phi¬ 
losophy. In this work he rejected the sterile scholasticism 
prevailing at the time and set himself the task of rebuilding 
knowledge from the ground up, on a foundation of reason 
and science instead of authority and faith. He provided the 
fresh points of view needed for the vigorous development 
of the Scientific Revolution, whose influence has been the 
dominant fact of modem history. Further, in an appendix to 
the Discourse on his ideas about geometry, he foreshadowed 
the new forms of mathematics—analytic geometry and cal¬ 
culus—without which this Revolution would have died in 
infancy. It was no exaggeration for the great American ju¬ 
rist Oliver Wendell Holmes to write: “Descartes commanded 
the future from his study more than Napoleon from his 
throne.” 
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Descartes was a brilliant man—and enormously influen¬ 
tial with a corresponding ego—but he was not quite as bril¬ 
liant as he thought. His contemporary Pierre Fermat 
(1601-1665) was a man of genius and perhaps the greatest 
mathematician of the seventeenth century; and when the two 
men collided on issues of science or mathematics, it was al¬ 
ways Descartes’s nose that was bloodied. 

By profession Fermat was a lawyer and a member of the 
provincial supreme court in Toulouse, a city in southwest¬ 
ern France. However, his hobby and private passion was 
mathematics, and his casual creativity was one of the won¬ 
ders of the age to the few who knew about it. His letters 
suggest that he was a shy and retiring man, courteous anil 
affable but slightly remote. His outward life was as quiet 
and orderly as one would expect of a provincial judge with 
a sense of responsibility toward his work. Fortunately this 
work was not too demanding, and left ample leisure for the 
extraordinary inner life that flourished by lamplight in the 
silence of his study at night. 

He invented analytic geometry in 1629 and described his 
ideas in a short work that circulated in manuscript from early 
1637 on, but was not published in his lifetime. The credit 
for this achievement has usually been given to Descartes on 
the basis of his Geometry, which was published late in 1637 
as an appendix to his Discourse on Method . However, noth¬ 
ing that we would recognize as analytic geometry can be 
found in Descartes’s essay, except perhaps the idea of using 
algebra as a language for discussing geometric problems. 
Fermat had the same idea but did something important with 
it: He introduced perpendicular axes and found the general 
equations of straight lines and circles and the simplest equa¬ 
tions of parabolas, ellipses, and hyperbolas; and he further 
showed in a fairly complete and systematic way that every 
first- or second-degree equation can be reduced to one of 
these types. Descartes certainly knew some analytic geom¬ 
etry by the late 1630s; but since he had possession of the 
original manuscript of Fermat’s short essay (of which Fer¬ 
mat himself did not bother to keep a copy) several months 
before the publication of his own Geometry, it is likely that 
much of what he knew he learned from Fermat. 

The invention of calculus is usually credited to Newton 
and Leibniz, whose ideas and methods were not published 
until about 20 years after Fermat’s death. However, if dif¬ 
ferential calculus is considered to be the mathematics of 
finding maxima and minima of functions and drawing tan¬ 
gents to curves, then Fermat was the true creator of this sub¬ 
ject as early as 1629, more than a decade before either New¬ 
ton or Leibniz was bom. With his usual honesty in such 
matters, Newton stated—in a letter that was discovered only 
in 1934—that his own early ideas about calculus came di¬ 
rectly from “Fermat’s way of drawing tangents.” 


Fermat was also the founder of mathematical optics and 
the joint founder (in correspondence with Blaise Pascal) of 
the theory of probability. But to him all these activities were 
of minor importance compared with the consuming passion 
of his life, the theory of numbers. It was here that his ge¬ 
nius shone most brilliantly, for his insight into the proper¬ 
ties of the familiar but mysterious positive integers has per¬ 
haps never been equaled. He was the sole and undisputed 
founder of the modem era in this important branch of pure 
mathematics, without any rivals and with few followers un¬ 
til the next century. 

To illustrate the nature of his achievement in number the¬ 
ory, we mention his profound and beautiful foursquares the¬ 
orem: Every positive integer is either a square or the sum of 
two, three, or four squares. Like many of his discoveries, 
this was jotted down in the margin of one of his books, and 
his proof went unrecorded and was lost forever when he 
died. A proof was found at last in 1772—more than a cen¬ 
tury after Fermat’s death—as the culmination of 40 years 
of effort by one of the greatest mathematicians of the eigh¬ 
teenth century. As we see, mathematicians are people who 
are not only irresistibly attracted by truths of this kind but 
also cannot rest until they know why they are true. 

Without visibly trying, and as naturally as a hawk sus¬ 
tains itself on the wind, Fermat attained immortal fame 
among mathematicians. There are many reasons for this im¬ 
mortality, one of the most interesting being the legacy of 
what is now known as Fermat's last theorem: If n > 2, then 
the equation x n + y n = z n has no positive integer solutions 
x, y, z. Again, he wrote this statement in the margin of a 
book he was studying, near a passage dealing with the fact 
that* 2 + y 1 = z 2 has many solutions—3, 4, 5 and 5, 12, 13, 
among others. He then added the tantalizing remark, “I have 
found a truly wonderful proof which this margin is too nar¬ 
row to contain.” Unfortunately no proof has ever been dis¬ 
covered by anyone else, and Fermat’s last theorem remains 
to this day one of the most baffling unsolved problems of 
mathematics.* 


*Late report from the cutting edge: It appears that Fermat’s last the¬ 
orem may have been proved by Andrew Wiles of Princeton Uni¬ 
versity. This was announced on June 23, 1993, in the last of three 
lectures Wiles gave at Cambridge University, in England. The proof 
is about 200 pages long and follows a tortuous, roundabout path 
through many tangled jungles of sophisticated pure mathematics. 
The careful checking of every line of this proof may take years to 
carry out. It is estimated that perhaps a tenth of I percent of math¬ 
ematicians could understand all details of the proof—and this def¬ 
initely does not include the present writer. If Wiles’s proof checks 
out, the challenge will still remain of discovering a one- or two- 
page (or even a three- or four-page) proof of Fermat’s one-sentence 
theorem. For further details, see Newsweek , July 5, 1993, or Scien¬ 
tific American, September 1993. 
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PROBLEMS 

1 Find the equation of the circle with the given point as 
center and the given number as radius: 

(a) (4, 6), 3; (b) (-3, 7), V5; 

(c) (-5,-9), 7; (d) (1,-6), V2; 

(e) (a, 0), a\ (f) (0, a), a. 

2 In each case find the equation of the circle determined 
by the given conditions: 

(a) Center (2, 3) and passes through ( — 1, —2). 

(b) The ends of a diameter are (—3, 2) and (5, — 8). 

(c) Center (4, 5) and tangent to the Jt-axis. 

(d) Center (-4, 1) and tangent to the line jc = 3. 

(e) Center (-2, 3) and tangent to the line 4y - 3 jc + 2 = 
0. 

(f) Center on the line jc + y = 1, passes through ( — 2, 1) 
and (-4, 3). 

(g) Center on the line y = 3x and tangent to the line jc = 
2 y at the point (2, 1). 

3 In each of the following, determine the nature of the 
graph of the given equation by completing the square: 

(a) jc 2 + y 2 - 4jc - Ay = 0. 

(b) jc 2 + y 2 - 18jc - 14 y + 130 = 0. 

(c) jc 2 + y 2 + 8jc + lOy + 40 = 0. 

(d) 4x 2 + 4y 2 + 12* - 32y + 37 = 0. 

(e) jc 2 + y 2 - 8jc + 12y + 53 =0. 

(f) X 2 + y 2 - V2x + Vly +1=0. 

(g) x 2 + y 2 - 16* + 6y - 48 = 0. 

4 Find the equation of the locus of a point P = (jc, y) that 
moves in accordance with each of the following condi¬ 
tions, and sketch the graphs: 

(a) The sum of the squares of the distances from P to 
the points (a, 0) and ( — a , 0) is 4b 2 , where b > 
a/V2 > 0. 

(b) The distance of P from the point (8, 0) is twice its 
distance from the point (0, 4). 

5 The quadratic formula for the roots of the quadratic equa¬ 
tion ax 2 + bx + c = 0 is 

—b ± V/? 2 — 4ac 
X ~ la ' 

Derive this formula from the equation by dividing 
through by a, moving the constant term to the right side, 
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and completing the square. Under what circumstances 
does the equation have distinct real roots, equal real roots, 
and no real roots? 

At what points does the circle jc 2 + y 2 — 8jc — 6y — 
11=0 intersect 

(a) the jc-axis? (b) the y-axis? 

(c) the line jc + y = 1 ? 

Sketch the figure, and use this picture to judge whether 
your answers are reasonable or not. 

Find the equations of all lines that are tangent to the cir¬ 
cle jc 2 + y 2 = 2y and pass through the point (0, 4). Hint: 
The line y = mx + 4 is tangent to the circle if it inter¬ 
sects the circle at only one point. 

Find the focus and directrix of each of the following 
parabolas, and sketch the curves: 

(a) y 2 = 12*; (b) y = 4jc 2 ; 

(c) 2jc 2 + 5y = 0; (d) 4jc + 9y 2 = 0; 

(e) jc = — 2y 2 ; (f) 12y = -jc 2 ; 

(g) 16y 2 = x, (h) 14x 2 = y; 

(i) y 2 + 8y - 16jc = 16; (j) jc 2 + 2jc + 29 = 7y. 

Sketch the parabola and find its equation if it has 

(a) vertex (0, 0) and focus (—3, 0); 

(b) vertex (0, 0) and directrix y = — 1; 

(c) vertex (0, 0) and directrix jc = — 2; 

(d) vertex (0, 0) and focus (0, —y); 

(e) directrix jc = 2 and focus ( — 4, 0); 

(f) focus (3, 3) and directrix y = -1. 

Find the focus and directrix of each of the following 
parabolas, and sketch the curves: 

(a) y = x 2 + 1; (b) y = (jc — l) 2 ; 

(c) y = (jc — l) 2 + 1; (d) y = jc 2 — jc. 

Water squirting out of a horizontal nozzle held 4 ft above 
the ground describes a parabolic curve with the vertex at 
the nozzle. If the stream of water drops 1 ft in the first 
10 ft of horizontal motion, at what horizontal distance 
from the nozzle will it strike the ground? 

Show that there is exactly one line with given slope m 
which is tangent to the parabola jc 2 = 4 py, and find its 
equation. 

Prove that the two tangents to a parabola from any point 
on the directrix are perpendicular. 


1.5 

THE CONCEPT OF 
A FUNCTION 


The most important concept in all of mathematics is that of a function. No mat¬ 
ter what branch of the subject we consider—algebra, geometry, number theory, 
probability, or any other—it almost always turns out that functions are the pri¬ 
mary objects of investigation. This is particularly true of calculus, in which most 
of our work will be concerned with constructing machinery for the study of func¬ 
tions and applying this machinery to problems in science and geometry. 
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What is a function? Briefly—and we expand on this below—if jc and y are 
two variables that are related in such a way that whenever a permissible numer¬ 
ical value is assigned to x, there is determined one and only one corresponding 
numerical value for y, then y is called a function of x. 

Example 1 (a) If a rock is dropped from the edge of a cliff, and it falls s feet 

in t seconds, then s is a function of t. It is known from experiment that (approx¬ 
imately) s = 16r 2 . 

(b) The area A of a circle is a function of its radius r. It is known from geom¬ 
etry that A = 77 -r 2 . 

(c) If the manager of a bookstore buys n books from a publisher at $12 per 
copy and the shipping charges are $35, then his cost C for these books is a func¬ 
tion of n given by the formula C = \2n + 35. 


We continue building our understanding of the concept of a function by con¬ 
sidering an example directly related to our work in the preceding section. 


Example 2 We examine the equation 


and its corresponding graph, which we know is a parabola that opens upward 
and has its vertex at the origin (Fig. 1.20). In Section 1.4 we thought of this equa¬ 
tion as a relation between the variable coordinates of a point (jc, y) moving along 
the curve. We now shift our point of view, and instead think of it as a formula 
that provides a mechanism for calculating the numerical value of y when the nu¬ 
merical value of jc is given. Thus, y = 1 when a = 1, y = 4 when x = 2, y = } 
when jc = j, y = 1 when jc = — 1, and so on. The value of y is therefore said to 
depend on , or to be a function of the value of jc. This dependence can be ex¬ 
pressed in functional notation by writing 

y = f(x) where f(x) = x 2 . 

The symbol /(jc) is read “/ of jc,” and the letter /represents the rule or process— 
squaring, in this particular case — which is applied to any number jc to yield the 
corresponding number y. The numerical examples just given can therefore be 
written as/(l) = l,/(2) = 4,/(y) = j, and/(— 1) = 1. The meaning of this no¬ 
tation can perhaps be further clarified by observing that 

/(jc + 1) = (jc + l) 2 = JC 2 + 2jc + 1 and /(jc 3 ) = (jc 3 ) 2 = jc 6 ; 



Figure 1.20 


that is, the rule /simply produces the square of whatever quantity follows it in 
parentheses. 


This example suggests the general concept of a function as we shall use it in 
most of our work. We formulate this concept as follows. 

Let D be a given set of real numbers. A function f defined on D is a formula, 
or rule, or law of correspondence that assigns a single real number y to each num¬ 
ber jc in D. The set D of allowed values of jc is called the domain (or domain of 
definition) of the function, and the set of corresponding values of y is called its 
range. The number y that is assigned to x by the function/is usually written/(jc) 
—so that y = /(jc) —and is called the value of/at jc. It is customary to call jc the 








24 


NUMBERS. FUNCTIONS. AND GRAPHS 


independent variable because it is free to assume any value in the domain, and 
to call y the dependent variable because its numerical value depends on the choice 
of jc. 

There is nothing illegal or immoral about using other letters than jc and y to 
denote the variables. In Example 1, for instance, the independent variables are t, 
r, and n , and the dependent variables are 5, A, and C. Also, as we see in the next 
example, there is nothing sacred about the letter / and other letters can be used 
to designate functions. 

Example 3 (a) If a function /(jc) is defined by the formula /(jc) = x 3 - 3jc 2 + 

5, then /(2) = 2 3 - 3 • 2 2 + 5 = 1, /(0) = 5, and /(-2) = (-2) 3 - 3(-2) 2 + 
5 = -15. 

(b) I f a function g(jc) i s defined by the formula g(x) — V^c, then g( 1) = vT = 
1, g( 4) = V4 = 2, and a calculator tells us thatg(lO) = VTo = 3.16227766017, 
approximately. In this case the only allowed values of jc are those for which jc > 
0, because square roots of negative numbers are not real numbers. 

(c) If a function h(x) is defined by the formula h(x) = 1/(4 — jc), then h( 1) = 
1/(4 - 1) = i, h(2) = 1/(4 - 2) = and h( 4) = 1/(4 — 4) = ^ does not exist, 
because division by zero is not permitted in algebra. Thus, jc = 4 is the only value 
of jc that is not allowed. 


We point out that a function is not fully known until we know precisely which 
real numbers are permissible values for the independent variable jc. The domain 
is therefore an indispensable part of the concept of a function. In practice, how¬ 
ever, most of the specific functions we deal with are defined only by formulas 
like the ones in Example 3, and nothing is said about the domain. Unless we state 
otherwise, the domain of such a function is understood to be the set of all real 
numbers jc for which the formula makes sense. In part (a) of Example 3, this 
means all real numbers; in (b), all real numbers jc > 0; and in (c), all real num¬ 
bers except jc = 4. 

The reader is undoubtedly acquainted with the idea of the graph of a function 
/: If we imagine the domain D spread out on the jc-axis in the coordinate plane 
(Fig. 1.21a), then to each number jc in D there corresponds a number y = /(jc), 
and the set of all the resulting points (jc, y) in the plane is the graph. Graphs are 
pictures of functions that enable us to see these functions in their entirety, and 
we will examine many in the next section. 

Many people find it helpful to visualize a function by means of a machine di¬ 
agram, as shown in Fig. 1.21 b. Here a number jc in the domain is fed into the 
machine, where it is acted upon by the specific instructions built into the func¬ 
tion / and this action produces the resulting number /(jc). The domain is the set 
of all permissible inputs jc, and the range is the set of all outputs /(jc). 

Another way to picture a function is by an arrow diagram, in which the do¬ 
main is thought of as a certain set of points on the page and the range as another 
set of points (Fig. 1.21c). The arrow shows that jc has/(jc) corresponding to it, 
and the function/is the complete collection of all these correspondences thought 
of as a mapping of the first set onto the second. 

We mention machine diagrams and arrow diagrams only to help students who 
may be having difficulty grasping the concept of a function. The basic tool for 
visualizing functions throughout our work will always be graphs. Also, we will 
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Figure 1.21 


(a) 


(b) 


(c) 


see in Section 2.1 that graphs are essential for formulating the main purposes of 
calculus. 

Originally, the only functions mathematicians considered were those defined 
by formulas. This led to the useful intuitive idea that a function /“does some¬ 
thing” to each number x in its domain to “produce” the corresponding number 
y = fix). Thus, if 

y = fix) = (x 3 + 4) 2 , 


then y is the result of applying certain specific operations to jc: Cube it, add 4, 
and square the sum. On the other hand, the following is also a perfectly legiti¬ 
mate function which is defined by a verbal prescription instead of a formula: 

- . f 1 if ;t is a rational number, 

[0 if x is an irrational number. 

All that is really required of a function is that y be uniquely determined—in any 
manner whatever—when x is specified; beyond this, nothing is said about the 
nature of the rule / In discussions that focus on ideas instead of specific func¬ 
tions, such broad generality is often an advantage. We will understand this bet¬ 
ter in Chapter 6, where one of our problems is to discover what conditions must 
be imposed on an arbitrary function to guarantee that its integral exists. 

An additional remark on usage is perhaps in order. Strictly speaking, the word 
“function” refers to the rule of correspondence / that assigns a unique number 
y = fix) to each number jc in the domain. Purists are fond of emphasizing the 
distinction between the function/and its value fix) at x. However, once this dis¬ 
tinction is clearly understood, most people who work with mathematics prefer to 
use the word loosely and speak of “the function y =/(*),” or even “the function 

f(x)r 

The functions we work with in calculus are often composite (or compound) 
functions built up out of simpler ones. As an illustration of this idea, consider 
the two functions 


fix) = x 2 + 3x and g(jt) = x 2 — 1. 
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The single function that results from first applying g to jc and then applying/to 
g(x) is 

/(£(*)) =/U 2 - 1) = (jc 2 - l) 2 + 3 (jc 2 - 1) 

= x 4 + jc 2 - 2. 

Notice that/(jc 2 - 1) is obtained by replacing jc by the entire quantity jc 2 — 1 in 
the formula/(jc) = jc 2 + 3jc. The symbol /(g(jc)) is read ‘/of g of jc” and is called 
a function of a function. If we apply the functions in the other order (first / then 
g), we have 

£(/(*)) = 8(x 2 + 3x) = (x 2 + 3jc) 2 - 1 
= jc 4 + 6jc 3 + 9jc 2 - 1, 

so f(g(x)) and g(/(jc)) are different. In special cases it can happen that/(g(jc)) 
and g(/(jc)) are the same function of jc; for example, if/(jc) = 2jc — 3 and g(x) = 
—jc + 6: 

f(gM) =f(-x + 6) = 2(-x + 6) - 3 = -2jc + 9, 
g(f(x)) = g{2x - 3) = — (2jc - 3) + 6 = -2jc + 9. 

In each of these examples two given functions are combined into a single com¬ 
posite function. In most practical work we proceed in the other direction, and 
dissect composite functions into their simpler constituents. For example, if 

y = (jc 3 + l) 7 , 

we can introduce an auxiliary variable u by writing u = x 3 + 1 and decompose 
the above function into the two simpler functions 

y = iP and u = x 3 + 1. 

We shall see that decompositions of this kind are often useful in the problems of 
calculus. 

In practice, functions often arise from algebraic relations between variables. 
Thus, an equation involving jc and y determines y as a function of jc if the equa¬ 
tion is equivalent to one that expresses y uniquely in terms of jc. For example, the 
equation 4x + 2y = 6 can be solved for y, y = 3 — 2jc, and this second equation 
defines y as a function of jc. However, in some cases it happens that the process 
of solving for y leads to more than one value of y. For example, if the equation 
is y 2 = jc, we get y = ±Vjc. Since this gives two values of y for each positive 
value of jc, the equation y 2 = x does not by itself determine y as a function of jc. 
If we wish, we can split the formula y = ± Vjc into two separate formulas, y = 
Vjc and y = — Vjc. Each of these formulas defines y as a function of jc, so that 
out of one equation we obtain two functions. 

The number of distinct individual functions is clearly unlimited. However, most 
of those appearing in this book are relatively simple and can be classified into a 
few convenient categories. It may help students to orient themselves if we give 
a rough description of these categories in order of increasing complexity. 

POLYNOMIALS 

The simplest functions are the powers of jc with nonnegative integer exponents, 

1 , X, X 2 , X 3 , . • . , x n , .... 
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If a finite number of these are multiplied by constants and the results are added, 
we obtain a general polynomial, 

p(x) = Clp + Cl\X “1" 02-* 2 + Q?>X 3 + ■ ■ ■ + a n X n . 

The degree of a polynomial is the largest exponent that occurs in it; if a n 0, 
the degree of p(x) is n. The following are polynomials of degrees 1, 2, and 3: 

y = 3 jc - 2, y = 1 - 2x 4- x 2 , y = x-x 3 . 

Polynomials can evidently be multiplied by constants, added, subtracted, and 
multiplied together, and the results are again polynomials. 


RATIONAL FUNCTIONS 

If division is also allowed, we pass beyond the polynomials into the more in¬ 
clusive class of rational functions, such as 

x X + 2 jc 3 - 4x 2 + X + 6 J_ 

Jt 2 +r x - 2 ’ jc 2 H- jc H- 1 ’ * Jt' 

The general rational function is a quotient of polynomials, 

ap + a\x + fl 2 * 2 + • ' • + a n x n 

bp + b\x -I- bjx 1 + ■ ■ • + b m x m * 

and a specific function is rational if it is (or can be expressed as) such a quo¬ 
tient. If the denominator here is a nonzero constant, this quotient is itself a poly¬ 
nomial. Thus, the polynomials are included among the rational functions. 


ALGEBRAIC FUNCTIONS 

If root extractions are also allowed, we pass beyond the rational functions into 
the larger class of algebraic functions, which will be properly defined in a later 
chapter. Some simple examples are 

y = vT y = x + Vx^Th y=^=- y = </^\- 

If we replace the root symbols by fractional exponents in accordance with the 
rules of algebra, then these functions can be written 

lx T 1 \l/4 

y = x ]l2 , y = x T (jc 2 + 1) 1/3 , y = (1 — x) -1/2 , y = f _ ^ j . 

TRANSCENDENTAL FUNCTIONS 

Any function that is not algebraic is called transcendental. The transcendental 
functions studied in calculus are the trigonometric, inverse trigonometric, expo¬ 
nential, and logarithm functions. We do not assume that students have any pre¬ 
vious knowledge of these functions. All will be carefully explained later. 

We conclude this section with a brief review of some important functions aris¬ 
ing in geometry. A ready grasp of the geometric formulas given in Fig. 1.22 is 
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Figure 1.22 Geometric formulas. 



Circle 
A = nr 2 
c = 2nr 



A = 47JT 2 



Cylinder 
V - nr 2 h 
A - 2irrh 



V - \ Ttr 2 h 


A - 7 rrs 


essential for coping with many examples and problems in the following chap¬ 
ters. These formulas—for the area and circumference of a circle, the volume and 
total surface area of a sphere, and the volume and lateral surface area of a cylin¬ 
der and a cone—should be understood if possible, but remembered in any event. 
Each of the first four formulas, those for the circle and the sphere, defines a func¬ 
tion of the independent variable r, in which a given positive value of r determines 
the corresponding value of the dependent variable. 

Most of our attention in this book will be directed at functions of a single in¬ 
dependent variable, as previously defined and discussed. Nevertheless, we point 
out that each of the last four formulas in Fig. 1.22 defines a function of the two 
variables r and h\ these variables are called independent (of each other) because 
the value assigned to either need not be related to the value assigned to the other. 
In special circumstances a function of this kind can be expressed as a function 
of one variable alone. For example, if the height of a cone is known to be twice 
the radius of its base so that h = 2r, then the formula for its volume can be writ¬ 
ten as a function of r or as a function of h: 

V = y 7 rr 2 ( 2 r) = f 77r 3 or V = \tt h — j^irh 3 . 

The formulas in Fig. 1.22 also illustrate the custom of choosing letters for vari¬ 
ables that suggest the quantities under discussion, such as A for area, V for vol¬ 
ume, rfor radius, h for height, and so on. 


PROBLEMS 


1 If f{x) = 5x 2 — 3, find: 


In each of Problems 3-8, compute and simplify the quantity 

(a) /( —3); 

(b) /(2); 

f(x + h) -f(x) 

(c) /(0); 

(d) f(-V- 7); 


h 

(e) f(a + 3); 

(f) f(5t). 

3 

fix ) = 5* - 3. 4 fix) = 3-2*. 

2 If g(x) = , find: 


5 

/(*) = * 2 . 6 /(*) = 2x 2 + x. 

1 , 3 

(a) *(3); 

(b) *(-3); 

7 

/« = - 8 fix) = —i-. 

X 1 — X 

(c) *(j); 

<d) * (i); 

9 

If fix) = x 3 - 3x 2 + 4x - 2, compute /(1), /(2), /(3), 
/(0)./(-l), and/( —2). 

(e) g(a + 1); 

(f) g(r-l). 

10 

If /(*) = 2f compute /(1), /(3), /(5), /(0), and /(-2). 
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11 If/(jc) = 4jc — 3, show that/(2x) = 2f(x) + 3. 

12 What are the domains of/(jc) = 1/(jc — 8) and g(jc) = jc 3 ? 
What is h(x) = f(g(x))l What is the domain of h(x)l 

13 Find the domain of each of the following functions: 

(a) Vx\ (b) V^jc; 

(c) V?; (d) Vx 2 - 4; 


14 

15 

16 
17 


(g) V(x - l)(x + 2); 


(h) 


1 _ 

V(x- l)(x + 2)’ 


(i) V3 - 2x - x 2 ; (j) 

If f{x) = 1 — jc , show that/(/(jc)) = jc. 

If /(*) = x/(x - 1), compute /(0), /(1), /(2), /(3), and 
/(/(3)). Show that /(/(jc)) = jc. 

If /(jc) = (ax + b)t(x — a ), show that/(/(;c)) = jc. 

If/(JC) = 1/(1 - JC), compute/(0),/(l),/(2),/(/(2)), and 


/(/(/(2))). Show that /(/(/(jc))) = jc. 

18 If /(jc) = ajc, show that /(jc) + /(I — jc) = /(I). Also ver¬ 
ify that /(jc] + JC 2 ) =/(jC|) + /(JC 2 ) for all jcj and JC 2 . 

19 If /(jc) = 2/ use functional notation to express the fact 


that 2 X] • 2* 2 = 2 jri ^ jf2 . 


20 Find /(jc) if /(jc + 1) = jc 2 - 5x + 3. Hint: Let u = x + 
1 and find f(u). 

21 A linear function is one that has the form/(jc) = ax + b, 
where a and b are constants. If g(jc) = cx + d is also lin¬ 
ear, is it always true that/(g(jc)) = g(/(jc))? 

22 If f(x) = ax + b is a linear function with a =£ 0, show 
that there exists a linear function g(jc) = ax + (3 such that 
/(g(jc)) = jc.* Also show that for these two functions it is 
true that/(g(jc)) = g(/U))- 

23 A quadratic function is one that has the form /(jc) = 
ax 2 + bx + c, where a , b, c, are constants and a 0. 

(a) Find the values of the coefficients a, b, c if/(0) = 3, 
/(l) — 2,/(2) = 9. 

(b) Show that, no matter what values may be given to 
the coefficients, a , b, c, the range of a quadratic func¬ 
tion cannot be the set of all real numbers. 

24 In each case, decide whether or not the equation deter¬ 
mines y as a function of jc, and if it does, find a formula 
for the function: 

(a) 3jc 2 + y 2 = 1; (b) 3jc 2 + y = 1; 


The symbols a and [3 are letters of the Greek alphabet whose names 
are “alpha” and “beta.” The letters of this alphabet (see the front end¬ 
paper) are used so frequently in mathematics and science that seri¬ 
ous students should learn them at the earliest opportunity. Among 
other benefits, this will avoid the annoyance of reading printed mat¬ 
ter containing symbols we don’t know how to pronounce. 


(c) - - * = x; (d) x = y - — . 

y- 1 y 

25 Split the equation 2jc 2 + 2jry + y 2 = 3 into two equa¬ 
tions, each of which determines y as a function of jc. 

The following problems all involve geometry. In working on 

such a problem, always draw a sketch and use this sketch as 

a source of ideas. 

26 If an equilateral triangle has side jc, express its area as a 
function of jc. 

27 The equal sides of an isosceles triangle are 2. If jc is the 
base, express the area as a function of jc. 

28 If the edge of a cube is jc, express its volume, its surface 
area, and its diagonal as functions of jc. 

29 A rectangle whose base has length jc is inscribed in a 
fixed circle of radius a. Express the area of the rectan¬ 
gle as a function of jc. 

30 A string of length L is cut into two pieces, and these 
pieces are shaped into a circle and a square. If jc is the 
side of the square, express the total enclosed area as a 
function of jc. 

31 (a) Is the area of a circle a function of its circumference? 

If so, what function? 

(b) Is the area of a square a function of its perimeter? If 
so, what function? 

(c) Is the area of a triangle a function of its perimeter? 
If so, what function? 

32 The volume of a sphere is a function of its surface area. 
Find a formula for this function. 

33 A cylinder is inscribed in a sphere with fixed radius a. 
If h is the height and r is the radius of the base of the 
cylinder, express its volume and total surface area as 
functions of r, and also as functions of h. 

34 A cylinder is circumscribed about a sphere. If their vol¬ 
umes are denoted by C and S, find C as a function of S. 

35 A cylinder has fixed volume V. Express its total surface 
area as a function of the radius r of its base. 

36 A fixed cone has height H and base radius R. If a cylin¬ 
der with base radius r is inscribed in the cone, express 
the volume of the cylinder as a function of r. 

37 (a) A farmer has 100 ft of fencing with which to build 

a rectangular chicken pen. If jc is the length of one 
side of the pen, show that the enclosed area is 

A = 50jc - jc 2 = 625 - (jc - 25) 2 . 

Use this result to find the largest possible area and 
the lengths of the sides that yield this largest area, 
(b) Suppose the farmer in part (a) decides to build the 
pen against a side of the bam so that he will have to 
fence only three sides of it. If jc is the length of a side 
perpendicular to the bam wall, find the enclosed area 
as a function of jc. Also find the largest possible area 
and the lengths of the sides that yield this largest area. 
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1.6 

GRAPHS OF 
FUNCTIONS 



In the previous section we discussed the concept of a function at some length. 
This discussion can be summarized in a few sentences, as follows. 

If x and y are two variables that are related in such a way that whenever a suit¬ 
able numerical value is assigned to x there is determined a single corresponding 
numerical value for y, then y is called a function of x and this is expressed by 
writing y = /(jc). The letter /symbolizes the function itself, which is the opera¬ 
tion or rule of correspondence that yields y when applied to jc. However, for prac¬ 
tical reasons we prefer to speak of “the function y = /(jc)” instead of “the func¬ 
tion /” As a matter of principle, students should clearly understand that a function 
is not a formula and need not be specified by a formula—even though most of 
ours are. 

Now for graphs. 

The Chinese have a well-known proverb that can be interpreted as expressing 
a basic truth about the study of mathematics: One picture is worth a thousand 
words.* For us, in our study of functions, this means draw graphs! Even more , 
cultivate the habit of thinking graphically , to the point where it becomes almost 
second nature. 

Before getting down to the details of specific functions, we emphasize that it 
is often possible to think of the graph of a function y = /(;c) very concretely, as 
the path of a moving point (Fig. 1.23). The independent variable x can be visu¬ 
alized as a point moving along the jc-axis from left to right; each jc determines a 
value of the dependent variable y, which is the height of the point (jc, y) above 
the jc-axis. The graph of the function is simply the path of the point (jc, y) as it 
moves across the coordinate plane, sometimes rising and sometimes falling, and 
in general varying in height according to the nature of the particular function un¬ 
der consideration. The graph as a whole is intended to provide a clear overall 
picture of this variation. The graph shown in Fig. 1.23 happens to be a smooth 
curve with two high points and one low point, but many diverse phenomena are 
possible. 

We now discuss the graphs of a few representative examples of the types of 
functions described in Section 1.5. 


POLYNOMIALS 

We have seen that the simplest polynomials are the powers of jc with nonnega¬ 
tive integral exponents, 

y = 1, jc, x 2 , jc 3 , . . . , x n , . . . . 

As we know, the graph of y = 1 is the horizontal straight line through the point 
(0, 1), and the graph of y = x is the straight line through the origin with slope 1 
(Fig. 1.24 a). For larger values of the exponent n, the graphs of y = x n are of two 
distinct types, depending on whether n is even or odd: 

y = jc 2 , jc 4 , jc 6 , . . . 


*See Bartlett’s Familiar Quotations, 16th ed. (Little, Brown and Co., 1992), fn. 8, p. 782. 
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Figure 1.24 Graphs of y = x n . 


These types are shown in parts b and c of Fig. 1.24. As n increases, these curves 
become flatter near the origin and steeper outside the interval [—1, 1]. 

We already know that the graphs of all first- and second-degree polynomials, 
such as 


and 


y = 2x - 1 


y = 3x 2 - 2x + 1, 

are straight lines and parabolas. These graphs are easy to draw—without plot¬ 
ting points—on the basis of the ideas in Sections 1.3 and 1.4. 

For our next remark we need a bit of new terminology. A zero of a function 
y =/(;c) is a root of the corresponding equation /(jc) = 0. Geometrically, the ze¬ 
ros of this function (if it has any) are the values of jc at which its graph crosses 
or touches the jc-axis; they are the jc-intercepts of this graph. 

Now consider the general second-degree polynomial 

y = ax 2 4- bx + c, a =£ 0. (1) 

As we know, the graph of this function is a parabola for all values of the coef¬ 
ficients. If we assume that a > 0, so that the parabola opens upward, then there 
are three possibilities for the zeros of (1), and these are shown in Fig. 1.25. Since 
the roots of the quadratic equation ax 2 + bx + c = 0 are given by the quadratic 
formula 

—b ± \!b 2 — 4 ac 



Figure 1.25 
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it is clear that the three possibilities in Fig. 1.25 correspond to the algebraic con¬ 
ditions b 2 — 4 ac > 0, b 2 — 4 ac = 0, b 2 — 4 ac < 0. 

The problem of graphing polynomials of degree n > 3 is not easy. Our dis¬ 
cussion of the following example suggests several useful ideas. 

Example 1 The graph of 

y = jc 3 - 3jc (2) 

is shown in Fig. 1.26. At present we have no methods available for discover¬ 
ing such important features of this curve as the precise location of the indicated 
high and low points. This will come later. Nevertheless, a few observations 
can be made, and these provide at least some details and a good enough im¬ 
pression of the shape of the graph so that students should be able to sketch it for 
themselves. 

We begin by pointing out that if (2) is written in factored form, as 

v = x(x 2 - 3) = x(x + V3)u - V3), (3) 

then its zeros are obviously 0, — V3, V3. These three numbers divide the T-axis 
into four intervals, as shown in Fig. 1.27, and a careful inspection of the factors 
of (3) tells us that in each interval y has the sign given in this figure. The details 
of this determination of the sign of y are important to understand, so we pause 
and carefully think it through, as follows: 

f or jc < — V3, x is negative, 

x + V3 is negative, and 

jc- V3 is negative, 

so their product y is negative; 

for —'V3 < x < 0, jc is negative, 

x + V3 is positive, and 
jc- V3 is negative, 
so their product y is positive; 

for 0 < jc < V3, jc is positive, 

jc + V3 is positive, and 

x-is negative, 

so their product y is negative; 

for jc > V3, jc is positive, 

jc + V3 is positive, and 
x — V3 i s positive, 
so their product y is positive. 


We therefore know, for each interval, whether the graph of (2) lies above or be¬ 
low the x-axis (see Fig. 1.26). We have described this method of analysis in de¬ 
tail because it will often be useful in other problems of curve sketching. 

Our second observation relates to the behavior of the graph of (2) when jc is 
numerically large, that is, far to the right or far to the left in Fig. 1.26. If (2) is 
written in the form 


y = 



X # 0, 
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then for large positive or negative values of jc the expression in parentheses is 
nearly 1, so y is close to jc 3 . In geometric language, when jc is large, the graph of 
(2) is close to the graph of y = jc 3 , as Fig. 1.26 suggests. In particular, the graph 
of (2) rises on the far right and falls on the far left. 


Students will notice that they can always sketch a graph by laboriously plot¬ 
ting many points and joining these points by a reasonable curve. Nevertheless, 
this rather clumsy procedure should be adopted only as a last resort, when more 
imaginative methods fail. The important features of functions and their graphs 
are much more clearly revealed by the qualitative approach to curve sketching 
that we have tried to suggest in Example 1 and will continue to emphasize. 

RATIONAL FUNCTIONS 

Example 2 The simplest rational function that is not a polynomial is 

* = 7 (4) 

On examining (4), we notice the following facts: y is undefined when jc = 0; y 
is positive when jc is positive, and is small when x is large and large when jc is 
near 0 on the right; y is negative when jc is negative, and is small when jc is large 
and large when jc is near 0 on the left. The graph of (4) given in Fig. 1.28 is a 
direct pictorial version of these statements. In this particular case the graph is 
also easy to sketch by plotting a few points, as shown in the figure. However, 
students will profit much more from simply visualizing the behavior of such a 
function on the various parts of its domain and drawing what they see in the 
mind’s eye. 

A straight line is called an asymptote of a curve if, as a point moves out along 
an extremity of the curve, the distance from this point to the line approaches 0. 
It is clear that both the jc-axis and the y-axis are asymptotes of the graph shown 


y 



K,-4) 



Figure 1.28 
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Figure 1.29 



in Fig. 1.28. The behavior of the function (4) at and near the point jc = 0, that 
is, the fact that y is undefined at jc = 0 and “becomes infinite” near x = 0, is de¬ 
scribed by calling this point an infinite discontinuity of the function. 


Example 3 In the case of the function 


it is clear that the point jc = 1 is particularly significant, since y is undefined at 
jc = 1 and is large in absolute value when jc is near 1 (jc = 1 is an infinite dis¬ 
continuity). Also, y is near 1 and slightly greater than 1 when jc is large and pos¬ 
itive, and is near 1 and slightly less than 1 when jc is large and negative.* These 
observations suggest drawing the vertical and horizontal guidelines shown in Fig. 
1 29a. If we notice that y = 0 when jc = 0, and use the method of Example 1 to 
find the sign of y in each of the intervals — < jc < 0, 0<jc< 1, and 1 < jc, 

then the graph as given in Fig. 1 29a is quite easy to sketch. The lines jc = 1 and 
y = 1 are both asymptotes. 


Example 4 The function 

= x = x (G\ 

y x 2 -3x + 2 (x- IX*-2) 1 

is similar to (5) but somewhat more complicated. Here the factored form of the 
denominator reveals two infinite discontinuities, jc = 1 and jc = 2. Again, y = 0 
when jc = 0, but this time y is small when x is large, since the degree of the de¬ 
nominator is greater than that of the numerator. If we combine these facts with 
the observable sign of y in each of the intervals — °° <jc<0, 0<jc< 1, 1 < 
jc < 2, and 2 < jc (think it through in the manner of Example 1 for each inter¬ 
val!), then it is fairly straightforward to sketch the graph as shown in Fig. 1.2%. 
There is evidently a high point between 1 and 2, and a low point to the left of 
0, but at present we are unable to determine the precise location of these points 
(we shall see later that they occur at jc = V2 and jc = — V2). 

*To see this, test with convenient specific values of jc; thus, for example, y = -y when jc = 10 and 
y = when jc = - 10. 
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Example 5 The function 

y = x + - (7) 

JC 

has an infinite discontinuity at x = 0, and is positive or negative according as jc 
is positive or negative. For small positive jc’s, the first term on the right of (7) is 
negligible and the second term is large; and for large positive jc’s, the second term 
is negligible and y is approximately equal to jc. We therefore sketch the part of 
the graph in the right half-plane as follows: Draw the guideline y = jc (Fig. 1.30); 
insert the two extremities of the curve, approaching this guideline and the posi¬ 
tive y-axis, as suggested by the behavior previously stated; and connect these ex¬ 
tremities in a reasonable way in the middle, where this part of the graph has an 
obvious low point. The function behaves similarly—with a corresponding high 
point—for negative values of jc. The y-axis and the line y = jc are both asymp¬ 
totes. 



Example 6 The denominator of 


y = 


X 

X 2 + 1 


( 8 ) 


is positive (in fact > 1) for all jc, so y = 0 when jc = 0, y is positive when jc is 
positive, and y is negative when jc is negative. Also, y is small when jc is large, 
because the degree of the denominator is greater than that of the numerator.* 
These properties of the function force the graph to have the shape shown in Fig. 
1.31, with one high point and one low point. 


Example 7 In considering the function 



(9) 



it is natural to factor the numerator, obtaining 

(*+ !)(*- 1 ) 


and then to cancel the common factor, which yields 

y = jc + l.t ( 10 ) 


^Notice that when the numerator x is large, the denominator x 2 + 1 is enormous, so y is small. 
f A word of warning about a point of algebra. To “cancel” a common factor, as in the text, is OK: 


at _ a 
bt~ b 


if c + 0. 


But “canceling” a common term , as in 

a + ft _ a_ 
b + t b' 

is WRONG. Try it: Is 

1 + 2 _ 1 ? 
2 + 2 2 ‘ 


Of course not. 


* 
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l>; This cancellation is valid except when x= 1. At this point the value of (10) is 2, 

but (9) has no value (y = 0/0, which is meaningless). To graph (9), we therefore 
draw the straight line (10) and delete the single point (1, 2), as shown in Fig. 
1.32. 

| Two functions y = f(x) and y = g(x) are said to be equal if they have the same 

I domain and iff(x) = g(x) for every x in their common domain. Accordingly, the 

I functions (9) and (10) are not equal, because they have different domains—the 

| point x = 1 is in the domain of (10) but is not in the domain of (9). The fact that 

l * the graph of (9) has a gap (or hole) corresponding to jc = 1 is expressed by say- 

Figure 1.32 ing that (9) is discontinuous at x = 1, or has a discontinuity at this point. 


ALGEBRAIC FUNCTIONS 
Example 8 The functions 

y = and y = V25 — x 2 (11) 

can be obtained by solving the equations 

y 2 = x and x 2 + y 2 = 25 (12) 

for y and choosing the positive square roots. We know that the graphs of equa¬ 
tions (12) are a parabola and a circle, as shown in Fig. 1.33, so the graphs of 
(11) are the parts of these curves that lie on or above the Jt-axis. 





Example 9 The graph of the absolute value function 

y = M 

is easy to draw (Fig. 1.34). To see that this function is algebraic, we have only 
to notice the fact that \x\ = V? for every value of x. 


Figure 1.34 


As these examples show, many of the basic features of a function are made 
transparently clear by sketching its graph. We are interested less in sketches of 
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high accuracy than in those that display broad general features: where the graph 
is rising and where falling, the presence of gaps, the presence of high points and 
low points, and what its approximate shape is. Formulas are obviously important 
in the study of functions — indeed, they are indispensable whenever our purposes 
require exact calculations yielding quantitative results. But we should never for¬ 
get that the primary aim of mathematics is insight, and graphs are invaluable aids 
for gaining visual insight into the individual characteristics of functions. 


PROBLEMS 


1 Sketch the graphs of the following polynomials, paying 
special attention to the location of their zeros and their be¬ 
havior for large values of jc: 

(a) y = x 2 + x - 2; 

(b) y = x 3 — 3jc 2 + 2x; 

(c) y = (1 - jc)(2 - jcX3 - x)\ 

(d) y = x 4 - x 2 ; 

(e) y = x 4 - 5x 2 + 4. 

2 Sketch the graphs of the following rational functions: 


(a)y = ^; 

(c) y = x 2 + —; 

* 


(e) y = 


(g) y '■ 
(i) >' = 


* 2 - U 

x 2 

r 2 -l’ 


(b) 

(d) y=x 2 + -y; 


(f) y = 
(h) y = 

(j) >’ = 


x 2 + r 

JC 

x 2 — 3jc + 2 . 

2- x ; 


(k) y = 

( l ) y = 


x- 1 ’ 

(jc + 2)(jc - 5)(JC 2 + 2x - 8) 

(jc — 2)(jc 2 - 3jc — 10) ' 

3 Sketch the graphs of the following algebraic functions: 
(a) y = V(j c - 1)(3 - *); 


(b) y = 

(c) y = 


V(* - 1)(3 - *) ’ 
1 


vT 


(d) )> = 


3 —jc’ 


(e) y = 


4-x 


(f) y = 


x — 4 


x — 2 ’ ^ Vx-2‘ 

4 In each of the following, sketch the graphs of all three 
functions on a single coordinate system: 

(a) y = |*| f y = |*| + 1, y = |*| - 1; 

(b) y = M, y = \x + 1|,J = |*— 1|; 

(c) y = |*|, y = 2\x\, y = 

5 Sketch the graphs of the following functions: 

(a) y = —; (b) y = \2x + 3|; 


(d) y = 2x + |*|; 

(f) y= l + x - |*|; 


(c) y = x + |*|; 

(e) y =x - j*|; 

(g) y = |* 2 - l|. 

6 Considering only positive values of jc, show that 


0 < jc< 1, 



I JC + 1 I - JC — 1 


and sketch the graph. 

7 Are any of the following pairs of functions equal? 

(a) /(*) = j, g(x)= 1. 

(b) /(jc) = jc 2 - 1, g(x) = (jc + 1 )(* - 1). 

(c) f(x) = x, g(x) = Vx 2 . 

(d) f(x) = x , g(x) = (Vx) 2 . 


Periodic phenomena are found everywhere in the world around us—vibrating 
springs, alternating currents, swinging pendulums, revolving planets, etc. — and 
scientists describe these phenomena by using trigonometric functions. For this 
and other reasons, students beginning the study of calculus are often expected to 
know something about trigonometry. 

Although most users of this book have some familiarity with basic trigonom¬ 
etry, we nevertheless review a few of the fundamental ideas, especially the ra¬ 
dian measure of angles and the definitions and simpler properties of the very im- 
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portant functions sin 6 and cos 6 .* This review is continued in Section 9.1, where 
the discussion is broadened to include the other four trigonometric functions 
tan 6 , cot 0, sec 6 , esc 6 —all of which are indispensable in Chapter 10 but will 
not be needed until then. 

In high school trigonometry courses the sine and cosine of an acute angle 6 
are first defined as ratios of sides in a right triangle, as follows (see Fig. 1.35): 

s \ n Q — 0 PP 0S ^ te side _ £_ 
hypotenuse h ’ 

adjacent side b 

cos 6 = —^-- = -. 

hypotenuse h 

Because similar triangles have proportional sides, the values of sin 6 and cos 6 
depend only on the size of the acute angle 6 , and not at all on the size of the 
right triangle whose sides are used to compute these values. 


Example 1 We know from geometry that in a 30°-60° right triangle, the side 
opposite the 30° angle is half the hypotenuse (see Problem 32 in Section 1.2). 
This enables us to draw the familiar right triangles shown in Fig. 1.36, and from 
these triangles we see that 


sin 30° 

cos 30° 


]_ 

r 

V3 
2 ’ 


sin 60° 

cos 60° 


V3 
2 ’ 

2 ’ 


sin 45° = — 

V2 

cos 45° = -^-=. 

V2 


It is customary to rationalize the denominators on the right by writing 


jl = _l.v2. = i V 2 

V2 V2 V2 2 


but for the moment we leave these values as they stand in order to emphasize the 
defining ratios. 


*The Greek letter 9 is pronounced “theta.” 





Figure 1.36 
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The ideas described here are part of what is called right triangle trigonome¬ 
try, in which angles are measured in degrees and sines and cosines are defined 
only for acute angles of right triangles. In the equivalent forms 

a = h sin 6 and b = h cos 6 , 


these definitions have a number of applications in geometry and physics. This is 
all right as far as it goes. However, for the purposes of calculus the limitations 
of this approach are crippling. We therefore start all over again at the beginning 
and give a capsule development of analytic trigonometry, in which the trigono¬ 
metric functions are freed from their dependence on right triangles and are de¬ 
fined as real-valued functions of a real variable. As an example of what we mean 
by analytic trigonometry, let us consider the motion of an object oscillating up 
and down at the end of a spring (Fig. 1.37). If this motion is described by the 
position function 

s =f(t) = cos t, 

which gives the position 5 as a function of the time t, then it makes little sense 
to think of t as an angle and measure its values in degrees. We must consider 
what cos t means when t is not an angle but a number —the number of seconds 
that have elapsed since the motion began when t = 0. 

Our treatment below is self-contained. Even a student who knows nothing of 
the subject will be able to learn everything that matters by reading with close at¬ 
tention and working through the problems at the end of the section. 



Figure 1.37 


RADIAN MEASURE 

In elementary mathematics and daily life, angles are measured in degrees, with 
90° measuring a right angle. But the degree is an arbitrary measure inherited from 
the ancient Babylonian astronomers, and its use in calculus would make many 
of our formulas intolerably messy. In calculus we use a much more natural and 
convenient system called radian measure, which is defined in terms of how much 
arc an angle cuts off on a circle. 

In this system the unit of angle measurement is called the radian. One radian 
is the angle which, placed at the center of a circle, subtends (cuts off) an arc 
whose length equals the radius (Fig. 1.38, left). More generally, the number of 
radians 6 in an arbitrary central angle (Fig. 1.38, right) is defined to be the ratio 
of the length s of the subtended arc to the radius r, 0 = s/t; so that s = rO. We 



0 = I radian 


0 = 7 radians 


Figure 1.38 
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Figure 1.39 



360°= 2 7T 


note especially that in a unit circle (r = 1), a central angle of 6 radians subtends 
an arc of length s = 6. Since the circumference of a circle is c = 2irr , a com¬ 
plete central angle of 360° is equivalent to lirrlr — 2 tt radians. Thus, 

2 tt radians = 360° or tt radians = 180°; 
and it follows from this that 

1 radian = — = 57.296°, 1° = -|-= 0.0175 radian. 

7 T 1 o(J 

Further, 90° = 7?/2, 60° = tt/3, 45° = 7t/ 4, and 30° = 7?/6, where we follow the 
convention of omitting the word “radian” in using radian measure. It is a good 
idea for students to memorize these common conversions with the aid of the cir¬ 
cle diagrams in Fig. 1.39. In addition to knowing the conversions in these dia¬ 
grams, it will help students feel more comfortable with radians if they also think 
through and verify the additional conversions in the following table. 


Degrees 

30 

45 

60 

90 

120 

135 

150 

180 

210 

225 

240 

270 

300 

315 

330 

360 

Radians 

77 

77 

77 

77 

277 

377 

577 

77 

777 

577 

477 

377 

577 

777 

11 77 

277 

6 

~4 

3 

2 

3 

4 

6 

6 

4 

3 

2 

3 

4 

6 


The specific reason why radian measure for angles is preferred in calculus will 
appear in Section 3.4. In most of our work we will use radian measure routinely 
and mention degrees only in passing. 


DEFINITIONS OF sin 6 AND cos 6 



We approach trigonometry by way of analytic geometry. Consider the unit cir¬ 
cle x 2 + y 2 = 1 in the jcy-plane (Fig. 1.40), and let 6 be an arbitrary real num¬ 
ber. If 6 is positive, let the radius OP start in the position OA and revolve coun¬ 
terclockwise through 6 radians. Thus, 6 =tt produces half a revolution and 6 = 
2tt produces a complete revolution, both counterclockwise. If 6 is negative, we 
form the positive number — 6 and let OP revolve clockwise through — 6 radians. 
See Fig. 1.41. In this way, each real number 6 (positive, negative, or zero) de¬ 
termines a unique position of the radius OP in Fig. 1.40, and therefore a unique 
point P = (jc, y) with the property that x 2 + y 2 = 1. 

The sine and cosine o f 6 are now defined by 

sin 6 = y and cos 6 = x. 


The word “sine,” sinus in Latin, is a corruption of an Arabic word meaning 
“chord” or “bowstring ” Since sin and cos are the names of functions, the proper 
notation should be sin(0) and cos(0), just as we write f(d) when the function is 
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f However, in the case of trigonometric functions it is customary to omit the 
parentheses. It is evident from the definition that — 1 < sin 6 < 1, and similarly 
for cos 6. The algebraic signs of these quantities depend on which quadrant of 
the plane the point P happens to lie in (Fig. 1.42). For values of 0 such that 0 < 
0 < 7t/ 2, these definitions agree with the right triangle definitions given above, 
because in the triangle in Fig. 1.40 we have sin 0 = y = y/1 = (opposite side)/ 
(hypotenuse) and cos 6 = x = x/1 = (adjacent side)/(hypotenuse). 





sin 0 > 0 sin 0 > 0 sin 0 < 0 sin 0 < 0 

cos 0 > 0 cos 0 < 0 cos 0 < 0 cos 0 > 0 


Figure 1.42 


IDENTITIES 

If we compare the angles 0 and —0 in Fig. 1.43, we see at once that 

sin ( — 0) = -sin 0 and cos ( — 9) = cos 6. (1) 

The equation x 2 + y 2 = 1, or equivalently y 2 + x 2 = 1, translates immediately 
into the important identity 

sin 2 6 + cos 2 6 = 1. (2) 

[The somewhat strange notation sin 2 0 is the standard way of writing the square 



Figure 1.43 
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of the number sin 0, that is, (sin 0) 2 ; and similarly for cos 2 0 .] Problem 10 in 
Section 9.1 outlines a general proof of the addition formulas 

sin (0 + </>) = sin 0 cos </> + cos 0 sin </>, (3) 

cos (0 + </>) = cos 0 cos </> - sin 0 sin </>. (4) 

We give a proof of (3) below, in connection with Fig. 1.44, for the restricted case 
in which 0 and are both positive angles whose sum is less than 7 t/ 2. First, how¬ 
ever, we point out that if we put <£ = 0 in (3) and (4), we obtain the double¬ 
angle formulas 

sin 26 = 2 sin 6 cos 6, (5) 

cos 26 = cos 2 6 - sin 2 6. ( 6 ) 


And finally, if we write (2) and (6) together as 

cos 2 6 + sin 2 0 = 1 , 
cos 2 0 — sin 2 0 = cos 20 , 


then by adding and subtracting we get the half-angle formulas 

cos 2 0 = 7(1 + cos 20), (7) 

sin 2 0 = 4(1 - cos 20 ). ( 8 ) 

Now, to prove (3) for the restricted case mentioned above, we consult Fig. 1.44 
and write 


sin (0 +</>) = 


PQ = PT + TQ 
OP ” OP 

PT 4 - RS = P7'_ + RS_ 
OP ~ OP + OP 

PT PR | RS OR 
PR ' OP OR OP 


= cos 0 sin </> + sin 0 cos </>. 
A similar argument can be given for formula (4). 


VALUES AND GRAPHS 

Example 1 provides several first-quadrant 0’s for which exact values of sin 0 and 
cos 0 are easy to find. These facts can also be obtained by looking carefully at 
the three parts of Fig. 1.45 and remembering the Pythagorean theorem: 


sinf = iV2, 


sinf = iV3, 


C0S f =2^’ 


77 I . rr 

cos — = —V 2 , 
4 2 


77 1 

c°s T = -. 


Also, an inspection of Fig. 1.40 with OP in various positions gives us similar in¬ 
formation for the cases 0 = 0, 7 t/2, 77 , 37t/2, 27t: 


77 , 

sin y = f 


377 


sin 0 = 0 , 


sin 77 = 0, 


sin 27 7 = 0, 







1.7 INTRODUCTORY TRIGONOMETRY. THE FUNCTIONS sin 0 AND cos 0 


43 




COS 0=1, COS — = 0, COS 77= -1, COS —— = 0, COS 277 = 1. 

2 2 

Further, by drawing pictures and using the ideas in Fig. 1.45 we can find the ex¬ 
act values of sin 0 and cos 0 for any value of 0 that represents an angle one-third, 
one-half, or two-thirds of the way through any quadrant. 


Example 2 To illustrate this remark, we point out (Fig. 1.46) that 135° = 37 t/ 4 
is halfway from 7 t/ 2 to 77 , so the point P is in the second quadrant with coordi¬ 
nates (— 7 V 2 , IV 2 ), and consequently we have 

.377 1 rr 377 K/r 

Sin T = 2 V2 ’ cos —= --V2. 


Similarly, 300° = 57 t/ 3 is one-third of the way from 37 r /2 to 277, so P is in the 
fourth quadrant with coordinates ( 7 , — 7 V 3 ), and we have 





J_ 

2 ' 


Of course, most 0’s are beyond the scope of these methods, and in these cases 
the values of sin 0 and cos 0 can be found from trigonometric tables or a calcu¬ 
lator. The problem of how these values themselves are calculated is more diffi¬ 
cult, and will be discussed in Chapter 14. 

For every 0, the numbers 0 and 0 H- 277 clearly determine the same point P, so 



sin (0 + 277) = sin 0 and cos (0 4- 277) = cos 0. 


This says that the values of sin 0 and cos 0 repeat when 0 increases by 277. We 
express these properties of sin 0 and cos 0 by saying that these functions are pe¬ 
riodic with period 2 77 . 

The graph of sin 0 is easy to sketch by looking at Fig. 1.40 and using imagi¬ 
nation to follow the way y varies as 0 increases from 0 to 27r, that is, as the ra¬ 
dius swings around through one complete counterclockwise revolution. It is clear 
that sin 0 starts at 0 , increases to 1 , decreases to 0 , decreases further to — 1 , and 
increases to 0. This gives one complete cycle of sin 0 on the interval 0^0^ 
27 7 , as shown on the left in Fig. 1.47. By using the periodicity of sin 0, we see 
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that the complete graph (on the right in the figure) consists of infinitely many 
repetitions of this cycle, to the right and to the left. The graph of cos 6 can be 
sketched in essentially the same way (Fig. 1.48). The main difference is that 
cos 6 starts at 1 when 0 = 0, decreases to 0, decreases further to — 1, increases 
to 0, and increases further to 1. 




On the other hand, the graph of sin 20 makes one complete cycle on the in¬ 
terval 0 < 6 < 7r, because 2 6 increases from 0 to 2tt as 6 increases from 0 to it 
(Fig. 1.49, left). This says that sin 26 oscillates twice as fast as sin 6. In the same 




Figure 1.49 
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way we see that sin oscillates half as fast as sin 6 (Fig. 1.49, right). In gen¬ 
eral, both sin k6 and cos k6 make one complete cycle for 0 < k6 < 2ir, or equiv¬ 
alently, on the interval 0 < 0 < lirlk. 

Notice that degrees are almost entirely banished from this way of thinking 
about trigonometry. Trigonometric values can be written using degree measure 
or radian measure: either sin 30° or sin 7 t/ 6; either cos 90° or cos 7r/2. But when¬ 
ever we think of trigonometric functions, as in writing y = sin 6 or f(Q) = cos 0, 
the independent variable 0 is always understood to be in radians. 

The functions sin 0 and cos 0 are the basic trigonometric functions, but there 
are four others that are also important though less fundamental: the tangent, cotan¬ 
gent, secant, and cosecant. These can be defined as follows: 


tan 0 = 


cot 0 = 


sin 0 
cos 0’ 

cos 0 
sin 0’ 


sec 0 = 


esc 0 = 


cos 0’ 

1 

sin 0' 


Even though we mention them here, these four functions will not be essential for 
our work until we reach Chapter 10. At that time we will review them thoroughly. 


PROBLEMS 


Convert the given angle from degrees to radians: 

(a) 15°; (b) 150°; 

(c) 1500°; (d) -36°; 

(e) -110°; (f) 7°. 

Convert the given angle from radians to degrees: 

(a) ti/ 15; (b) tt/45; 

(c) -tt/ 36; (d) -3; 

(e) 77 2 ; (f) 30. 

Find the value of the given expression without using ta¬ 
bles or a calculator: 

(b) sin 780°; 

l 157 t \ 

(d) cos-- 


(a) cos (- 120 °); 
1777 
3 ; 

1977 
~6~ ; 


(c) sin - 


(e) sin ■ 


(f) cos 


9977 


Is the given number positive, negative, or zero? 

(a) sin 5007r; (b) cos 7; 

(c) sin 901°; (d) cos 2 4 . 

Verify the given identities: 

(a) sin 30 = 3 sin 0 - 4 sin 3 0 (Hint: 30 = 20 + 0); 

(b) cos 30 = 4 cos 3 0-3 cos 0. 

By sketching the angles in a unit circle and using the 
facts that sin 77/6 = 7, cos 77/6 = V3/2, find 


(a) 


/ 7r \ 

(b) 


7 77 

sin 1 

hr> 

sin 

~6~ ; 

(c) 


1377 

(d) 


( 77' 

sin 

”<r ; 

cos 

(“6, 

(e) 


Itt 

(f) 


1377 

cos 

”6~ ; 

cos 

6 * 


Express each trigonometric function as a corresponding 

function of an angle in the first quadrant (0 < 0 < ir/2) 

preceded by a 4- or - sign: 

9tt . 

(b) sin 777; 


(a) sin - 
(c) sin Y 
(e) cos IO 77 ; 
(g) cos 
(i) cos 


(d) sin — 


877 


m 977 
(f) cos —; 


677 

"I 

11 77 

3 ' 


(h) sin - 


1177 


8 Replace </> by — 4> in the addition formulas (3) and (4), 
and use the identities (1), to obtain the subtraction for¬ 
mulas: 








46 


NUMBERS. FUNCTIONS, AND GRAPHS 


sin (6 - c/>) = sin 6 cos c/> - cos 6 sin c/>, 
cos (0 — <t>) = cos 6 cos c/> + sin 6 sin c/>. 


9 By examining Fig. 1.50, obtain the following identities: 

(a) sin ( 77 - — 0) = sin 6 , cos (77 — 0) = —cos 0; 

(b) sin - 0 ) = cos 6 , cos ^y - = sin 6 . 

Use similar arguments — based on appropriate pictures 
—to obtain identities (c) and (d): 

(c) sin (6 + 7r) = —sin 6 , cos (0 + 77 ) = —cos 6\ 

(d) sin ^0 + yj = cos 6 , cos ^0 + yj = -sin 6. 

10 Derive the identities in Problem 9 as special cases of the 
addition and subtraction formulas. 

11 The half-angle formulas (7) and (8) are called by this 
name because if we set 26 = a, they can be written as 



cos ja = ± 


1 4- cos a 


Use these formulas to find the values of sin 15° and 
cos 15°. 

12 Apply the formulas in Problem 11 to find the values of 
sin 30° and cos 30° from the fact that cos 60° = 

13 Apply the half-angle formula for the cosine to find 

(a) cos y; (b) cos -y-. 


14 

15 


16 

17 


18 


Apply the half-angle formula for the sine to find 

(a) siny; (b) sin^-yj. 

Use the appropriate addition or subtraction formula to 
find 


, x . 277 . 27 T 77 

(a) sin —— from —— =77 —— ; 

3 3 3 

. 57T 577 77 

(b) COS -y from -y = 77 + y; 

. . 1777 1777 _ 77 

(c) sin —— from —— = 37 7 —— 

6 6 6 

Use the method of the preceding problem to find 

/ x 19 77 IO77 . 11 77 

(a) cos —7—; (b) cos ——; (c) sin —7—. 

o 5 o 

Check the identity for sin(0 + (f >) when 

77 77 

(a) 6 = — and </> = —; 

o J 

(b) 6 = y and 0 = y 


Check the identity for cos(0 + 4>) when 

77 77 

(a) 6 = — and = —; 
o J 


(b) 6 = j and <t> = f- 

19 Find sin 5 77 /12 b y using the fact that 577/12 = 77/4 -I- 77 / 6 . 

20 Find sin 77/12 by using the fact that 77/12 = n/4 - 77 / 6 . 
Reconcile your answer here with the first answer in Prob¬ 
lem 11. 

21 Establish the addition formula (4) for the cosine by the 
method suggested in the text, that is, by using Fig. 1.44. 


Figure 1.50 



CHAPTER 1 REVIEW: DEFINITIONS, CONCEPTS, METHODS 


Define, state , or think through the following. 

1 Rational and irrational numbers. 

2 Real line. 

3 Rules for inequalities. 


4 Absolute value of a number. 

5 Closed and open intervals. 

6 Coordinate plane. 

7 Pythagorean theorem. 
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8 Distance formula. 

9 Midpoint formulas. 

10 Slope of a straight line. 

11 Point-slope equation of a line. 

12 Slope-intercept equation of a line. 

13 Slope criterion for parallel lines. 

14 Slope criterion for perpendicular lines. 

15 Equation of a circle. 

16 Completing the square. 

17 Definition of a parabola. 

18 Equations of parabolas. 

19 Function. 

20 Domain and range of a function. 

21 Independent and dependent variables. 


22 Polynomials. 

23 Rational functions. 

24 Algebraic functions. 

25 Transcendental functions. 

26 Graph of a function. 

27 Zero of a function. 

28 Asymptote of a curve. 

29 Infinite discontinuity of a function. 

30 Radian measure. 

31 Sine and cosine of 6. 

32 Addition and subtraction formulas. 

33 Values and graphs of sin 6 and cos 6. 

34 Double-angle formulas. 

35 Half-angle formulas. 


ADDITIONAL PROBLEMS FOR CHAPTER 1 


SECTION 1.2 

1 If a and b are positive numbers, prove the inequality 
\fab < \{a + b) as Euclid did, by considering a right tri¬ 
angle inscribed in a semicircle (Fig. 1.51). 



2 If a and b are any two numbers, denote the larger by 
max {a, b) and the smaller by min (a, b). Show that 

max ( a , b) = \{a + b + \a - b\), 

and find a similar expression for min (a, b). 

3 Show that if a ^ b and c ^ d, then a + c < b + d . Use 
this fact to prove that \a + b\ ^ |a| + \b\. Hint: Begin by 
noticing that -\a\ ^ a < \a\ and - \b\ < b < \b[ 

4 If a is a positive rational number, explain why the fol¬ 
lowing method for calculating the square root of a works. 
First, choose a rational number which is a reasonable 
guess at the value of \fa , and call this initial approxi¬ 
mation x\. Next, divide a by x\ and average the result 
with jcj, thereby obtaining a second approximation jt 2 . 
Next, divide a by jc 2 and average the result with jc 2 , ob¬ 
taining a third approximation x 3 . This procedure is ex¬ 
pressed by the formula 

x " +i= i(*" + i) * =| ’ 2 ’ 3 . 

Hint: If x\ is reasonably close to \Ta but different from 
it, then Vtf lies between x\ and a/x \ (why?), and so the 


average of x\ and a/x\ is likely to be even closer to \fa\ 
also note that 

Xn +1 - Va = 2- (x n - iVa + -J-j = Y^~( x n - Va) 2 . 

5 Use the method of Problem 4 to calculate V2, first with 
x\ = 1 and then with x\ = j. 

6 Use the method of Problem 4 to calculate V3, first with 
X\ —2 and then with x\ — 7 . 

7 If a and b are real numbers with a <b, show that there 
exists at least one rational number c such that a < c < 
b , and hence infinitely many. In particular, between any 
two irrationals there exist an infinite number of rationals. 

8 If a is a nonzero rational number and b is irrational, show 
that a + b, a — b, ab, alb , and bla are all irrational. 

9 If a and b are irrational, is a + b necessarily irrational? 
Is abl 

10 If a and b are real numbers with a < b, show that there 
exists at least one irrational number c such that a < c < 
b , and hence infinitely many. In particular, between any 
two rationals there exist an infinite number of irrationals. 

1 1 Give another proof of the Pythagorean theorem by using 
the equations 

a e , b d 

— = — and — = — 

c a c b 

obtained from similar triangles in Fig. 1.52.* 



*Further proofs can be found in Section B. 1 of Simmons, Calculus 
Gems (McGraw-Hill, 1992). 
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12 In each case place the figure in a convenient position rel¬ 
ative to the coordinate system and prove the statement 
algebraically: 

(a) The sum of the squares of the distances of any point 
from two opposite vertices of a rectangle equals the 
sum of the squares of its distances from the other two 
vertices. 

(b) In any triangle, 4 times the sum of the squares of the 
medians equals 3 times the sum of the squares of the 
sides. 

13 If P i = to, V|) and P 2 = to, 3^2) are distinct points, and 
if P = U, y) is located on the segment joining them in 
such a position that the ratio of its distance from P\ to 
its distance from P 2 is q/p, show that 


px 1 + qx 2 
P + <7 


and 


pvi + qyi 

p + q 


14 Find the point on the segment joining (1,2) and (5, 9) 
that is H of the way from the first point to the second. 


SECTION 1.3 

15 If the line determined by two distinct points to, Ah) and 
to, ^ 2 ) is not vertical, and therefore has slope to — y\)l 
to — jci), show that the point-slope form of its equation 
is the same regardless of which point is used as the given 
point. 

16 Determine what each of the following statements implies 
about the constants A, B, C in the equation Ax + By + 
C = 0: 

(a) The line goes through the origin. 

(b) The line is parallel to the y-axis. 

(c) The line is perpendicular to the y-axis. 

(d) The line goes through (1, 1). 

(e) The line is parallel to 5x + 3y = 2. 

(f) The line is perpendicular to jc + lOy = 3. 

17 If the lines A \x + B\y + Cj = 0 and A 2 x + B 2 y + C 2 = 
0 are not parallel and k is any constant, show that 

{A i* + B\y + Ci) + k (A 2 x + B 2 y + C 2 ) = 0 

is a line through the point of intersection of the given 
lines. When k is assigned various values, this equation 
represents various members of the family of all lines 
through the point of intersection. 

18 Given the lines x + 3y — 2 = 0 and 2x - y + 4 = 0, use 
Problem 17 to find the equation of the line through their 
point of intersection which 

(a) passes through (-2, 1); 

(b) is perpendicular to the line 3y + jc = 21; 

(c) passes through the origin. 

19 The points (0, 0), (a, 0), and ( b , c) are the vertices of an 
arbitrary triangle which is placed in a convenient posi¬ 
tion relative to the coordinate system. 

(a) Find the equation of the line through each vertex per¬ 
pendicular to the opposite side, and show alge¬ 


braically that these three lines intersect at a single 
point. 

(b) Find the equation of the perpendicular bisector of 
each side, and show algebraically that these three 
lines intersect at a single point. Why is this fact geo¬ 
metrically obvious? 

(c) Find the equation of the line through each vertex and 
the midpoint of the opposite side, and show alge¬ 
braically that these three lines intersect at a single 
point. Also, verify that this point is two-thirds of the 
way from each vertex to the midpoint of the oppo¬ 
site side. 

20 Show that each of the following is the equation of a 
straight line: 

(a) * 3 - x 2 y - 2x 2 + 3x - 3y -6 = 0. 

(b) 3 xy 2 + 5y 2 - y 3 - 4y + 1 2 jc + 20 = 0. 

21 S how that the distance from a point (jc 0 , yo) to a line Ax + 
By + C = 0 is given by 

\Axp + By 0 + C| 

Va 2 + 5 2 

22 Find the distance between the parallel lines Ax + 3y + 
12 = 0 and Ax + 3y - 38 = 0. 

23 If two intersecting straight lines are given, then it is easy 
to see that the bisectors of the angles formed by these 
lines are two other straight lines whose points are 
equidistant from the given lines. Use this fact to find the 
equations of the bisectors of the angles formed by the 
lines 

(a) 3 jc + 4y - 10 = 0 and 4 jc — 3y — 5 = 0; 

(b) y = 0 and y = x. 

24 Why is it geometrically obvious (without calculation) 
that the bisectors of the angles of any triangle intersect 
at a single point? 

SECTION 1.4 

25 Find the values of b for which the line y = 3 jc + b in¬ 
tersects the circle x 2 + y 2 = 4. 

26 If the line y = mx + b is tangent to the circle x 2 + y 2 = 
r 2 , find an equation relating m, b, and r. 

27 Find the equation of the locus of a point P = to y) that 
moves in such a way that 

(a) its distance from (0, 0) is twice its distance from 
(a, 0); 

(b) the product of its distances from ( a , 0) and (- a , 0) 
is a 2 (this curve is called a lemniscmte). 

In each case, sketch the graph. 

28 A line segment of length 6 moves in such a way that its 
endpoints remain on the x-axis and y-axis. What is the 
equation of the locus of its midpoint? 

29 A point moves in such a way that the ratio of its dis¬ 
tances from two fixed points is a constant k =£ 1. Show 
that the locus is a circle. 
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30 Find the equation of the line which is tangent to the cir- 
cle x 2 4 y 2 4 Sx 4 6y 4 8 = 0 at the point ( — 8, — 2). 

31 Find the equations of the lines that pass through the point 
(1,3) and are tangent to the circle x 2 4 y 2 = 2. 

32 If two circles 

x 2 4 y 2 4 A\x 4 B\y 4 C\ = 0 
and 

x 2 4 y 2 + A 2 X 4 - B 2 y + C 2 = 0 

intersect in two points, and if A: is a constant 4 — 1, ex¬ 
plain why 

(x 2 4 y 2 4 A\x 4 B\y 4 Cj) 

4 k(x 2 4 y 2 4 A 2 x 4 B 2 y 4 C 2 ) = 0 

is the equation of a circle through the points of intersec¬ 
tion. If k = - 1, what does the equation represent? 

33 Use Problem 32 to find the equation of the line joining 
the points of intersection of the circles x 2 4 y 2 = 4x 4 
4y - 4 and x 2 4 y 2 = 2y. Also find these points of in¬ 
tersection. 

34 Show that a parabola with focus at the origin, axis the 
*-axis, and opening to the right has an equation of the 
form y 2 = 4 p(x 4 p), where p > 0. 

35 Find the equation of the parabola with focus (1, 1) and 
directrix r 4 y = 0, and simplify this equation to a form 
without radicals. Hint: See Problem 21. 

36 Let the vertex of the parabola x 2 = 4 py be joined to every 
other point of the parabola. Show that the midpoints of 
the resulting chords lie on another parabola. Find the fo¬ 
cus and directrix of this second parabola. 

37 Consider all chords with given slope m that have end¬ 
points on the parabola x 2 = 4 py. Prove that the locus of 
the midpoints of these chords is a straight line parallel 
to the y-axis. 

38 A focal chord of a parabola is the segment cut by the 
parabola from a straight line through the focus. 

(a) If A and B are the endpoints of a focal chord, and if 
the line through A and the vertex intersects the di¬ 
rectrix at a point C, show that the line through B and 
C is parallel to the axis of the parabola. 

(b) Show that the length of a focal chord is twice the dis¬ 
tance from its midpoint to the directrix. 

(c) Show that if the two tangents to a parabola are drawn 
from any point on the directrix, then the points of 
tangency are the endpoints of a focal chord. 

39 Given the two points A = (4 p, 0) and B = (4 p, 4p), di¬ 
vide the segments OA and AB into equal numbers of equal 
parts, number the points of division as shown in Fig. 1.53, 
and join the points of division on AB to the origin by 
straight lines. Show that the points of intersection of each 
of these lines with the corresponding vertical lines lie on 
the parabola x 2 = 4 py. 



SECTION 1.5 

40 Find the domain of each of the following functions: 

x 


(a) 5 - *; 

(c) V 3.1 - 2; 

, , x + 1 

(e) 2-Q’ 

x z - 9 

(g) V9 - 4x 2 ; 


(b) 


2x — 3 ’ 


(d) V5 - 3 jc; 

(f) Vr, 

(h) 


V* + 3 ’ 


(i) V7 jc 2 + 5. 

41 If f(x) = ax + b, show that 


J. ^1 + X2 j _ f{X\) +/(*2) 

Is this true for f{x) = x 2 ? 

42 If/(x) = (1 + x)/(l - x), find 

(a)/(-*); (b) / (^) ; 

( £ ) / (737) (d) /(/«)■ 

43 If/(*) = Vr, what function g(x) has the property that 
g(f(x)) = xl 

44 The perimeter of a right triangle is 6 and its hypotenuse 
is x. Express the area as a function of x. 

45 A cylinder has fixed total surface area A. Express its vol¬ 
ume as a function of the radius r of its base. 

46 A cone is inscribed in a sphere with fixed radius a . If r 
is the radius of the base of the cone, express its volume 
as a function of r. 

47 A cone is circumscribed about a sphere with fixed radius 
a. If r is the radius of the base of the cone, express its 
volume as a function of r. 

48 If f(x) = (x — 3)/(x 4 1), show that/(/(/(*))) = *. 

49 Let a, b, c , d be given constants with the property that 
ad - be 4 0. If /(*) = (ax 4 b)/(cx 4 d), show that 
there exists a function g(x) = (ax 4 (3)/(yx 4 8) such 
that/(g(x)) = x. Also show that for these two functions 
it is true that/(g(x)) = g(f(x)). 
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SECTION 1.6 

50 Let p(x) = a n x n + a n -\x n ~ ] + • • ■ + a\X + ao be a 
polynomial of degree n > 1, and prove the following 
statements: 

(a) If p( 0) = 0, then p(x) = xq(x), where q{x) is a poly¬ 
nomial of degree n — 1 . 

(b) If a is any real number, the function /(jc) defined by 
fix) = p{x + a) is a polynomial of degree n. 

(c) If a is a real number for which p(a) = 0, that is, if a 
is a zero of p(x\ then p(x) = (x — a)r(x\ where r(jc) 
is a polynomial of degree n — 1. Hint: Consider 
fix) = p(x + a). 

(d) p(x) has at most n zeros. 

51 If n is any integer > 1, show that there exists a polyno¬ 
mial of degree n with n zeros. If n is even, find a poly¬ 
nomial of degree n with no zeros; and if n is odd, find 
one with only one zero. 

52 Let p{x) = a n x tl + a n -\x n ~ ] + • • • + a\x + ao be a 
polynomial of degree n > 1. If p{x) has n zeros x\, X 2 , 
. . . , x n , and is therefore expressible in the form 

p(x) = a n (x - x\)(x - x 2 ) • ■ ■ (x - x n ). 


show that 
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54 

55 


56 


57 

58 


(a) X\X 2 ' * •*„ = (-1) 


ao, 

a n 


(b) *1 + *2 + * * ■ + = —^ Lj -. 

a n 

A function/is said to be even if/(-jc) = f(x) for every 
jc in its domain, and it is said to be odd iff(—x) = —fix) 
for every jc in its domain (in each case, it is understood 
that -jc is in the domain of/whenever jc is). Determine 
whether each of the following functions is even, odd, or 
neither: 

(a) fix) = jc 3 ; (b) fix) = x(x 3 + x); 


(c) fix) = |jc|; (d) fix) = x + —; 

JC 

(e) fix) = x 2 + —; (f) fix) = *2 T J ’ 

(g) fix) = x 5 + 1 ; (h) /(x) = x(x + 1). 

What is the distinguishing feature of the graph of an even 
function? Of an odd function? 

What can be said about 

(a) the product of two even functions? 

(b) the product of two odd functions? 

(c) the product of an even function and an odd function? 
If /(jc) is an arbitrary function defined on an interval of 
the form [~a, a], show that/(jc) is expressible in one and 
only one way as the sum of an even function g(jc) and an 
odd function h(x\ /(jc) = g(jc) + h(x). Hint: /(-jc) = 
8(x) ~ h(x). 

Write down a second-degree polynomial whose values 
at 1,2, and 3 are 77 , V3, and 550. 

If a and b are positive constants, sketch the graph of 


y = (|* + a\ + |x - a\ - 2|x|). 

59 The symbol [x] (read “bracket x”) is often used to de¬ 
note the greatest integer < a real number x. For exam¬ 
ple, [1] = 1, [2.1] = 2, [tt] = 3, and [-1.7] = -2. 
Sketch the graphs of the following functions: 

(a) y = [x]; 

(b) y = x - [x]; 

(c) y = Vx- [x ]; 

(d) y = [x] + Vx - [x]; 

(e) y = Vx — [Vx], 0 £ x £ 9 . 

60 Express the number of squares < a positive number jc in 
terms of the bracket function defined in Problem 59. Do 
the same for the number of cubes < jc. 

61 If the symbol (jc) (read “brace jc”) denotes the distance 
from a real number jc to the nearest integer, graph the fol¬ 
lowing functions: 

(a) y = {x}; (b) y = (2x}\ 

(c) y = {4x}; (d) y = i(4x). 


SECTION 1.7 

62 Verify the given identities: 

(a) sin 40=8 sin 6 cos 3 6 - 4 sin 6 cos 0; 

(b) cos 40 = 8 cos 4 0-8 cos 2 0+1. 

63 Prove that 

sin m6 sin n6 — {[cos (m — n)6 — cos (m + n)6 ], 
sin md cos nO = {[sin ( m + n)6 + sin (m - n)6], 
cos md cos n6 = {[cos (m + n)6 + cos (m - n)0]. 

64 Given a triangle with angles A, B, C and sides a , b, c op¬ 
posite these angles, prove the law of sines : 

a _ b _ c 
sin A sin B sin C’ 

65 Show that the area of the triangle in the preceding prob¬ 
lem is 

\ab sin C = \ac sin B — \bc sin A. 

66 What is the area of an equilateral triangle whose side has 
length si 

67 What is the area of an isosceles right triangle whose hy¬ 
potenuse has length hi 

68 Prove that the sine of an angle inscribed in a circle of 
unit diameter equals the length of the chord of the sub¬ 
tended arc. Hint: First consider the case in which one 
side of the angle is a diameter and use the fact that the 
resulting triangle is a right triangle; then use the fact that 
all inscribed angles subtending the same arc are equal. 
Deduce both of these facts from the theorem that any in¬ 
scribed angle is half the corresponding central angle, and 
prove this theorem. 










THE DERIVATIVE 
OF A FUNCTION 


We begin our study of calculus with a brief statement of what the subject is about 
and why it is important. Such a bird’s-eye view of the road that lies ahead can 
help us attain a clarity of purpose and sense of direction that will serve us well 
among the many technical details that constitute the bulk of our work. 

Calculus is usually divided into two main parts, called differential calculus and 
integral calculus. Each of these parts has its own unfamiliar terminology, puz¬ 
zling notation, and specialized computational methods. Getting accustomed to all 
this takes time and practice, much like the process of learning a new language. 
Nevertheless, this fact should not prevent us from seeing at the beginning that 
the central problems of the subject are really quite simple and clear, with noth¬ 
ing strange or mysterious about them. 

Almost all the ideas and applications of calculus revolve around two geomet¬ 
ric problems that are very easy to understand. Both problems refer to the graph 
of a function y = f(x). We avoid complications by assuming that this graph lies 
entirely above the x-axis, as shown in Fig. 2.1. 

PROBLEM 1 The basic problem of differential calculus is the problem of tan¬ 
gents: Calculate the slope of the tangent line to the graph at a given point P. 

PROBLEM 2 The basic problem of integral calculus is the problem of areas: 

Calculate the area under the graph between the points x = a and x — b. 

Our work in the rest of this book will be focused on these two problems, on the 
ideas and techniques that have been developed for solving them, and on the ap¬ 
plications that arise from them.* 

At first sight these problems seem rather limited in scope. We expect them to 
shed significant light on geometry, and they do. What is very surprising is to find 


2.1 

WHAT IS CALCULUS? 
THE PROBLEM OF 
TANGENTS 



y m fU) 


area = : 


slope = ? 


a b x 

Figure 2.1 The essence of calculus. 


*For readers who are interested in the origins of words, a calculus in ancient Rome was a small stone 
or pebble used in counting and gambling, and the Latin verb calculare came to mean “to figure out,” 
“to compute,” “to calculate.” Today a calculus is a method or system of methods for solving quanti¬ 
tative problems of a particular kind, as in calculus of probabilities, calculus of finite differences, ten¬ 
sor calculus, calculus of variations, calculus of residues, etc. Our calculus — the branch of math¬ 
ematics consisting of differential and integral calculus taken together—is sometimes called the 
calculus to distinguish it from all these other subordinate calculuses. 
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that they also have many profound and far-reaching applications to the various 
sciences. Calculus pays its way in the great world outside of mathematics through 
these scientific applications, and one of our major purposes is to introduce the 
student to as wide a variety of them as possible. At the same time we will con¬ 
tinue to emphasize geometry and geometric applications, for this is the context 
in which the ideas of calculus are most easily understood 

It is sometimes said that calculus was “invented” by those two great geniuses 
of the late seventeenth century, Newton and Leibniz.* In reality, calculus is the 
product of a long evolutionary process that began in ancient Greece and contin¬ 
ued into the nineteenth century. Newton and Leibniz were indeed remarkable 
men, and their contributions were of decisive importance, but the subject neither 
started nor ended with them. The problems stated above were much on the minds 
of many European scientists of the middle seventeenth century—most notably 
Fermat—and considerable progress was made on each of them by ingenious spe¬ 
cial methods. It was the great achievement of Newton and Leibniz to recognize 
and exploit the close connection between these problems, which no one else had 
fully understood. Specifically, they were the first to grasp the significance of the 
Fundamental Theorem of Calculus, which says, in effect, that the solution of the 
tangent problem can be used to solve the area problem. This theorem—certainly 
the most important in the whole of mathematics — was discovered by each man 
independently of the other, and they and their successors used it to weld the two 
halves of the subject together into a problem-solving art of astonishing power 
and versatility. 

As these remarks suggest, we begin our work by undertaking a fairly thorough 
study of the tangent problem in the next four chapters. Then, in Chapters 6 and 
7, we turn to the area problem. From there we push outward in a number of di¬ 
rections, extending our basic concepts and tools to broader classes of functions 
with a greater variety of significant applications. 

Before attempting to calculate the slope of a tangent line, we must first decide 
what a tangent line is, and this is not as easy as it seems. 

In the case of a circle there is no difficulty. A tangent to a circle (Fig. 2.2, left) 
is a line that intersects the circle at only one point, called the point of tangency; 
lines that are not tangents either intersect the circle at two different points or miss 
it altogether. This situation reflects the clear intuitive idea most of us have that 
a tangent to a curve at a given point is a line that “just touches” the curve at that 

■"The Latin spelling “Leibnitz” is sometimes used in order to suggest the correct pronunciation, which 
is “LIBE-nits.” 



Figure 2.2 


2.2 HOW TO CALCULATE THE SLOPE OF THE TANGENT 
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point.* It also suggests the possibility of defining a tangent to a curve as a line 
that intersects the curve at only one point. This definition was used successfully 
by the Greeks in dealing with circles and a few other special curves, but for curves 
in general it is wholly unsatisfactory. To understand why, consider the curve 
shown on the right in Fig. 2.2: It has a perfectly acceptable tangent (the lower 
line) that this definition would reject, and an obvious nontangent (the upper line) 
that this definition would accept. 

The modern concept of a tangent line was originated by Fermat around 1630. 
As students will come to see, this concept is not only a reasonable statement 
about the geometric nature of tangents, it is also the key to a practical process 
for the construction of tangents. 

Briefly, the idea is this: Consider a curve y =/(*), and let P be a given fixed 
point on this curve (Fig. 2.3). Let Q be a second nearby point on the curve, and 
draw the secant line PQ. The tangent line at P can now be thought of as the lim¬ 
iting position of the variable secant as Q slides along the curve toward P. We 
shall see in Section 2.2 how this qualitative idea leads at once to a quantitative 
method for calculating the exact slope of the tangent in terms of the given func¬ 
tion/(x). 

Let there be no misunderstanding. This way of thinking about tangents is not 
a minor technical point in the geometry of curves. On the contrary, it is one of 
the three or four most fruitful ideas that any mathematician has ever had, for 
without it there would have been no concept of velocity or acceleration or force 
in physics, no Newtonian dynamics or astronomy, no physical science of any 
kind except as the mere verbal description of phenomena, and certainly no mod¬ 
ern age of engineering and technology. 


General discussions have their place, but the time has come to get down to de¬ 
tails. 

Let P = (jcq, yo) be an arbitrary fixed point on the parabola y = x 2 , as shown 
in Fig. 2.4. As our first illustration of the basic idea of this chapter, we calculate 
the slope of the tangent to this parabola at the given point P. To begin the process, 
we choose a second nearby point Q = (x\,y{) on the curve. Next, we draw the 


*The Latin word tangere means “to touch,” as in the English word “tangible.” 



Figure 2.3 Fermat’s idea. 
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secant line PQ which is determined by these two points. The slope of this secant 
is evidently 


m S ec = slope of PQ = ———. (1) 

X] - Xo 

Now for the crucial step: We let x\ approach jc 0 , so that the variable point Q ap¬ 
proaches the fixed point P by sliding along the curve—much like a bead sliding 
along a curved wire. As this happens, the secant changes direction and visibly 
approaches the tangent at P as its limiting position. Also, it is intuitively clear 
that the slope m of the tangent is the limiting value approached by the slope m sec 
of the secant. If we use the standard symbol —»to mean “approaches,” then the 
last statement can be expressed in the concise and convenient form 


m = lim 
Q->P 


Msec 


= lim iL^a. 

X\ ~ Xq 


( 2 ) 


The abbreviation “lim,” with “xi —»x 0 ” written below it, is read “the limit, as x\ 
approaches jcq, of. . . .” 

We cannot calculate the limiting value m in (2) by simply setting jci = xq , be¬ 
cause then yi = yo and this would give the meaningless result 


*o ~ *o 0‘ 


We must think of X\ as coming very close to xq but remaining distinct from it. 
However, as this happens, both y\ — yo and x\ — xq become arbitrarily small, and 
it isn’t at all clear what limiting value their quotient approaches. 

The way out of this difficulty is to use the equation of the curve. Since P and 
Q both lie on the curve, we have yo = xq 2 and y\ = x 2 , so (1) can be written 


= yi ~ yo = *i 2 - xq 2 

X\ - Xq X] - Xq ' 


(3) 


The reason this numerator becomes small is that it contains the denominator 
x\ — Xq as a factor. If this common factor is canceled, we obtain 

yi ~ yo *\ 2 - V Ui - *o)(*i + *o) 

m scc = -=-=-= + xo, 

x\ - Xo X\ - Xo Xi - xo 

and (2) becomes 



m = lim ——— = lim (x\ + x 0 ). 

•*i—X] Xq * i — 

It is now very easy to see what is happening: As x\ gets closer and closer to xq, 
x\ + xq becomes more and more nearly equal to xq + xq = 2xq. Accordingly, 

m = 2x 0 (4) 

is the slope of the tangent to the curve y = x 2 at the point (xq, yo)- 

Example 1 The points (1,1) and (— 7 , ^) lie on the parabola y = x 2 (Fig. 2.5). 
By formula (4), the slopes of the tangents at these points are m = 2 and m = — 1. 
Using the point-slope form of the equation of a line, our two tangent lines clearly 
have equations 
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y- i 

X 1 


= 2 


and 


X + T 




In exactly the same way, 


y ~ *o 


2 


X ~ X 0 


2x 0 


is the equation of the tangent at a general point (xq, x 0 2 ) on the curve. 


We now introduce a widely used piece of symbolism called the delta notation. 
The procedure just described begins by changing the independent variable x 
from a first value x 0 to a second value x\. The standard notation for the amount 
of such a change is Ax (read “delta x”), so that 

Ax = x, - x 0 (5) 


is the change in x in going from the first value to the second. We can also think 
of the second value as being obtained from the first by adding the change: 

X) = xo + Ax. (6) 

It is essential to understand that Ax is not the product of a number A and a num¬ 
ber x, but a single number called an increment of x. An increment Ax can be ei¬ 
ther positive or negative. Thus, if x 0 = 1 and x\ = 3, then Ax = 3 — 1 = 2; and 
if x 0 = 1 and x\ = — 2, then Ax = — 2 - 1 = - 3. 

The letter A is the Greek d\ when it is written in front of a variable, it signi¬ 
fies the difference between two values of that variable. This simple notational 
device turns out to be extremely convenient and has spread into almost every part 
of mathematics and science. We illustrate its role in our present work by using 
it to reformulate the above calculations. 

In view of (5) and (6), formula (3) for the slope of the secant can be written 
in the form 


AH sec 


*i 2 - *o 2 _ (xp + A *) 2 ~ Xp 1 
x\ — xq Ax 


(7) 


This time, instead of factoring the numerator, we expand its first term and sim¬ 
plify the result, obtaining 

(x 0 + Ax) 2 - x 0 2 = x 0 2 + 2x 0 Ax + (Ax) 2 - x 0 2 

= 2x 0 Ax + (Ax) 2 


= Ax(2x 0 + Ax), 


so (7) becomes 


m sec = 2 xq + Ax. 


If we insert this in (2) and use the fact that x\ —> x 0 is equivalent to Ax —» 0, we 
find that 


m = lim (2xo + Ax) = 2xo, 

A.v—»0 

as before. Again it is very easy to see what is happening in the indicated limit 
process: as Ax gets closer and closer to zero, 2x 0 + Ax becomes more and more 
nearly equal to 2 xq. 
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This second method, using the delta notation, depends on expanding the square 
(jco + Ax) 2 , whereas the first depends on factoring the expression xi 2 — xo 2 In 
this particular case neither calculation is noticeably harder than the other. In gen¬ 
eral, however, expanding is easier than factoring, and for this reason we adopt 
the method of increments as our standard procedure. 

The calculation that we have just carried out for the parabola y — x 2 can be 
described in principle for the graph of any function y =/(x) (Fig. 2.6). We first 
compute the slope of the secant through the two points P and Q corresponding 
to xo and xq + Ax, 

_ f{x o + Ax) - f{x q) 

m sec “ A v 


We then calculate the limit of m sec as Ax approaches zero, obtaining a number 
m that we interpret geometrically as the slope of the tangent to the curve at the 
point P : 


m = lim 

Ax—>0 


fix 0 + Ax) - fix p) 
Ax 


The value of this limit is usually denoted by the symbol /'(xo), read “f prime of 
xo,” in order to emphasize its dependence on both the point xo and the function 
fix). Thus, by definition we have 


fix 0 ) = lim 

Ax—>0 


fix o + Ax) -fjx o) 
Ax 


( 8 ) 


In this notation, the result of the calculation given above can be expressed as fol¬ 
lows: If fix) = x 2 , then /'(x») = 2 xq. 


Example 2 Calculate f'(xo) if fix) = 2x 2 — 3x. 


Solution For this function, the numerator of the quotient in (8) is 
/(x 0 + Ax) - fix o) = [2(x 0 + Ax) 2 - 3(x 0 + Ax)] -[2x 0 2 - 3x 0 ] 

= 2xo 2 + 4xo Ax -1- 2(Ax) 2 - 3xo - 3Ax -2xo 2 + 3xo 



Figure 2.6 


*o 


- 

x 0 + A.V 


X 
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= 4x 0 Ax + 2(Ax) 1 2 3 4 5 - 3Ax 
= Ax(4x 0 + 2Ax - 3). 
The quotient in (8) is therefore 

/(x<) + Ax) - /(x 0 ) _ 


Ax 


4xq + 2Ax - 3, 


and 


f (x 0 ) = lim (4x 0 + 2Ax - 3) 

Ax—>0 

= 4x 0 - 3. 



We have assumed in the remarks leading to (8) that the curve under discus¬ 
sion actually has a single definite tangent at the point P. This is a genuine as¬ 
sumption, because some curves do not have such a tangent at every point (Fig. 
2.7). However, when a tangent exists, it is clearly necessary for the secant PQ to 
approach the same limiting position whether Q approaches P from the right or 
from the left. These two modes of approach correspond, respectively, to At ap¬ 
proaching zero through only positive or only negative values. It is therefore part 
of the meaning of (8) that for this limit to exist we must have the same limiting 
value for both directions of approach. 


PROBLEMS 

1 Find the equation of the tangent to the parabola y = x 2 

(a) at the point (-2, 4); 

(b) at the point where the slope is 8; 

(c) if the x-intercept of the tangent is 2. 

2 Show that the tangent to the parabola y = x 2 at a point 
(xo, To) other than the vertex always has x-intercept yxo- 

3 A straight line y = mx + b is presumably its own tan¬ 
gent line at any point. Verify this by using formula (8) 
to show that/'(xo) = m if/(x) = mx + b. 

4 Sketch the graph of y = x - x 2 on the interval — 2 < 
x < 3. 

(a) Use the method of increments to compute the slope 
of the tangent line at an arbitrary point (x 0 , yo) on the 
curve. 

(b) What are the slopes of the tangent lines at the points 
(-1, -2), (0, 0), (1, 0), and (2, -2) on the curve? 
Use these slopes to draw the tangents at these points 
in your sketch. 

(c) At what point on the curve is the tangent horizontal? 

5 Use formula (8) to calculate /'(xo) if/(x) is equal to 

(a) x 2 - 4x - 5; (b) x 2 - 2x + 1; 

(c) 2x 2 -1-1; (d) x 2 - 4. 

The results of these calculations will be needed in Prob¬ 
lems 6-9. 


6 Sketch the given curve and the tangent line at the given 
point, and find the equation of this tangent line: 

(a) y = x 2 -4x- 5, (4,-5). 

(b) y = x 2 — 2x + 1, (-1,4). 

7 Find the equation of the tangent line to the curve y = 
2 x 2 -h 1 that is parallel to the line 8x -I- y — 2 = 0. 

8 Find the equations of the two lines through the point 
(3, 1) that are tangent to the curve y = x 2 - 4. Hint: Draw 
the graph, let ( a , a 2 — 4) be the point of tangency, and 
find a. 

9 Prove analytically (that is, without appealing to geomet¬ 
ric reasoning) that there is no line through the point 
(1, —2) that is tangent to the curve y = x 2 — 4. 

10 One of the two lines that pass through the point (2, 0) 
and are tangent to the parabola y = x 2 is the x-axis. Find 
the equation of the other line. 

11 Find equations for the two lines through the point (3, 13) 
that are tangent to the parabola y = 6x - x 2 . 

12 Draw the graph of y = /(x) = |x - 11. 

(a) I s there any point o n the graph at which there i s no 
tangent line? 

(b) Find /'(xo) if xo > 1. If To < 1. What can be said 
about /'(tq) if tq = 1? 






58 

2.3 

THE DEFINITION OF 
THE DERIVATIVE 



Figure 2.8 


THE DERIVATIVE OF A FUNCTION 


If we separate formula (8) in Section 2.2 from its geometric motivation, and also 
drop the subscript on xq, then we arrive at our basic definition: Given any func¬ 
tion fix), its derivative fix) is the new function whose value at a point x is de¬ 
fined by 


fix) = lim 


fix + Ax) -fix) 
Ax 


( 1 ) 


In calculating this limit, x is held fixed while Ax varies and approaches zero. 
The indicated limit may exist for some values of x and fail to exist for other 
values. If the limit exists for x = a , then the function is said to be differentiable 
at a. A differentiable function is one that is differentiable at each point of 
its domain. Most of the specific functions considered in this book have this 
property. 

We know that the derivative/'(X) can be visualized in the way suggested by 
Fig. 2.8, in which fix) is the variable height of a point P moving along the curve 
and f\x) is the variable slope of the tangent at P. Strictly speaking, however, 
the above definition of the derivative does not depend in any way on geometric 
ideas. Our thoughts about Fig. 2.8 constitute a geometric interpretation , and 
important as this may be as an aid to understanding, it is not an essential 
part of the concept of the derivative. In the next section we will meet other 
equally important interpretations that have nothing to do with geometry. We 
must therefore be prepared to consider fix) purely as a function, and to rec¬ 
ognize that it has several interpretations but no necessary connection with any 
one of them. 

The process of actually forming the derivative fix) is called the differentia¬ 
tion of the given function fx). This is the fundamental operation of calculus, 
upon which everything else depends. In principle, we merely follow the compu¬ 
tational instructions specified in (1). These instructions can be arranged into a 
systematic procedure called the three-step rule. 


step 1 Write down the difference fx + Ax) — fix) for the particular function 
under consideration, and if possible simplify it to the point where Ax is a factor. 


step 2 Divide by Ax to form the difference quotient 

fix + Ax) -fix) 

Ax 

and manipulate this to prepare the way for evaluating its limit as Ax—» 0. In most 
of the examples and problems of the present chapter, this manipulation involves 
nothing more than canceling Ax from the numerator and denominator. 


step 3 Evaluate the limit of the difference quotient as Ax —> 0. If Step 2 has 
accomplished its purpose, a simple inspection is usually all that is needed here. 


If we remember that the innocent-looking notation fix) encompasses all con¬ 
ceivable functions, then we will understand that these steps are sometimes easy 
to carry out and sometimes very difficult. The following examples depend only 
on elementary algebra, but even this requires a little knowledge and ingenuity. 
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Example 1 Find /'(jc) if f(x) = jc 3 . 
step 1: 

f(x + Ax) - f(x) = (jc + Ax) 3 -x 3 

= jc 3 + 3x 2 Ax + 3jc(Ajc) 2 + (Ax) 3 - x 3 
= 3x 2 Ax + 3jc(Ajc) 2 + (Ajc) 3 
= Ajc[3jc 2 + 3jc Ajc + (Ajc) 2 ]. 


step 2: 


f(x + Ax) -f(x) 
Ax 


3x 2 + 3jc Ajc + (Ajc) 2 . 


step 3: 


f'(x) = lim [3 jc 2 + 3jc Ajc + (Ajc) 2 ] = 3jc 2 . 


Example 2 Find/'(;c) if /(jc) = \/x. 
step 1: 


/(jc + Ajc) -/(jc) = 


1 

jc + Ajc 


1 


JC 


jc - (jc + Ajc) _ -Ajc 
jc(jc + Ajc) jc(jc + Ajc) ‘ 


step 2: 


f(x + Ax)-f(x) = -1 

Ajc jc(jc + Ajc) ‘ 


step 3: 


f'(x) = lim 


jc(x + Ajc) 


Let us briefly consider what the result of Example 2 can tell us about the graph 
of the function y = f(x) = \/x. First, f(x) = — 1 /jc 2 is clearly negative for all 
jc =£ 0, and since this is the slope of the tangent, all tangent lines point down to 
the right. Further, when jc is near 0, /'(jc) is very large, which means that these 
tangent lines are steep; and when jc is large, /'(jc) is small, so these tangent lines 
are nearly horizontal. It is instructive to verify our observations by examining 
Fig. 1.28. Generally speaking, derivatives are capable of telling us a great deal 
about the behavior of functions and the properties of their graphs, since the de¬ 
rivative at a point gives the slope of the tangent at that point. We explore this 
topic more fully in Chapter 4. 
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Example 3 Find/'(x) if/(x) = Vx. 
step 1: 

/(x + Ax) - /(x) = Vjc + Ax — Vx. 


step 2: 

/(jc+ Ajc) ~/(jc) _ Vx + Ax - Vx 
Ajc Ax 


This is not in a form that is convenient for canceling the Ajc’s, so we use an in¬ 
genious algebraic trick to remove the square roots from the num erator. W e mul¬ 
tiply both numerator and denominator of the last fraction by Vjc + Ajc + Vic, 
which amounts to multiplying this fraction by 1, and then we simplify the nu¬ 
merator by using the fact expressed in the algebraic identity (a — b)(a + b) = 
2 / 


/(jc+ Ajc)-/(jc) 



Vx + Ax - Vx Vx -F Ax + \Tx 
Ax Vx + Ax + Vx 

(x + Ax) - x _1_ 

Ax(Vx + Ax + Vx) Vx + Ax + Vc 


Now the next step is easy. 
step 3: 

1 1 1 

/'(x) = lim - ■ - p = = — p. 

a.x->o Vx + Ax + Vx Vx + Vx 2Vx 


REMARKS ON NOTATION 

There is a slightly disconcerting feature of calculus that we might as well con¬ 
front here. It is the fact that several different notations are in common use for 
derivatives, with preference shifting from one to another according to the cir¬ 
cumstances in which the symbols are being used. Some may ask, What does it 
matter which symbols are used? The fact is that it matters a great deal, for good 
notations can smooth the way and do much of our work for us, while bad ones 
create a swamp under our feet through which easy movement is difficult. 

The derivative of a function/(x) has been denoted above by f\x). This nota¬ 
tion has the merit of emphasizing that the derivative of/(x) is another function 
of x which is associated in a certain way with the given function. If our function 
is given in the form y =/(x), with the dependent variable displayed, then the 
shorter symbol y' is often used in place of/'(x). 

The main disadvantage of this prime notation for derivatives is that it doesn’t 
suggest the nature of the process by which/'(x) is obtained from/(x). The no¬ 
tation devised by Leibniz for his version of calculus is better in this respect, and 
in other ways as well. 

To explain Leibniz’s notation, we begin with a function y = /(x) and write the 
difference quotient 

/(x + Ax) -f(x) 

Ax 
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in the form 


Ay 

Ax’ 


where Ay =/(;c + Ajc) —/(;c). Here Ay is not just any change in y; it is the spe¬ 
cific change that results when the independent variable is changed from jc to jc + 
Ajc. As we know, the difference quotient Ay/Ajc can be interpreted as the ratio of 
the change in y to the change in jc along the curve y = fix), and this is the slope 
of the secant (Fig. 2.9). Leibniz wrote the limit of this difference quotient, which 
of course is the derivative /'(jc), in the form dy/dx (read “dy over dx ”). In this no¬ 
tation, the definition of the derivative becomes 


^ = lim 
dx a*->o Ajc 


( 2 ) 


and this is the slope of the tangent in Fig. 2.9. Two slightly different equivalent 
forms of dy/dx are 


df(x) 

dx 


and 



In the second of these, the symbol d/dx should be thought of as an operation 
which can be applied to the function /(jc) to yield its derivative /'(jc), as suggested 
by the equation 


dx 


fix) = fix). 


The symbol d/dx can be read “the derivative with respect to x of . . whatever 
function of jc follows it. 

It is important to understand that dy/dx in (2) is a single indivisible symbol. 
In spite of the way it is written, it is not the quotient of two quantities dy and dx, 
because dy and dx have not been defined and have no independent existence. In 
Leibniz’s notation, the formation of the limit on the right of (2) is symbolically 
expressed by replacing the letter A by the letter d. From this point of view, the 
symbol dy/dx for the derivative has the psychological advantage that it quickly 
reminds us of the whole process of forming the difference quotient Ay/ Ajc and 
calculating its limit as Ajc^ 0. There is also a practical advantage, for certain 
fundamental formulas developed in the next chapter are easier to remember and 
use when derivatives are written in the Leibniz notation. 

But good though it is, this notation is not perfect. For instance, suppose we 
wish to write down the numerical value of the derivative at a specific point, say 
jc = 3. Since dy/dx doesn’t display the variable jc in the convenient way that/'(■*) 
does, we are forced into using some such clumsy notation as 




or 



The clear and concise symbol/'(3) is obviously superior to these awkward ex¬ 
pressions. 

As we have seen, each of the notations described above is good in its own way. 
All are widely used in the literature of science and mathematics, and to help the 
student become thoroughly familiar with them, we shall use them freely and in¬ 
terchangeably from now on. 
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PROBLEMS 


1 Use the three-step rule to show that if/(jc) = ax 2 + bx + 
c, then /'(jc) = 2ax + b. 

Use the general rule in Problem 1 to write down the indicated 
derivatives in Problems 2-16. 

2 f(x) = 5* - 3; find /'(*). 

3 g(x) = 50 - 8x 2 ; find g'(x). 

4 h(x) = jc(50 - jc); find h\x). 

5 F(x) = 20 - 72jc; find F'(x). 

6 G{x) = 3jc 2 + 5jc — 7; find G'(jc). 

7 //(jc) = 5 - 10jc + 15jc 2 ; find //'(jc). 

8 y = 5 jc 2 — 9jc + 15; find dyldx. 

9 x = 3y 2 + ly — 6; find dx/dy. 

10 u = It 2 - 1 lr + 109; find duldt. 

11 = 7jc(500 - jc); find diidx. 

12 w = -7z 2 + 22z - 71; find dw/dz. 

13 /(jc) = 5jc(jc + 5); find/'(*). 

14 y = 5 jc - (jc/ 10) 2 ; find dyldx. 

15 g(jc) = 3 - (4 jc 4- 5) 2 ; find g'(jc). 

16 h(x) = (3jc - 2) 2 - 5jc; find h'(x). 

In Problems 17-22, find all points on the curve y =/(j c) at 
which the tangent is horizontal. 

17 y = 6 —jc 2 . 

18 y = 6jc — jc 2 . 

19 y = jc 2 - 6jc + 9. 

20 y = jc 2 + jc — 5. 

21 y = jc(20-Jc). 

22 y = jc - (jc/20) 2 . 

In Problems 23-38, use the three-step rule to calculate/'( jc) 
if /(jc) is equal to the given expression. 


23 

5x - jc 3 . 

24 

X 3 + lx 2 

25 

2* 3 — 3x 2 + 6jc — 5. 

26 

x\ 

27 

1 

jc - . 

28 

1 


JC 


3jc + 2’ 

29 

JC 

30 

5 — 2jc 




jc + r 


13 + jc' 

31 

1 

JC 2 ' 

32 

1 

JC 3 ' 


33 

35 


1 

jc 2 + r 
2x 

jc 2 - r 


34 


3 

2 + jc 2 ' 


36 V2x. 


37 VT^T. 38 2V / 5 _ ^-"jc. 

39 Consider the part of the curve y = 1 /jc that lies in the first 
quadrant, and draw the tangent at an arbitrary point (jco, 
yo) on this curve. 

(a) Show that the portion of the tangent which is cut off 
by the axes is bisected by the point of tangency. 

(b) Find the area of the triangle formed by the axes and 
the tangent, and verify that this area is independent 
of the location of the point of tangency. 

40 Find /'(jc) if /(jc) = jc 3 — 3jc. Use this result to verify the 
positions of the high and low points on the curve y = 
jc 3 - 3jc that are shown in Fig. 1.26. Hint: At the high 
and low points the tangent is horizontal. 

41 Graph the function y = /(jc) = |jc| + jc, and prove that this 
function is not differentiable at jc = 0. Hint: In formula 
(1), first take Ajc positive, obtaining one limiting value; 
then take Ajc negative, obtaining a different limiting 
value. In a situation of this kind we say that the function 
has a right derivative and a left derivative , but not a de¬ 
rivative. 

42 If /(jc) = 2 jc 2 - 5, find /'(2) and use it to write the equa¬ 
tion of the tangent line to the parabola y = 2* 2 - 5 at 
the point (2, 3). 

43 If g(x) = 3 — 2jc 3 , find g'(0) and use it to write the equa¬ 
tion of the tangent line to the curve y = 3 — 2 jc 3 at the 
point (0, 3). 

44 Ifh(x) = Vjc + 5, find h'( 4) anduseitto w rite the equa- 
tion of the tangent line to the curve y = Vjc + 5 at the 
point (4, 3). 

45 Find the point on the graph of y =x 2 that is closest to 
the point (0, 3). Hint: Draw the graph, let (a, a 2 ) be the 
point, and find a as a root of a certain cubic equation that 
can be solved by inspection. 


2.4 

VELOCITY AND RATES 
OF CHANGE. NEWTON 
AND LEIBNIZ 


The concept of the derivative is closely related to the problem of computing the 
velocity of a moving object. It was this fact that made calculus an essential tool 
of thought for Newton in his efforts to uncover the principles of dynamics and 
understand the motions of the planets. It might appear that only students of physics 
would find it worthwhile to concern themselves with precise ideas about veloc¬ 
ity. However, we shall see that these ideas provide a fairly easy introduction to 
the general concept of rate of change, and this concept is important in many other 
fields of study, including the biological and social sciences. 

In this section we consider a special case of the general velocity problem: that 
in which the object in question can be thought of as a point moving along a 
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straight line, so that the position of the point is determined by a single coordi¬ 
nate ^ (Fig. 2.10). The motion is fully known if we know where the moving point 
is at each moment, that is, if we know the position s as a function of the time r, 

s=ftt). (1) 

The time is usually measured from some convenient initial moment t = 0. 

Example 1 Consider a freely falling object, say a rock dropped from the edge 
of a cliff 400 ft high (Fig. 2.11). It is known from many experiments that this 
rock falls 

s= \6t 2 (2) 

feet in t seconds. We see that when t = 5, s = 400. The rock therefore hits 
the ground 5 seconds after it starts to fall, and formula (2) is valid only for 0 < 
t < 5. 


Two basic questions can be asked about the motion described in this example. 
First, what is meant by the velocity of the falling rock at a given instant? And 
second, how can this velocity be computed from (2)? 

We are all familiar with the idea of velocity in its everyday sense, as a num¬ 
ber measuring the rate at which distance is being traversed. We speak of walk¬ 
ing 3 miles per hour (mi/h), driving 55 mi/h, and so on. We also speak of aver¬ 
age velocities, and these are the numbers we usually compute. If we drive a 
distance of 200 mi in 5 hours, then our average velocity is 40 mi/h, because 

distance traveled 200 mi 

-—: —-= ——— = 40 mi/h. 

elapsed time 5 h 

In general, 

. distance traveled 

average velocity = —;-—-, 

elapsed time 

and this is a formula almost everyone knows. 


Example 1 (continued) The position function for the falling rock, fit) = 16r 2 , 
tells us that in the first second after the rock is released it falls/(l) = 16 ft, in 
the first 2 seconds/(2) = 64 ft, in the first 3 seconds/(3) = 144 ft, and so on. 
The average velocities during each of the first 3 seconds of fall are therefore 


Y = 16 ft/s. 


64 - 16 
1 


= 48 ft/s, 


and 


144 - 64 
1 


= 80 ft/s. 


The rock is clearly falling faster and faster from moment to moment, but we still 
do not know exactly how fast it is falling at any given instant. 

To find the velocity v of the rock at a given instant t, we proceed as follows. 
In the time interval of length A t between t and a slightly later instant t + Af, the 
rock falls a distance As (see Fig. 2.11). The average velocity during this interval 
is the quotient As/At. When At is small, this average velocity is close to the ex¬ 
act velocity v at the beginning of the interval; that is, 
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where the symbol = is read “is approximately equal to.” Further, as A t is made 
smaller and smaller, this approximation gets better and better, so we have 


As 

v = Inn —. 
Af—>0 A t 


(3) 


Our point of view here is that the velocity v is a direct intuitive concept, and (3) 
shows us how to compute it. However, it is also possible to regard (3) as the def¬ 
inition of the velocity, with the preceding remarks serving as motivation. The 
limit in (3) is clearly the derivative ds/dt, and carrying out the details we have 

ds ,. As 
v = — = lim — 
dt A/—>o Aj 

.. 16(f + AO 2 - 16r 2 

= lim --- 

Ar—>o Af 

= lim (32r+ 16Ar) = 32 1. 

Af—»0 


This formula tells us that the velocity of the rock after 1, 2, and 3 seconds of fall 
is 32, 64, and 96 ft/s, and also that the rock hits the ground at 160 ft/s. We no¬ 
tice that the velocity increases by 32 ft/s during each second of fall. This fact is 
usually expressed by saying that the acceleration of the rock is 32 feet per sec¬ 
ond per second (ft/s 2 ), or, in the metric system, 9.80 meters per second per sec¬ 
ond (m/s 2 ). 


The reasoning used in this example is valid for any motion along a straight 
line. For the general motion (1), we therefore calculate the velocity v at time t 
in exactly the same way; that is, we approximate v more and more closely by the 
average velocity over a shorter and shorter interval of time beginning at the in¬ 
stant v. 


v = lim 
Ar—>0 


As 

~— = lim 
A t A/—>0 


/(f + Ar) - fit) 
A t 


We recognize this as the derivative of the function 5 = /(r), so the velocity' of a 
point moving on a straight line is simply the derivative of its position function, 


ds , 


Sometimes this is called the instantaneous velocity, in order to emphasize that it 
is calculated at an instant t. However, once this point has been made, it is cus¬ 
tomary to omit the adjective. The words “velocity” and “speed” are used inter¬ 
changeably in everyday speech, but in mathematics and physics it is useful to 
distinguish them from one another. The speed of our point is defined to be the 
absolute value of the velocity, 


speed = \v\ = 


ds 

~dt 


= I/'Ml- 


The velocity may be positive or negative, depending on whether the point is mov¬ 
ing along the line in the positive or negative direction; but the speed, being the 
magnitude of the velocity, is always positive or zero. The concept of speed is 
particularly useful in studies of motion along curved paths, for it tells us how 
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fast the point is moving regardless of its direction. In our everyday experience, 
we learn the speed of a car at any moment by looking at the speedometer. 

Example 2 Consider a projectile fired straight up from the ground with an ini¬ 
tial velocity of 128 ft/s. This projectile moves up and down along a straight line. 
However, the two parts of its path are shown slightly separated in Fig. 2.12, for 
the sake of visual clarity. Let 5 = f(t) be the height in feet of the projectile t sec¬ 
onds after firing. If the force of gravity were absent, the projectile would con¬ 
tinue moving upward with a constant velocity of 128 ft/s, and we would have 
s = f(t) = 128r. However, the action of gravity causes it to slow down, stop mo¬ 
mentarily at the top of its flight, and then fall back to earth with increasing speed. 
Experimental evidence suggests that the height of the projectile during its flight 
is given by the formula 

5 = fit) =128 1- 16 1 2 . (4) 


t - 4, v = 0 
5 = 256 


/ \ 

i I 

L , I 

t = 2, v = 64 i ' T I t = 6, v = -64 

5=192 i| l* 


T 5=192 


Tit 


I I 
1 

'/////////^^^^ 

Figure 2.12 


If we write this in the factored form 5 = 16f(8 — t), we see that 5 = 0 when 
t — 0 and when t — 8. The projectile therefore returns to earth 8 seconds after it 
starts up, and (4) is valid only for 0 < t < 8. 

To learn more about the nature of this motion, it is necessary to know the ve¬ 
locity. If the general rule for computing derivatives of second-degree polynomi¬ 
als is applied to (4), we find that the velocity at time t is 

v= -j- = 128 - 32/. (5) 

dt 


At the top of its flight the projectile is momentarily at rest, and therefore v = 0. 
By (5), t = 4 when v = 0; and by (4), 5 = 256 when t — 4. In this way we find 
the maximum height reached by the projectile and the time required to reach this 
height (see Fig. 2.12). As t increases from 0 to 8, it is clear from (5) that ^de¬ 
creases from 128 ft/s to -128 ft/s; in fact, ^decreases by 32 ft/s during each 
second of flight, and this is expressed by saying that the acceleration is — 32 feet 
per second per second (ft/s 2 ). We notice explicitly that the velocity is positive 
from t = 0 to t = 4, when 5 is increasing; and it is negative from t = 4 to t = 8, 
when s is decreasing. In particular, it is easy to see from (5) that v = 64 ft/s when 
t — 2 and v — -64 ft/s when t = 6 (the speed is 64 ft/s at both times). 


Velocity is an example of the concept of rate of change, which is basic for all 
the sciences. For any function y = /(;c), the derivative dy/dx is called the rate of 
change of y with respect to x. Intuitively, this is the change in y that would be 
produced by an increase of one unit in x if the rate of change remained constant 
(Fig. 2.13). In this terminology, velocity is simply the rate of change of position 
with respect to time. When time is the independent variable, we often omit the 
phrase “with respect to time” and speak only of the “rate of change.” 

Example 3 (a) We know that velocity is important in studying the motion of 

a point along a straight line, but the way the velocity changes is also important. 
By definition, the acceleration of a moving point is the rate of change of its ve¬ 
locity v, 
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Figure 2.14 



(b) Suppose that water is being pumped into the conical tank shown in Fig. 
2.14 at the rate of 5 ft 3 /min. If V denotes the volume of water in the tank at time 
r, then 


The rate of change of the depth x is the derivative dx/dt, and this is not constant. 
It is intuitively clear that this rate of change is large when the area of the surface 
of the water is small, and becomes smaller as this area increases. 

(c) In economics, the rate of change of a quantity Q with respect to a suit¬ 
able independent variable is usually called marginal Q. Thus we have marginal 
cost, marginal revenue, marginal profit, etc. If C{x) is the cost of manufacturing 
x pieces of a product, then the marginal cost is dC/dx. In most cases x is a large 
number, so 1 is small compared with jc and dC/dx is approximately C(x + 1 ) — 
C{x). For this reason, many economists describe marginal cost as “the cost of 
producing one more piece.” In Section 4.7 we discuss in some detail the appli¬ 
cations of calculus to economics. 

(d) We know that the area A of a circle in terms of its radius r is given by 
the formula A = rrr 2 , and the derivative of this function is easy to compute by 
the three-step rule: 


This says that the rate of change of the area of a circle with respect to its radius 
equals its circumference. To understand the geometric reason for this remarkable 
fact, let A r be an increment of the radius and A A the corresponding increment 
of the area (Fig. 2.15). It is clear that AA is the area of the narrow band around 
the circle, and this is approximately the product of the circumference 2irr and 
the width A r of the band. The difference quotient AA/Ar is therefore close to 
27rr, and by letting Ar—» 0 we obtain (6) by geometric reasoning. 


We have introduced two topics in this section: velocity, which is the rate of 
change of the position of a moving object, and rates of change in general, in¬ 
cluding acceleration, which is the rate of change of velocity. These are themes 
of major importance for calculus, and we shall return to them repeatedly through¬ 
out the rest of this book. 



NOTE ON NEWTON AND LEIBNIZ 


^3 As we suggested earlier, the historical roots 


of calculus lie deep in antiquity — in the mind of Archimedes 
(287-212 B.C.)—and also in several profound ideas of Fer¬ 
mat in the early seventeenth century. Nevertheless, the ac¬ 
tual discovery of calculus can be credited jointly to Isaac 
Newton (1642-1727) in England and Gottfried Wilhelm 
Leibniz (1646-1716) in Germany. 


Newton’s discovery is perhaps the only benefit that the 
Great Plague of London conferred upon humanity. This 
plague—which killed more than 75,000 people out of a total 
population of about 500,000—spread out from London into 
other parts of the country and forced the closing of Cam¬ 
bridge University in 1665. The young Newton left Trinity Col¬ 
lege and returned to his family farm in northern England for 
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two years of rustic solitary meditation. During this enforced 
long vacation his genius burst into flame: He discovered the 
binomial series for negative and fractional exponents; dif¬ 
ferential and integral calculus; universal gravitation as the 
key to the mechanism of the solar system; and the resolu¬ 
tion of sunlight into the visual spectrum by means of a prism, 
with its implications for understanding the colors of the rain¬ 
bow and the nature of light in general. At the age of 22 to 
23, in this miraculous period of his youth, he began the life- 
work in which he virtually created modem physical science. 
From the perspective of more than 300 years we see that he 
had a deeper influence on the direction of civilized life than 
the rise and fall of nations. When Newton died, England 
went into national mourning, and the poet Alexander Pope 
wrote his epitaph as follows: “Nature and Nature’s laws lay 
hid in night: God said, Let Newton be! and all was light.” 

But our concern here is with his mathematics. Newton 
usually thought of functions in terms of motion, and what 
we refer to as “differential and integral calculus” he called 
“the direct and inverse method of fluxions.” His treatise on 
these subjects was written in 1671 but remained unpublished 
until 1736, nine years after his death. His ideas became known 
in a limited way through correspondence and conversations 
with his friends, and were mentioned in vague general terms 
in his great treatise of 1687 on the laws of motion and the 
astronomy of the solar system.* Newton was pathologically 
secretive—as private as a snail in its shell. He had none of 
the itch to publish that afflicts the modern world, and most 
of his great works had to be dragged out of him by the ca¬ 
jolery and persistence of his friends. He was especially un¬ 
willing to publish anything about his mathematical discov¬ 
eries, and it is not surprising that Leibniz and other 
mathematicians on the Continent, though starting a few years 
later, soon caught up with Newton and passed beyond him. 

Mathematics and physics were only a small part of New¬ 
ton’s intellectual life, and he left these occupations behind 
him at the age of about 45. Throughout his life he spent 
thousands of hours of thought on theology and alchemy, and 


*Philosophiae Naturalis Principia Mathematica (The Mathematical 
Principles of Natural Philosophy), usually called the Principia. When 
this work was published, it was immediately recognized as one of the 
supreme achievements of the human mind. It is still universally con¬ 
sidered to be the greatest contribution to science ever made by one man. 


wrote bushels of manuscripts recording his studies of these 
subjects. In his later years he moved to London and became 
Master of the Mint. He was knighted for his success in sta¬ 
bilizing the British currency by catching and executing many 
counterfeiters with relentless and ferocious efficiency. A 
very strange man, unlike any other, and the more we learn 
about him the stranger he seems. 

Newton’s great rival Leibniz was a man of transcendent 
genius who made creative contributions across the entire 
spectrum of human knowledge. He is equally famous as 
mathematician and philosopher, and the graduate department 
of philosophy in every respectable university offers a course 
on Leibniz. He was also a lawyer, diplomat, historian, li¬ 
brarian, physicist, geologist, logician, theologian, landscape 
architect, economist, and much else. He spent most of his 
life in the service of the successive Dukes of Brunswick at 
Hanover in northern Germany, working as court historian 
and librarian. Without his researches as historian and ge¬ 
nealogist, his employer the Elector George Louis of Hanover 
could never have become George I, the first German King 
of England; and George’s descendants, including Queen Vic¬ 
toria and the present royal family of Britain (known as the 
House of Windsor since 1917) would never have been heard 
of. His ideas about the purposes and organization of schol¬ 
arly libraries were so farsighted that the Director and Prin¬ 
cipal Librarian of the British Museum from 1959 to 1968 
called him “the greatest librarian of his age.” 

He founded the Berlin Academy of Sciences and also Acta 
Eruditorum, the most influential European journal of the 
time in science and mathematics, and he was its editor-in- 
chief for many years. In this journal he published the first 
accounts of his version of calculus, in 1684 and 1686. He 
had started his mathematical work in 1673, eight years af¬ 
ter Newton, and in 1675 he invented the basic notations dy/dx 
and fy dx His early publications had little effect in Ger¬ 
many or England, but in Switzerland the Bernoulli brothers 
eagerly absorbed Leibniz’s ideas and methods and con¬ 
tributed many of their own. Calculus grew rapidly from 1690 
on and reached roughly its present state around 1800. How¬ 
ever, subtle difficulties in the theory of calculus were not 
fully settled until the twentieth century. 


iThe latter notation will be introduced in Section 5.3. 


PROBLEMS 

According to Problem 1 of Section 2.3, the general quadratic 
function 

5 = f{x ) = a t 2 + bt + c Each of the formulas in Problems 1-7 below describes the 

motion of a point along a horizontal line whose positive di- 
has derivative rection is to the right. In each case use the result stated here 


^=f(t) = 2at + b. 
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to write down the velocity v = dsldt by inspection. Also, find 
{a) the times when the velocity is zero, so that the point is 
momentarily at rest; and ( b ) the times when the point is mov¬ 
ing to the right. 

1 5 = 3t 2 - \2t + 7. 2 s=\ - 6t- t 2 . 

3 £ = 2f 2 + 28f - 6. 4 £ =-19 + lOr - 5r 2 . 

5 £ = It 2 + 2. 6 £ = 2 + It. 

7 £ = (2f - 6) 2 . 

8 Two points start from the origin on the £-axis at time t = 
0 and move along this axis in accordance with the for¬ 
mulas 

£ | = t 2 - 6t and £2 = 9t — 21 2 , 

where £| and £2 are measured in feet and t in seconds. 

(a) When will the two points have the same speed? 

(b) What are the velocities of the two points at the times 
when they have the same position? 

9 A camera is accidentally knocked off a ledge on the 
World Trade Center in New York City and falls to the 
ground below. The ledge is 784 ft above the ground. The 
camera falls a distance of £ = 16 1 2 feet in t seconds. 

(a) How long does the camera fall before it hits the 
ground? 

(b) What is the average velocity at which the camera falls 
during the first 3 seconds? 

(c) What is the average velocity at which the camera falls 
during the last 3 seconds? 

(d) What is the instantaneous velocity of the camera 
when it hits the ground? 

10 A point moves along a straight line in such a way that 
after t seconds its distance from the origin is £ = 6 1 2 + 
2 1 feet. 

(a) Find the average velocity between t = 3 and t = 6. 

(b) Find the instantaneous velocity when t = 3. 

(c) Find the instantaneous velocity when t = 6. 

11 Consider the function y = 3x 2 + 4. 

(a) Find the average rate of change of y with respect to 
x between the points x = 1 and jc = 3. 

(b) Find the instantaneous rate of change of y with re¬ 
spect to x at the point jc = 1 . 


(c) Find the instantaneous rate of change of y with re¬ 
spect to jc at the point jc = 3. 

12 Consider the function y = —— —- (see Problem 33 in 

Section 2.3). * + 

(a) Find the average rate of change of y with respect to 
jc between the points jc = — 1 and jc = 1 . 

(b) Find the instantaneous rate of change of y with re¬ 
spect to jc at the point jc = - 1 . 

(c) Find the instantaneous rate of change of y with re¬ 
spect to jc at the point jc = 1 . 

13 Starting from rest, a certain car moves £ feet i n t seconds, 
where £ = A At 2 . How long does it take the car to reach 
the velocity of 60 mi/h (= 88 ft/s)? 

14 Assume that a projectile fired straight u p from the ground 
with an initial velocity of v 0 ft/s reaches a height of s 
feet in t seconds, where 

s — v%t — 16 1 2 . 

(a) Find the velocity v at time t. 

(b) How much time is required for the projectile to reach 
its maximum height? 

(c) What is the maximum height? 

(d) What is the velocity when the projectile hits the 
ground? 

(e) What must the initial velocity be for the projectile to 
hit the ground 15 seconds after firing? 

15 An oil tank is to be drained for cleaning. If there are V 
gallons of oil left in the tank t minutes after the draining 
begins, where V = 40(50 - f) 2 , find 

(a) the average rate at which oil drains out of the tank 
during the first 20 minutes; 

(b) the rate at which oil is flowing out of the tank 20 
minutes after the draining begins. 

16 Consider a square of area A and side £, so that A = £ 2 . 
If jc = j£, use the idea of Example 3 d to make a conjec¬ 
ture about the value of dA/dx. Verify your conjecture by 
calculation. 

17 Suppose a balloon of volume V and radius r is being in¬ 
flated, so that V and r are both functions of the time t. If 
dV/dt is constant, what can be said (without calculation) 
about the behavior of dr/dt as r increases? 


2.5 

THE CONCEPT OF 
A LIMIT TWO 
TRIGONOMETRIC 
LIMITS 


It is evident from the preceding sections that the definition of the derivative rests 
on the concept of the limit of a function, which we have freely used with only 
the briefest explanation. Now that we understand the purpose of this concept, the 
time has come to examine its meaning with somewhat more care and attention. 

Let us consider a function /(jc) that is defined for all values of jc near a point 
a on the jc-axis but not necessarily at a itself. Suppose there exists a real num¬ 
ber L with the property that /(jc) gets closer and closer to L as jc gets closer and 
closer to a (Fig. 2.16). Under these circumstances we say that L is the limit of 
f(x) as x approaches a , and we express this symbolically by writing 
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lim fix) = L. 

X — 


( 1 ) 


If there is no real number L with this property, we say that /(jc) has no limit as 
x approaches a , or that \im x ^ a f(x) does not exist. Another widely used notation 
equivalent to (1) is 

fix) —> L as x —> a, 


which is read “/(jc) approaches L as jc approaches a ” In thinking about the mean¬ 
ing of (1), it is essential to understand that it does not matter what happens to/(jc) 
when x equals a\ all that matters is the behavior of/(jc) for jc’s that are near a. 

These informal descriptions of the meaning of (1) are helpful to the intuition 
and are adequate for most practical purposes. Nevertheless, they are too loose 
and imprecise to be acceptable as definitions, because of the vagueness of such 
expressions as “closer and closer” and “approaches.” The exact meaning of (1) 
is too important to be left mainly to the student’s imagination, and at the risk of 
being overly technical, we will try to give a satisfactory definition as briefly and 
clearly as possible. For the next few paragraphs we ask students to read even 
more carefully and thoughtfully than usual, and to suspend their natural impa¬ 
tience with what appears to be excessive, nit-picking precision. 

We begin by analyzing a specific example with the hope of extracting the 
essence of the general situation: 


lim 

x —>0 


lx 2 + X 
X 



Figure 2.16 


Here the function we must examine is 


y =/M 


lx 1 + X 

X 


This function is not defined for x = 0, and for x A 0 its values are given by the 
simpler expression 


fix) = 


x(2x + 1) 
x 


= 2x + 1. 


If we examine the graph (Fig. 2.17), it is clear that/(jc) is close to 1 when jc is 
close to 0. In order to give a quantitative description of this qualitative behavior, 
we need a formula for the difference between /(jc) and the limiting value 1: 

fix) - 1 = (2jc + 1) - 1 = 2jc. 

We see from this formula that/(jc) can be made as close as we please to 1, that 
is, this difference can be made as small as we please , by taking x sufficiently 
close to 0. Thus, 

fix) ~ 1 = W When X = 2 

fix) - 1 = 1^0 when X = 2 ^ 00 , 

and so on. More generally, let e (epsilon) be any positive number given in ad¬ 
vance, no matter how small, and define 8 (delta) by 8 = {e . Then the distance 
from/(jc) to 1 will be smaller than e, provided only that the distance from jc to 
0 is smaller than <5; that is, 

if |;t| < 8 = \e then |/(jc) — 11 = 2|jc| < e. 



This assertion is much more precise than the vague statement that/(jc) is “close” 
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to 1 when x is “close” to 0. It tells us exactly how close x must be to 0 in order 
to guarantee that f{x) will attain a previously specified degree of closeness to 1. 
Of course, x is not permitted to equal 0 here, because f(x) has no meaning for 
x = 0. 

The so-called epsilon-delta definition of the meaning of (1) should now be 
easy to grasp. The defining condition is this: 



For each positive number e there exists a positive number 8 with the property that 

I fix) -L\<e 

for any number x =£ a that satisfies the inequality 

\x — a\ < 8. 



In words: If e > 0 is given, then 8 > 0 can be found with the property that f(x) 
will be “e-close” to L whenever x is “5-close” to a. As usual, we are concerned 
only with the behavior of f(x) near the point x = a, and not at all with what hap¬ 
pens at x = a. 

It may be helpful to students if we interpret these ideas in terms of the graph 
of the function y = f(x\ as shown in Fig. 2.18. In this figure 2e is the width of 
the horizontal strip centered on the line y = L, 28 is the width of the vertical strip 
centered on the line x = a, and the defining condition stated above can be ex¬ 
pressed this way: 

For each horizontal strip, no matter how narrow, there exists a vertical strip such that 
if jc =£ a is confined to the vertical strip, then the corresponding part of the graph will 
be confined to the horizontal strip. 

Students should read the precise definition of the meaning of (1) very carefully 
and be aware of its crucial role in the theory of calculus. However, an intuitive 
understanding of limits is quite enough for our purposes, and from this point of 
view the following examples should present no difficulties. 


Example 1 First, 


lim (3 jc + 4) = 10. 


Here it is clear that as x approaches 2, 3x approaches 6 and 3x + 4 approaches 
6 + 4 = 10. Next, 


lim 

JC-»1 



lim 


U+ l)u- 1) 

X - 1 


= lim (x + 1) = 2. 

X-»l 


The first thing we notice here is that the function (x 2 — l)/(x - 1) is undefined 
at x = 1, since both numerator and denominator equal 0. But this fact is irrele¬ 
vant, since all that matters is the behavior of the function for ;fs that are near 1 
but different from 1, and for all such x’s the indicated cancellation is valid, the 
function equals x + 1, and this is near 2. 


Example 2 It is illuminating to consider a few limits that do not exist, for in¬ 
stance, 
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lim -rj, 
*->0 \x\ 


lim and lim — 5 -. 

jc-*0 X *-»0 


The behavior of these limits is most easily understood by looking at the graphs 
of the functions jc/|jc|, \/x , and \/x 2 (Fig. 2.19). In the first case the function equals 
1 when x is positive and — 1 when x is negative (and is undefined for x = 0 ), so 
there is no single number that the values of the function approach as x approaches 
0 without regard to sign. We can be a bit more specific about the way this limit 
fails to exist, by writing 


lim 

x—> 0 + 



and 


lim tt = ~ 1 • 
*-*o- \x\ 


The notations 0+ and ;c—> 0— are intended to suggest that the variable x 
approaches 0 from the positive side (the right) and from the negative side (the 
left), respectively. The other two limits fail to exist because in each case the val¬ 
ues of the function become arbitrarily large in absolute value as x approaches 0 . 
In symbols, 

lim — = 00 Hm — = — °o ? and lim -V = 00 . 

*->0+ X *-»0- x x->0 x z 


Remember: The number L in (1) must be a real number; L = °o does not qual- 
ify. 


The main rules for calculating with limits are exactly what we would expect. 
For instance, 


lim x = a; 

x—>a 


and if c is a constant, then 

lim c = c. 

x—^a 

Also, if \im x ^ a f(x) = L and g(x) = M, then 

lim [/(*) + g(x)] = L + M 9 

x->a 

lim [f(x) ~ g(*)] = L- M, 

x—>a 

lim f{x)g(x) = LM, 


and 


limfl 

gw 


L_ 

M 


(if M * 0). 
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Q 

sin d 

6 

1 

0.84147098 

0.5 

0.95885108 

0.4 

0.97354586 

0.3 

0.98506736 

0.2 

0.99334665 

0.1 

0.99833417 

0.05 

0.99958339 

0.01 

0.99998333 

0.005 

0.99999583 

0.001 

0.99999983 


In words, the limit of a sum is the sum of the limits, with similar statements for 
differences, products, and quotients. These are called the limit laws , or limit the¬ 
orems. 

We remarked earlier that calculus is a problem-solving art and not a branch of 
logic. It has more to do with insight nourished by intuitive understanding than it 
does with careful deductive reasoning. Naturally, we will try to convince the 
reader of the truth of our statements and the legitimacy of our procedures. How¬ 
ever, these efforts will be brief and rather informal, in order to avoid clogging 
the text with massive indigestible chunks of theoretical material. Those who wish 
to devote more attention to the purely mathematical side of the subject will find 
logically rigorous proofs of the major theorems in Appendix A at the back of the 
book. In particular, the properties of limits stated here are proved in Appendix 
A.2. 

Before leaving these topics, we discuss two specific trigonometric limits that 
will be of crucial importance in the next chapter. The first is 


lim & (6 in radians). 

0^0 6 


( 2 ) 


We cannot simply set 0 = 0 here, because the result is the meaningless quotient 
0/0. We notice how different this is from an algebraic limit like 


lim 

->o 


3x 2 + 2x 
x 


= lim 

*->o 


jc(3jc + 2) 

X 


= lim (3 jc + 2) = 2, 
*->o 


because there is no apparent way to cancel 6 from sin 0. To get an impression 
of what is happening in (2), let us calculate the numerical value of the ratio for 
several small values of 6. We begin by observing that if we replace 6 by — 6 in 
the ratio, then we have 


sin(— 6) _ —sin 6 _ sin 6 
-6 ~ -6 ~ 6 ’ 

so we can restrict our attention to positive 0’s. Using a calculator set to the ra¬ 
dians mode, we can easily construct the adjoining table of values correct to eight 
decimal places. This numerical evidence strongly suggests (but does not prove!) 
that 


lim 

0—>o 


sin 6 
'~ 6 ~ 


(3) 



We now establish (3) by a simple geometric argument. Let P and Q be two 
nearby points on a unit circle (Fig. 2.20), and let PQ and PQ denote the lengths 
of the chord and the arc connecting these points. Then the ratio of the chord 
length to the arc length evidently approaches 1 as the two points move together: 


chord length PQ 
arc length PQ 


as PQ —> 0. 


With the notation in the figure, this geometric statement is equivalent to 


2 sin 6 _ sin 6 

26 ~ e 


as 26 -»0 or 6-> 0, 


Figure 2.20 


and this is (3). 
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The second limit necessary for our work in the next chapter is 

1 — cos 6 


lim 

0->O 


0 


= 0. 


(4) 


This follows from (3) by an ingenious use of the trigonometric identity sin 2 8 E 
cos 2 0 = 1 : 


.. 1 - cos 6 .. 1 1 — cos 6 

hm---= hm --- 

e~>o 0 0->o \ 6 


E cos 6 
E cos 6 


= lim 


1 — cos 2 6 
e-to 0( 1 E cos 6) 

sin 2 6 


= lim 


0 ->o S( 1 E cos 6) 
sin 6 sin 6 


= lim 
6->0 


0 


1 E cos 6) 
sin 6 


= [ lim ^ II lim 

\©->o 0 )\e-> o 1 E cos 6 



0 


1 E 1 


= 0 . 


The end of this calculation uses the facts that sin 0-^0 and cos 0—> 1 as 0—> 
0 , which are easily verified by examining the geometric meaning of sin 6 and 
cos 6 in Fig. 2.20. 


PROBLEMS 


Some of the following limits exist, and others do not. Evalu¬ 
ate those that do. 


1 lim (lx - 6 ). 

x->3 


3 lim 

; t -»0 

5 lim 


2 lim 


:->0 X — 1 ' 

3jc — 9 


x ->3 X ~ 3 ‘ 

X- 3-2x 2 


1 lim 

x—>5 


9 lim 


l E 3x 
4x 


12 


11 lim 


x—>—3 \ X E 3 x E 3 
x 2 + x — 56 


jc 2 — 1 \x E 28 

.• (x + 2)(x 2 - X + 1) 

12 lim - 5 — - - - 

Jt-»-2 X 2 E JC — 2 


10 


4 lim 


x — >2 3 E X 
6 


2jc — 4’ 


6 lim 


; E 3x 


*->3 X 2 — X + 3 


8 lim 


4jc 


x->-3 X + 3 
10 lim 77 . 

*—>0.001 JC 


X* 

lim tt- 

14 

lim 

*->0 |jc| 


*->4 

jc - 4 
lim 

16 

lim 

jc - Vjc - 2 


*->3 


JC - 4 

Hill r- 

VjC — 2 


Vjc 2 E 16 - 5 


v2 _ 


3jc 


17 


If lim x .^ a fW = 4,lim^gU) = -2, andlim^/iU) = 
0 , evaluate the following limits: 

(a) lim f f(x) ~ gU)]; (b) lim [gU)l 2 ; 


(c) lim 

g(*) 


(d) lim 

/to 


(e) lim 


/to . 


18 


(O lim . ~ - ~ . , l2 . 
x^a [/(x) + g(x)Y 


x->a h(x)’ 

In many situations we are interested in the behavior of 
f(x ) when jc is large and positive. If there exists a real 
number L with the property that /(jc) gets closer and 
closer to L as jc gets larger and larger (Fig. 2.21), then 
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we say that L is the limit of /(jc) as x approaches infin¬ 
ity, and we symbol this by writing lim. v _>oo/(jc) = L. 
Evaluate the follow u»g limits: 

( 2 + m 


(a) lim — 


(b) lim 


5x + 3 
xlX™ 2x —7 ' 
x 


(d) lim 


(c) lim 

X—>oo 

(e) lim 


19 Evaluate the following limits: 

, x i- sin 50 „ N ,. 

(a) lim —-—; (b) lim 


x 

2x 2 + jc 


x->~ 3x 2 — 7x + 2 ’ 


(f) lim 


2jc + 5 


jc 3 + lx 2 + 2x 
sin 6 


1 ‘ 


6 ’ 

(c) lim sin —; 
*“>°° x 

, , r 500 

(e) lim cos-; 

x->°° X 

sin 2jc 


0 ->o 26 ’ 
(d) lim x sin —; 

JC^-oo x 


/pn ,• 5111 A 

(f) lim —tt" 
x->o 3 jc 2 


(8) !™n 


v—>0 sin 3jc‘ 

20 Evaluate the following limits: 


(a) lim 

x—>0 

sin x 

3VJ’ 

(b) lim 

x—>0 

sin 2 jc 

JC 

(c) lim 

x—>0 


(d) lim 
0—>0 

0 

1 —cos 2 jc’ 

cos 0’ 

(e) lim 

x—>0 

3jc + sin x 

(f) lim 

0->O 

0 2 - 2 sin 0 

X 

0 

(g) lim 

x—>0 

3x 2 + 4x 
sin 2x 




Each of the following problems requires the use of a calcula¬ 
tor. Hereafter, problems of this kind will be signaled by the 
— .symbol §. 

•ml 21 Verify the limit (4) numerically by using a calculator to 
construct a table of values of (1 - cos 6)16 correspond¬ 
ing to the same 0’s used in the text. 

*Hint: Notice that dividing both numerator and denominator of this 
quotient by jc gives 


5x + 3 
lx-1 



What becomes of the expression on the right as jc —» oo ? 



Consider the limit 


lim 

0-»O 


1 — cos 6 
6 2 


(a) Use a calculator to construct a table of values of the 
function for small 0’s, and thereby form a conjecture 
about the value of this limit. 

(b) Prove your conjecture. 

23 The limit (3) says that = 1 or sin 0=0 for small 

0 

0. Test this approximation by using a calculator to find 
the value of sin 0 for 

(a) 0 = 0.1; (b) 0 = 0.01; (c) 0 = 0.001. 

Give a geometric explanation for the fact that each sin 0 
,—, is slightly less than its corresponding 0. 

Iiml 24 Using the trigonometric identity cos 2a = 1 - 2 sin 2 a 
with 2a = 0, and the approximation sin 0 = 0 for small 
0 , show that for these 0 we have 

cos 0 = 1 — y0 2 . 

Use a calculator to test this approximation for 
^ (a) 0 = 0.1; (b) 0 = 0.01; (c) 0 = 0.001. 

■ml 25 Consider the limit 


(a) Use a calculator to construct a table of values of x K 
for x = 1, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 
0.05, 0.01, 0.005, 0.001. Use this evidence to form 
a conjecture about the value of the limit. 

(b) Use the information in (a) to sketch the graph of y = 
jc* for 0 < x < 1. Estimate the location of the low- 

— est point. 

mil 26 The existence of the limit 

lim (1 + x) Ux 

x —>0 

will be established later, in Chapter 8 and Appendix A.8. f 
Estimate the value of this limit to five decimal places by 
using a calculator to find the value of the function for 
x= 1, 0.1, 0.01, 0.001, 0.0001, 0.00001, 0.000001, 
0 .0000001, 0.00000001, 0.000000001. 


+ This number defines the constant e, which is the most important con¬ 
stant in mathematics after 7r. 


2.6 

CONTINUOUS 
FUNCTIONS. THE 
MEAN VALUE 
THEOREM AND 
OTHER THEOREMS 


As we penetrate further into our subject, it will often be important for us to know 
what is meant by a continuous function. In everyday speech a “continuous” 
process is one that proceeds without gaps or interruptions or sudden changes. 
Roughly speaking, a function y = f(x ) is continuous if it displays similar behav¬ 
ior, that is, if a small change in x produces a small change in the corresponding 
value /(jc). The function shown in Fig. 2.22 is continuous at the point a because 
/(jc) is close to f(a) when jc is close to a, or more precisely, because /(jc) can be 
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made as close as we please to f(a) by taking x sufficiently close to a. In the lan¬ 
guage of limits this says that 

lim f(x)=f(a). (1) 

x—>a 

U p to this stage our remarks about continuity have been rather loose and in¬ 
tuitive, and intended more to explain than to define. We now adopt equation (1) 
as the definition of the statement that f(x) is continuous at a. The reader should 
observe that the continuity of f(x) at a requires three things to happen: a must 
be in the domain of/(x), so that f(a) exists;/(x) must have a limit as x approaches 
a\ and this limit must equal f(a). We can understand these ideas more clearly by 
examining Fig. 2.22, in which the function is discontinuous in different ways at 
the points b, c, and d: At the point b , lim X ~>bf( x ) exists but f(b) does not; at c, 
/(c) exists but lim x -» c f( x ) does not; and at d,f(d) and lim x -^df( x ) both exist but 
have different values. The graph of this f unction therefore has “gaps” or “holes” 
of three different kinds. 

The definition given here tells us what it means for a function to be continu¬ 
ous at a particular point in its domain. A function is called a continuous function 
if it is continuous at every point in its domain. In particular, by the properties of 
limits this is easily seen to be true for all polynomials and rational functions; and 
by looking at their graphs, we see that the functions Vi, sin x , and cos x are also 
continuous. We will be especially interested in functions that are continuous on 
closed intervals. These functions are often described as those whose graphs can 
be drawn without lifting the pencil from the paper. 

With a slight change of notation, we can express the continuity of our func¬ 
tion /(x) at a point x (instead of a) in either of the equivalent forms 



lim f{x + Ax) =f(x) 

Ax —>0 


or 


lim [f(x + \x) - f{x)] = 0; 
Ax—>0 


and if we write Ay = /(x + Ax) — /(x), then this condition becomes 


lim 


Ax—>0 


Ay = 0. 


The purpose of this reformulation is to make it possible to give a very short proof 
of a fact that we will need in the next chapter, namely, that a function which is 
differentiable at a point is continuous at that point. The proof occupies only a 
single line: 


lim 

Ax-»0 


Ay = lim • Ax = 
Ax—>0 Ax 


lim 

Ax-»0 Ax 


lim Ax 

Ax—>0 


dy_ 

dx 


0 = 0 . 


The converse of this statement is not true, since a function can easily be contin¬ 
uous at a point without being differentiable there (for example, see the point a 
in Fig. 2.22). 

There are several other facts in the theory of calculus that will be needed in 
the next few chapters to convert plausible reasoning into solid proof. Students 
should be aware of these facts but not smothered by them. We state them in the 
form of three basic theorems, which we present without proof but with a few 
comments on each that we hope will illuminate their meaning. Proofs are not 
given here because a careful examination of the theoretical foundations of cal¬ 
culus does not belong in a first course. All three theorems are extremely plausi¬ 
ble—what some might call “intuitively obvious.” Part of the difficulty they cause 
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for beginning students lies in the effort required to doubt them in the face of their 
compelling believability: 


The Mean Value Theorem Let y — f(x) be a function with the following two 
properties: 

f(x ) is continuous on the closed interval [a, b]\ and 
f(x ) is differentiable on the open interval ( a , b). 

Then there exists at least one point c in the open interval ( a, b) such that 


f'(c) = 


fib) ~f(a) 
b — a 


( 2 ) 


or equivalently, 


m-f{a)=f\c\b-a\ 


(3) 


COMMENTS ON THE MEAN VALUE THEOREM (MVT) 

We see that the statement is reasonable by looking at its geometric meaning as 
shown in Fig. 2.23. The right side of equation (2) is the slope of the chord join¬ 
ing the endpoints A and B of the graph, and the left side is the slope of the tan¬ 
gent line at the point on the graph corresponding to i = c; and the MVT says 
that for at least one intermediate point on the graph the tangent is parallel to the 
chord. In Fig. 2.24 there are two such points, corresponding to x — c\ and x = 
c 2 . But this is perfectly all right, because the phrase “at least one point c” allows 
for the possibility of two such points, or three, or any number whatever. 

The conclusion of the MVT is crucially dependent on its hypotheses, because 
this conclusion does not follow if the hypotheses are weakened ever so slightly. 
We see this by considering the example of the function y — \x\ defined on the 
closed interval [—1, 1]. This function (Fig. 2.25) is continuous on the closed in¬ 
terval [-1, 1] and is differentiable on the open interval ( —1, 1), except at the 
single point x = 0, where the derivative does not exist. The conclusion fails, be¬ 
cause the chord joining A and B is horizontal and clearly the graph has no hor¬ 
izontal tangent. 

To understand the significance of the Mean Value Theorem, we briefly and in¬ 
formally consider three simple consequences that we will need in Chapters 4 and 




Figure 2.23 


Figure 2.24 


Figure 2.25 
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5. In each case we have a property of the derivative that implies a property of 

the function, and the MVT is the link between the two properties. 

1. If f\x) > 0 on an interval, then f(x) is increasing on that interval [“in¬ 
creasing” means that a < b implies/(a) < /(£)]. Our geometric intuition as¬ 
sures us that this is true, because f(x) > 0 means that the tangent points up 
to the right everywhere (Fig. 2.26). For a more explicit argument based on 
the MVT, we point out that in this situation the right side of (3) is positive, 
so the left side is also positive, and this means that f(a) < f(b). 

2. Similarly, iff\x) < 0 on an interval, then f(x) is decreasing on that inter¬ 
val [“decreasing” means that a < b implies f(a) >f(b)]. 

3. Iff(x) = 0 on an interval, then f(x) is constant on that interval To show 
this we assume the contrary, namely, that the function is not constant. Then 
there exist two points a and b with a < b at which the function has diff er¬ 
ent values f(a ) and f(b). But this implies that the left side of (3) is not equal 
to 0, whereas the right side must equal 0. This contradiction shows that our 
assumption—that the function is not constant—cannot be true. 


The Extreme Value Theorem If y = f(x ) is a function that is defined and con¬ 
tinuous on a closed interval [a, b], then this function attains both a maximum 
value and a minimum value at points of the i nterval; that is, there exist points 
c and d in [a, b] such that f(c) > f(x) > f(d) for all x in [a, b].* 


COMMENTS ON THE EXTREME VALUE THEOREM (EVT) 

Informally, this theorem asserts that the graph of a continuous function on a closed 
interval always has both a high point and a low point. If we think of the graph 
as drawn by moving a pencil across the paper from the point A to the point B (see 
Fig. 2.27), then the statement is so visibly true that we wonder how anyone could 
doubt it. However, it is difficult to prove in a fully rigorous manner, because it de¬ 
pends on a subtle property of the real line (completeness, meaning that no points 
are “missing” from the line) that is normally discussed only in advanced courses. 

Also, just as in the case of the Mean Value Theorem, the conclusion here is 
crucially dependent on the hypotheses that the function is continuous and the in¬ 
terval is closed. For example, the function in Fig. 2.28 is continuous on the in- 


'Maximum values and minimum values are known collectively as extreme values. 
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Figure 2.31 



terval [0, 1), but this interval is not closed because it lacks the right endpoint. 
We see that this function attains no maximum value at any point of [0, 1), be¬ 
cause the only possible candidate for a maximum value is 1 at x = 1, but/(1) is 
not defined. On the other hand, in Fig. 2.29 the interval [0, 2] is closed and the 
function is continuous at every point of this interval except for the single point 
x = 0, and again the function attains no maximum value at any point of the in¬ 
terval. 

There is a further important fact about extreme values that is known as Fer¬ 
mat 1 s theorem: If a continuous f motion fix) on a closed interval [a, b] attains 
its maximum or minimum value at an interior point c of [a, b], and if fix) is dif¬ 
ferentiable at c, then f\c) = 0. In later chapters we will often be trying to locate 
extreme values of continuous functions on closed intervals. Fermat’s theorem 
tells us that we must seek such points either at the endpoints of the interval or 
at those interior points where f\x) = 0 or fix) does not exist. 


The Intermediate Value Theorem If y = fix) is a function that is defined and 
continuous on a closed interval [a, b], then this function assumes every value 
between fia) and /(£); that is, if K is any number strictly between fia) and 
ffb), then there exists at least one point c in (a, b) such that/(c) = K. 


COMMENTS ON THE INTERMEDIATE VALUE THEOREM (IVT) 



In the language of graphs (Fig. 2.30), every horizontal line of height K intersects 
the graph of y = fix) if K is between fia) and fib). 

The most vivid form of the IVT says that if y = fix) is continuous on [a, b] 
and fia) and fib) have opposite signs, then/(c) = 0 for at least one point c in 
ia, b). In other words, the graph cannot get from one side of the Jt-axis to the 
other without actually crossing this axis (Fig. 2.31). This may seem to be very 
obvious indeed, but the statement can be false if the function fails to be contin¬ 
uous at even a single point. We see this by considering the function defined on 
[0, 2] by 


y=fix) = \ j 


if 0 < jc< 1, 
if 1 < jc < 2 . 


Figure 2.32 


It is clear that for this function (Fig. 2.32) we have/(0) < 0 and /(2) > 0, and 
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yet—because of the discontinuity at the single point x = 1 —there does not ex¬ 
ist any point c in (0, 2) for which/(c) = 0. 

The practical significance of the IVT can best be understood by means of an 
example. We observe that the equation 

x 3 + 2x -4 = 0 (4) 

is not easy to solve by factoring, because the left side has no obvious factors. 
However, the continuous function f(x) = x 3 + 2x - 4 is negative at x = 1 and 
positive at * = 2 [/(l) = -1 and/(2) = 8]. The IVT therefore guarantees that 
fix) has a zero at some point in (1, 2), so equation (4) has a solution in this in¬ 
terval. Further, fix) = 3x 2 + 2 > 0 for all x, so f(x) has only one zero and (4) 
has only one solution. This follows from the fact that if there were two zeros, 
then the Mean Value Theorem would imply that f\c) = 0 for some intermediate 
point c, which cannot happen. In Section 4.6 we develop a method for calculat¬ 
ing this solution of (4) to any desired degree of accuracy. 

Remark Most students of calculus are impatient with the theory of the subject, 
and rightly so, because the essence of calculus does not lie in theorems and how 
to prove them but rather in tools and how to use them. A moderate dose of rig¬ 
orous thinking in mathematics—rigidly correct, leakproof thinking that can with¬ 
stand the closest skeptical scrutiny—is a good thing; but like virtue, it can be 
overdone, and in calculus courses it often is. This preoccupation with the tech¬ 
niques and fine points of proof, rather than with the central ideas of the subject, 
can make a forbidding mystery out of concepts that are essentially simple and 
clear. We mention these issues to point out that even though most of our discus¬ 
sions in this book involve plausibility arguments intended to be reasonably con¬ 
vincing to reasonable people, full and rigorous proofs of all theorems are avail¬ 
able in Appendix A for those who may wish to examine them. 


PROBLEMS 

1 Find the points of discontinuity of the following func¬ 
tions: 


X 

(b) 

X 

x 2 + r 

x 2 - r 

x 2 - 1 

X 1 ’ 

(d) 

Vx; 

1 

VT 

1 

(f) 

(h) 

V?; 

1 

x 2 + x - 12’ 

x 2 + 4x + 5 ‘ 


In Problems 2-9, verify that the function/(x) satisfies the hy¬ 
potheses of the Mean Value Theorem on the given interval, 
and find all points c whose existence is guaranteed by the 
theorem. 

2 f(x) = sin x, [0, 27 t]. 

3 f(x) = x 2 + 1, [1, 2]. 


4 /(x) = x 3 + 1, [1,2]. 

5 fix) =x 2 -4x + 6, [2,4]. 

6 fix) = x 2 + x, [-2,8]. 

7 fix) = Vx+ 1, [0, 3]. 

8 fix) = V25 —x 2 , [-5,5], 

9 fix) = l/x, [i2]. 

10 If fix) is an arbitrary quadratic polynomial, that is, if 
fix) = Ax 2 + Bx + C (A =£ 0), show that the point c 
whose existence is guaranteed by the Mean Value Theo¬ 
rem is the midpoint of the interval [ a , b\ 

11 If fix) = 1 lx and g(x) = l/x + x/|x|, show that these 
functions have identical derivatives, so that [fix) - 
g(x)]' = 0. However, their difference is not constant. Ex¬ 
plain how this is possible in view of consequence 3 of 
the Mean Value Theorem. 

12 A car starts from rest and travels 4 mi along a straight 
road in 6 minutes. Use the Mean Value Theorem to show 
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13 

14 

15 


16 


17 


18 

19 


20 


21 


that at some moment during the trip its speed was ex¬ 
actly 40 mi/h. 

If/'(; c) = c, a constant, for all jc, show that /(jc) = cx + 
d for some constant d. 

If /(jc) and g(jc) are two functions with equal derivatives 
on an interval, what can be said about their difference 
/to ~ g(x)l 

For each of the given intervals, find the maximum value 
of sin jc on that interval, and also find the value of x at 
which it occurs: 


(a) 





i°’U 

; (b) 

N 


(C) [0, 77 ]. 


For each of the given intervals, find the maximum value 
of cos x on that interval, and also find the value of x at 
which it occurs: 




77 

[ 0 ’2j 

; (b) 

q 

1 _ 


(a) 


Does the function 


(c) [0, 277]. 


+ 9jc 4 + 5jc 6 


have 


(a) a maximum value on [5, 8 ]? 

(b) a minimum value on [5, 8 ]? 

Does the function jc 3 have a maximum value on 
(a) [1, 5]; (b) [-5, 2]; (c) (3, 4)7 
If so, where? 

Does the function jc 4 have a minimum value on 
(a) [-3, 5]; (b) (-4, 2); 

(c) (2, 3); (d) (- 1 , 5)7 

If so, where? 

Does the function 4 — x 2 have 

(a) a maximum value on (- 2 , 2)7 

(b) a minimum value on ( — 2 , 2)7 
If so, where? 

Does the function 4 + x 2 have 

(a) a maximum value on (- 2 , 2)7 

(b) a minimum value on ( — 2 , 2)7 
If so, where? 


In Problems 22-29, find the maximum and minimum values 
attained by the given function on the given interval. 

22 1/jc, (0, 1). 

23 1/jc, (0,1]. 

24 1 - jc 2 , (0,1). 

25 1 x 2 , [0,1). 

26 1 x 2 , [0,1]. 

27 1 — jc 2 , [-3,-2]. 

28 |3jc — 4|, [1,2]. 

29 2 + |2jc - 3|, ( 0 , 2 ). 

In each of Problems 30-33, apply the Intermediate Value The¬ 
orem to show that the given equation has a solution in the 
given interval. 


30 jc 3 + 2jc + 5 = 0, [-2,-1]. 

31 jc 4 + 3jc-5 = 0, [1,2]. 

32 jc 5 - 4x 3 + 127 = 0, [-3, -2]. 

33 jc 6 - 3jc + 1 =0, [-1, 1]. 

34 Show that the equation jc 3 - 5jc + 1 = 0 has three dis¬ 
tinct roots by calculating the value of/(jc) = jc 3 - 5jc + 
1 at the points jc = -3, -2, — 1, 0, 1, 2, 3. State the in¬ 
tervals in which the roots lie. 

35 If p(x) is a polynomial of odd degree, show that the equa¬ 
tion p(x) = 0 has at least one solution. 

36 If A and B are positive constants, show that the equation 


has a solution in the interval ( 1 , 2 ). 

37 If /(jc) and g(jc) are continuous on [ a , b], and if 
f(a) < g(a) and f(b) > g(b ), show that the equation 
/(jc) = g(jc) has at least one solution in (a, b). 

38 Assume for a moment that the rational numbers are the 
only numbers that exist. Under this assumption, show 
that the Intermediate Value Theorem is false by consid¬ 
ering the function y = /(jc) = jc 2 — 2 on the interval 
[1, 2]. This example shows that the truth of the Inter¬ 
mediate Value Theorem depends on the profound fact 
that no points are “missing” from the real line. 

39 Let y = /(jc) be a continuous function defined on the 
closed interval [ 0 , b] with the property that 0 < /(jc) < b 
for all jc in [0, b\ Show that there exists a point c in 
(0, b) with the property that/(c) = c. Hint: Consider the 
function g(jc) = /(jc) — jc. 

40 The rectangle in Fig. 2.33 represents the floor of a room, 
and AB a straight piece of string lying on the floor whose 
ends touch the opposite walls W\ and W 2 . The tangle rep¬ 
resents the same piece of string wadded up and thrown 
back down on the floor. Show that there is at least one 
point of the wadded string whose distances from the two 
walls are exactly the same as they were before. Hint: See 
the preceding problem. 
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CHAPTER 2 REVIEW: DEFINITIONS, CONCEPTS, METHODS 


Define , state, or think through the following. 

1 Tangent line according to Fermat. 

2 Delta notation. 

3 Derivative of a function. 

4 Differentiable function. 

5 Three-step rule (or process). 

6 Leibniz notation. 

7 Derivative of fix) = ax 2 + bx + c. 

8 Average and instantaneous velocity. 

9 Speed. 

10 Rate of change. 

11 Acceleration. 


12 

13 

14 

15 

16 

17 

18 

19 

20 
21 


lim fix) = L. 


Limit laws, 
sin 8 


lim 

0->O 


lim 

0-^0 


8 


= 1 . 


1 — cos 8 
8 


= 0. 


Continuity of f{x) at x = a. 
Continuous function. 
Differentiability implies continuity. 
Mean Value Theorem. 

Extreme Value Theorem. 
Intermediate Value Theorem. 


ADDITIONAL PROBLEMS FOR CHAPTER 2 


SECTION 2.2 

1 For what value of b does the graph of y = x 2 + bx + 1 
have a horizontal tangent at jc = 3? 

2 Find the two points on the curve y = x — \x 2 at which 
the tangent passes through the point (f, 0). 

3 Let P = (jco, yo) be a point on the parabola y = x 2 . Show 
that a nonvertical line passing through P which does not 
intersect the curve at any other point is necessarily the 
tangent at P\ that is, show that if the line 

y - yo = m(x - x 0 ) 

intersects y = x 2 only at (xo, yo), then m = 2xo- 

4 If (x\, yi) and (x 2 , y 2 ) are distinct points on the parabola 
y = jc 2 , at what point on the curve is the tangent paral¬ 
lel to the chord joining these two given points? 

5 The curve y = jc 2 is a particular parabola, but if a is an 
unspecified positive constant, y =fix) = ax 2 is a com¬ 
pletely general parabola located in a convenient position. 

(a) Show that/'Uo) = 2ax 0 . 

(b) Show that the tangent at a point P = (jco, yo) other 
than the vertex has y-intercept -yo, and use this fact 
to formulate a geometric method for constructing the 
tangent at P. 


SECTION 2.3 

6 Use the three-step rule to calculate /'(jc) if/(jc) is equal 
to 


7 


(a) 


jc + 1 


(b) 


2x 


(c) V3x + 2; (d) Vx 2 -F 1. 

Sketch the graph of each of the following functions and 
state where it is not differentiable: 


(a) Vj*j; (b) |* 2 - 4|; 

(c) \2x - 3|; (d) *1*1. 


8 Let/(x) be a function with the property that/(xi + x 2 ) = 
fix\)fix 2 ) for all x\ andx 2 . If/(0) = 1 and/'(0) = 1, show 
that /'(jc) = /(jc) for all x. 

9 If the derivative/'(x) exists, then it can be calculated from 
the formula 


/' (x) - lim 

J A.v->0 


fix + Ax) -fix - Ax) 
2Ax 


Verify this statement for the special case/(x) = x 2 , and 
then prove it in general. [To understand the statement, let 
P, Q, R be the points on the curve y = fix) that corre¬ 
spond to x, x + Ax, x — Ax, and write the slope of the 
secant through Q and R\ and to prove it, notice that 
fix + Ax) -fix - Ax) = fix + Ax) - /(x) +/(x) - 
fix - Ax).] 

10 Show that the following function is differentiable at x = 
0: 


/« = 


0 


if x is rational, 
if x is irrational. 


11 Show that the following function is not differentiable at 
x = 0: 


fix) = 


x 

0 


if x is rational, 
if x is irrational. 


12 If fix) is a function with the property that |/(x)| < x 2 for 
all x, prove that fix) is differentiable at x = 0. 

13 Consider the function fix) defined by 


f(x) = 


x ^ 

mx 4- b 


if x < a, 
if x > a, 


where a, b, m are constants. Find what values m and b 
must have (in terms of a) in order for this function to be 
differentiable at all points. 
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SECTION 2.4 

14 On a certain bicycle trip, the first half of the distance was 
covered at 30 mi/h and the second half at 20 mi/h. What 
was the average velocity? (It was not 25 mi/h.) 

15 A silver dollar is thrown straight up from the roof of a 
200-ft building. After t seconds, it is 

s= 200 + 24 t - 16 t 2 

feet above the ground. When does the dollar begin to fall? 
What is its speed when it has fallen 1 ft? 

16 A capacitor (or condenser) in an electric circuit is a de¬ 
vice for storing electric charge. If the amount of charge 
on a given capacitor at time r is Q = 3t 2 + 5t + 2 
coulombs, find the current I = dQ/dt in the circuit when 
t = 3. 

17 Use the three-step rule to show that the rate of change 
of the volume of a sphere with respect to its radius equals 
the surface area. 


42 

44 

46 

48 

50 

52 

54 

56 

58 


SECTION 2.5 

Evaluate the following limits. 
18 lim 


20 lim 

m3 


22 lim 

mO 


24 lim 


26 lim 


28 lim 

x- - 

30 lim 


32 lim 

M —2 


x 2 — 6x + 8 
m 2 x 2 — 5x + 6 
(x 2 + 3x - 4) 2 
mi x 2 - lx + 6 
2x 2 +x-6 


34 lim 


x + 2 
x - 3 


m~4 X 2 + X ■ 
X 2 -X- 


12 ' 

^ r X 2 ~ X- 6 

36 lim ---—• 

m4 x 2 — lx + 12 


2x - x 2 

19 

lim 

U+). 


2 - x ' 

x 2 - 6x + 9 

21 

x-»0 

lim 

\ x ) 

4x 2 - 5x 


x - 3 
x 2 (l -x) 

23 

lim 

X 

41 - x) 

59 

3x 

x + 2 

25 

.mO 

lim 

3 x 2 
x + 2 

60 

x 2 - 4' 

2x 2 + 3 

27 

m2 

lim 

x 2 - 4 ' 

2 - 3Vx 

61 

x + 4 ’ 


m0 

1 + 9Vx 



2x - 3 


29 lim ~ 2 . 

M~1 X 2 ~ 1 

2x 2 + x - 6 
x + 2 ' 

33 lim 

X—>3 x 2 + x — 12 


31 lim 


35 lim 

m3 

37 lim 


38 lim 


40 lim 


t 3 — 6x 2 + 3x + 2 5 
Ml x 3 + x 2 - 3x + 1 
x 3 — 64 


m4 x 


mil -z-7ZT- 

m3 X 2 — lx + 12 

x + Vx - 2 

Ml X 3 ~ 1 

x 3 — 4x 

39 lin l ~3 - T 2 . T 

x->2 X 3 — iX + 2x 

41 lim 


* 3 — a 3 


62 


63 


x->a X 1 


a / 


*If x = a is a zero of a polynomial p(x ), then x - a is a factor of p{x) 
and the other factor can be found by long division (see Additional 
Problem 50 at the end of Chapter 1). 



lim — 

M-a jr 3 + a 


3 ■ 


43 lim 2 X \ 

x-^>0 


lim 2 * 2 . 

mO 

2 17 * 2 + 1 

7- 

X 

lim r — 

*->°° Vx 2 + 1 

2x 3 - x 2 + lx — 3 
2 — x + 5x 2 — 4x 3 ' 


45 lim 2 llx \ 

jt —>o 

2 Ux + 1 
47 “aiiM- 

49 lim *— . 

x _^oo + i 


51 lim 


9x 45 - x 9 + 2 
m~ 3x 45 + x 29 — 19' 


lim 2 X . 

X-^oo 

lim 2 Vx . 

X —>°o 

V7TT 

lim 


V9-C+1 


53 lim 2 

x —>00 


55 lim (Vx + 1 - Vx). 

X—>OC3 

2 X - 2~ x 

57 lim t, 

x—2^ + 2 * 


Consider the function /(x) defined for x # 0 by /(x) = 
[1/x], where [1/x] denotes the greatest integer < 1/x, as 
in Additional Problem 59 at the end of Chapter 1. Sketch 
the graph of this function for \ < x < 2, and also for 
—2 < x < — How does /(x) behave as x approaches 0 
from the positive side? From the negative side? Does 
lim^o/M exist? 

Follow the directions in Problem 58 for the function 
/W = (-l)[^. 

Follow the directions in Problem 58 for the function 

/(*) = W(-i) Ilfa1 ' 

Consider the function f{x) defined by 


fix) = j 


if x is rational, 
if x is irrational. 


For every a, lim Mfl /(x) does not exist. Why? 
Define a function/(x) by 


/« = 


0 

J_ 

n 


if x is irrational, 

if x is a rational number m/n 

in lowest terms with n > 0. 


Show that/(x) is continuous at irrational points and dis¬ 
continuous at rational points. 

The slope of the tangent line to the graph of the expo¬ 
nential function y = 2* at the point (0, 1) is 


m0 x 


Estimate this slope to three decimal places by using a 
calculator to find the value of {2 X — 1 )/x for x = 1,0.5, 
0.1, 0.05, 0.01, 0.005, 0.001, 0.0005, 0.0001. 

Same as Problem 63 for the function y = ¥ and its tan¬ 
gent line at the point (0, 1). 














































THE 

COMPUTATION 
OF DERIVATIVES 


Differential calculus—the calculus of derivatives—takes its special flavor and 
importance from its many applications to the physical, biological, and social sci¬ 
ences. It would be pleasant to plunge into these applications immediately and get 
to the heart of the matter without any further delay. However, from the point of 
view of overall efficiency it is better to postpone this to the next chapter, and in¬ 
stead take a little time now to learn how to calculate derivatives with speed and 
accuracy. 

As we know, the process of finding the derivative of a function is called dif¬ 
ferentiation. In Chapter 2 this process was based directly on the limit definition 
of the derivative, 


f(x) = lim 
J K ' Aa—>0 


f(x 4- Ax) -fix) 

Ax 


or equivalently, 


3.1 

DERIVATIVES OF 
POLYNOMIALS 


-- = lim 
dx Ax 


We have seen that this approach is rather slow and clumsy. Our purpose in the 
present chapter is to develop a small number of formal rules that will enable us 
to differentiate large classes of functions quickly, by purely mechanical proce¬ 
dures. In this section we learn how to write down the derivative of any polyno¬ 
mial by inspection, without having to think about limits at all; and by the end of 
the chapter we will be able to cope quite easily with messy algebraic functions 
like 


Vx 2 + 1 ’ Lx - Vx + 1 . 


+ Vx + 1 


1/3 


and 


Vi + VT+ vi + x. 


We will also learn how to differentiate many trigonometric functions. Our goal 
in this phase of our work is computational skill , and needless to say , such skill 
comes only with practice. No one learns how to spell , or ski , or play a musical 
instrument , without constant practice accompanied by constant self-correction, 
and differentiation is no exception to this rule. 

Students will recall that a polynomial in x is a sum of constant multiples of 
powers of x in which each exponent is zero or a positive integer: 

P(jc) = a n x n + a n ~\x n ~ x + • • • + a\x + a 0 . 
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The way a polynomial is put together out of simpler pieces suggests the differ¬ 
entiation rules that we now discuss. 

1 The derivative of a constant is zero , 


The geometric meaning of this statement is that a horizontal straight line y = 
f(x) = c has zero slope. To prove the statement from the definition, we notice 
that Ay = /(x + Ax) - f(x) = c — c = 0, so 


^ = lim A" = 'im -p- = lim 0 = 0. 
dx At— >0 Ax Ajc— >0 Ax At— >0 


2 If n is a positive integer, then 

4~x n = nx n ~K 
dx 


In words, the derivative of x n is obtained by bringing the exponent n down in 
front as a coefficient, then subtracting 1 from it to form the new exponent 
n — 1. We already know three special cases of this rule from Chapter 2: 

-y- x 2 = 2x, -y- x 3 = 3x 2 , and -t-x 4 = 4x 3 . 

dx dx dx 


To prove this rule in general, we write y = f(x) = x n and use the binomial the¬ 
orem" to obtain 


Ay =/(x + Ax) -/(x) = (x + Ax)" - x" 


,«-> A, 4- "fr-il v-2 


x n + nx" 1 Ax + 


x n ~ 2 (A*) 2 + • ■ • + (Ax) n 


= nx «-> Ajc + ” (n 2 1] x"- 2 (Ajt) 2 + • • • + (Ar)". 


This yields 


^ = 1 im ^ 
dx Ajc—> o Ax 


= lim 
Ajc—>0 


K n ~ 1 + n( ” 2 x n ~ 2 Ac + • • • + ( Aa :)"- 1 


because Ax is a factor of each term in brackets beyond the first. 


Tor students who have forgotten the details of the binomial theorem, we state it as follows: If n is 
a positive integer, then 


The precise form of this expansion can be understood without too much difficulty by simply think¬ 
ing about the ^-factor product 

(a + b) n = (a + b)(a + b) • • • (a + b). 

To multiply these factors out, we begin by choosing a from each factor, which gives the term a n . If 
we next choose b from one factor and a from all the others, this can be done in n ways, so we get 
ba n ~ ] n times, or na n ~ l b. Similarly, n(n-\)!2 is the number of ways b can be chosen from two fac¬ 
tors and a from all the others, etc. The “etc.” is explained more fully in Appendix B.l. 
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Our rule remains true when the exponent is a negative integer or a fraction. 
However, it is convenient to postpone giving a proof of this to a later part of the 
chapter. 


3 If c is a constant and u = f(x) is a differentiable function of x, then 



c 


du 

dx' 


That is, the derivative of a constant times a function equals the constant times 
the derivative of the function.* To prove this, we write y = cu = cf(x) and ob¬ 
serve that Ay = cf(x + Ax) - cf(x) = c[f(x + A^:) — f(x)] = c Aw, so 

dy .. Ay .. c Aw Aw du 

—f- = lim —- = lim —— = c lim —— = c —. 

dx Ax->o Ajc Ax—»o Ajc Av—>o Ajc dx 


Combining rules 2 and 3, we see that 

d „ _i 

— cx n = cnx n 1 
dx 


for any constant c and any positive integer n. 


Example 1 We are now in a position to calculate the following derivatives as 
fast as we can write: 


— 3x 7 = 21x 6 , — 

dx dx 


55x = 55x° = 55, 
dx 


(“I * 12 ) = ~ 6x "' ft 22x ' m = 2222x ' 00 ' 

d / 10^ +login 77 o 

dx \ V\9 + 1024 / 


4 If u = f(x) and v = g(x) are functions of x, then 


— (w + v) - 


du + dv_ 
dx dx 


That is, the derivative of the sum of two functions equals the sum of the indi¬ 
vidual derivatives. The proof is routine: If we write y = u + v= f(x) + g(x), then 
Aj = [fix + Ax) + g(x + Ax)] - [/(x) + g(x)] = [f(x + Ax) - fix)] + [g(x + 
A.v) — g(jc)] = Aw + Av, and therefore 

dy Ay .. Aw + Av [Aw Av 

= lim - = lim ---= lim —- \- —— 

dx Ac->o Ajc At-»o Ajc Ax^o zYr Ajc 

t . Aw , Av du , dv 

= lim —-h lim —— = ——I——. 

Ax—»o Ajc Ax—>o Ax dx dx 


In essentially the same way we can show that the derivative of a difference equals 
the difference of the derivatives, 


d du 

Td u ~ v) = ^ 


dv 
dx ‘ 


*From now on we assume that every function we deal with is differentiable unless a specific state¬ 
ment is made to the contrary. 
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Further, these results can be extended without difficulty to any finite number of 
terms, as in 


d , N du 

— (u - V + w) = — 
dx ax 


dv + dw_ 
dx dx' 


Example 2 It is now easy to differentiate any polynomial. For instance, 

- 7 - (15x 4 + 9x 3 —lx 2 — 3x + 5) = -j- 15x 4 + - 7 - 9x 3 —- 7 - 7x 2 - -j- 3x + -j-5 

dx dx dx dx dx dx 


= 60x 3 + 27x 2 - \4x - 3. 


With a little practice we can omit the middle step and write down the final re¬ 
sult immediately by inspection. 


Example 3 The function y = (3x — 2 ) 4 is a polynomial but is not in standard 
polynomial form. None of the rules established so far apply to this function di¬ 
rectly, though later we will prove a formula that can be used here. Meanwhile 
we must first expand by the binomial theorem. This gives 

y = Ox - 2) 4 = [3x + ( —2)] 4 

= (3x) 4 + 4(3*) 3 (-2) + (3*) 2 (-2) 2 + (3x)(—2) 3 + (-2) 4 

= 8 lx 4 — 216x 3 + 216x 2 — 96x + 16, 


so 


= 324x 3 - 648x 2 + 432x - 96. 
dx 


Example 4 Even though the letters x and y are often used for the independent 
and dependent variables, there is obviously nothing to prevent us from using any 
letters we please, and the calculations work in just the same way. Thus, 

5 = 13r 3 - Hr 2 + 25 

is a polynomial in r; and by the rules developed in this section, its derivative is 
clearly 

ds 

= 39r 2 - 22r. 
dt 


Example 5 An object moves on a straight line in such a way that its position s 
at time t is given by 

s = t 3 + 5t 2 - 8 r. 


What is its acceleration when it is at rest? 
The velocity v and acceleration a are 
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The object is at rest when v = 0 or 

3 1 2 + lOr - 8 = (3 1 - 2)(r + 4) = 0, 

that is, when t = f, —4. The corresponding values of the acceleration are a = 
14, -14. 


PROBLEMS 


1 Find the derivative of each function: 

(a) 6 jc 9 ; 

(b) 7T 5 ; 

(c) — 15 jc 4 ; 

(d) 3x 500 + 15jc 100 ; 

(e) (x - 3) 2 ; 

(f) \x 5 + \x 4 + yjr 3 + \x 2 + x; 

(g) x A + jc 3 + x 2 + jc + 1 ; 

(h) (x - 2) 5 ; 

(i) x 12 + 2x 6 — 4x 3 —6 jc 2 ; 

(j) {2x - l)(3x 2 + 2). 

2 Differentiate each of the following functions: 

(a) f(x)=x 200 - 10(k + 50; 

(b) g(x) = (l(k) 4 ; 

(c) hit) = r 10 + 7r 8 - 9r 3 + 5r; 

(d ) F(y) = (y 2 - l)(3y — 5); 

(e) G(jc) = (x 2 + 3)(jc 2 + jc + 1); 

(f) H(t) = (1 + t 2 )i 1 + t 2 + r 3 ); 

(g) fix) = (3jc 3 - 2jc 2 )(15jc 4 - 2jc + 5); 

(h) g(x)**x(2x+ l)i2x- 1); 

(i) hit) = (3r - 5) 2 ; 

(j) y = 2jc(3jc 2 + 1)(jc 2 -jc + 2); 

(k) y = (3jc + 2)(2jc — 3); 

(l) y = i2x 2 + 3)(3jc 3 - 4). 

3 If s is the position at time t of an object moving on a 
straight line, find the velocity v and the acceleration- a : 

(a) 5 = 12-6r+ 3r 2 ; 

(b) 5 = 13 - 9t + 6r 3 ; 

(c) s = i3t- 2) 2 . 

4 Find a function of x whose derivative is the given func¬ 
tion: 

(a) 3 jc 2 ; 

(b) 4 jc 2 ; 

(c) 3 jc 2 + 2jc — 5. 

5 Find the line tangent to the curve y = 3x 2 — 5jc + 2 at 
the point (2, 4). 

6 Find the points on the curve y = 4x 3 + 6x 2 — 24x + 10 
at which the tangent is horizontal. 

7 At what points on the curve y = x 3 —x 2 — x - 1 is the 
tangent horizontal? 

8 At what points on the curve y = 2x 3 —3x 2 + 6x — 39 is 
the tangent horizontal? 


9 Show that the curve y = 4x 3 + 4jc — 2 has no tangent 
line with slope 3. 

10 Show that the curve y = 2x 5 + 3;thas no horizontal tan¬ 
gent line. What is the smallest slope a tangent line can 
have? 

11 The line x = a intersects the curve y = \x 3 + 4jc + 2 at 
a point P and the curve y = 2x 2 + x at a point Q. For 
what value (or values) of a are the tangents to these 
curves at P and Q parallel? 

12 Find the vertex of the parabola y = jc 2 — 8 jc + 18. Hint: 
The tangent at the vertex is horizontal. 

13 Find the vertex of the parabola y = ax 2 + bx + c by the 
method of Problem 12. 

14 What values must the constants a , b, c have if the two 
curves y = x 2 + ax + b and y = cx - x 2 have the same 
tangent at the point (3, 3)? 

15 State conditions on the coefficients a, b, c, d so that the 
graph of the polynomial y = ax 3 + bx 2 + cx + d has 
precisely 

(a) two horizontal tangents; 

(b) one horizontal tangent; 

(c) no horizontal tangent. 

16 Show that any two tangent lines to the parabola y — ax 2 
intersect at a point that lies on the vertical line halfway 
between the points of tangency. 

17 Find the equation of the tangent to the curve y — x 3 at 
the point (a, a 3 ). For what values of a does this tangent 
intersect the curve at another point? 

18 Find the tangent to the curve y = x 3 that passes through 
the point ( 0 , 2 ). 

19 There are two lines through the point (2, 8 ) that are tan¬ 
gent to the curve y = x 3 . Find them. 

20 Sketch the curves y = x 2 and y = —x 2 + 2x - 2 on a 
single coordinate system, and use the sketch to decide 
whether there are any lines that are simultaneously tan¬ 
gent to both curves. If there are any, find their equations. 

21 Let p be a positive constant and consider the parabola 
jc 2 = 4 py with vertex at the origin and focus at the point 
(0, p), as shown on the left in Fig. 3.1. Let (jco, To) be a 
point on this parabola other than the vertex. 

(a) Show that the tangent at Uo, yo) has y-intercept -y 0 . 

(b) Show that the triangle with vertices (jt 0 , yo), (0, —yo), 
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Figure 3.1 A parabolic reflector. 




and (0, p) is isosceles. Hint: Use the distance for¬ 
mula. 

(c) Suppose that a source of light is placed at the focus, 
and assume that each ray of light leaving the focus 
is reflected off the parabola in such a way that it 
makes equal angles with the tangent line at the point 
of reflection (the angle of incidence equals the angle 
of reflection). Use (b) to show that after reflection 
each ray points vertically upward, parallel to the axis 
(Fig. 3.1, center).* 

22 The line through a point on a curve which is perpendic¬ 
ular to the tangent at that point is called the normal to 

*This is called the reflection property of parabolas. To form a three- 
dimensional idea of the way this property is used in the design of 
searchlights and automobile headlights, we have only to imagine a 


the curve at that point. Find the normal to the curve 4 y + 
x 2 = 5 at the point (1, 1 ). 

23 Consider the normal to the curve y ~ x — x 2 at the point 
(1, 0). Where does this line intersect the curve a second 
time? 

24 Find the normal to the curve y = 1 — x 2 at the point 
(3, - 8 ). 


mirror constructed by rotating a parabola about its axis and silvering 
the inside of the resulting surface. Such a parabolic reflector can also 
be used in reverse (Fig. 3.1, right), to gather faint incoming rays par¬ 
allel to the axis and concentrate them at the focus. This is the basic 
principle of radar antennas, radio telescopes, and reflecting optical 
telescopes. The great telescope on Palomar Mountain in California 
has a 15-ton glass reflector that is 200 in (almost 17 ft) in diameter, 
and the accurate grinding of this enormous mirror required 11 years 
of work. 


3.2 

THE PRODUCT AND 
QUOTIENT RULES 


In Section 3.1 we learned how to differentiate sums, differences, and constant 
multiples of functions. We now consider 


products uv 


and quotients —, 


where u and v are understood to be differentiable functions of x. 

Since the derivative of a sum is the sum of the derivatives, it is natural to guess 
that the derivative of a product might equal the product of the derivatives. How¬ 
ever, it is quite easy to construct examples showing that this is not true. For in¬ 
stance, the product of x 3 and x 4 is x 7 , so the derivative of the product is lx 6 , but 
the product of the individual derivatives is 3x 2 • 4x 3 = \2x 5 . This shows that our 
preliminary guess about the derivative of a product is incorrect. The correct for¬ 
mula for differentiating products is rather surprising. 


5* The product rule: 


d , v dv du 

— (uv) = u — + V—. 
dx dx dx 


(1) 


Students may wish to keep in mind the following verbal statement of this rule: 
The derivative of the product of two functions is the first times the derivative of 


*We continue the numbering started in Section 3.1. 
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the second plus the second times the derivative of the first. To prove this, we 
write y = uv and let the independent variable x be changed by an amount Ajc, to 
x + Ax This produces corresponding changes Aw, Av, A y in the variables u, v, 
y, and we have 

y + Ay = (u 4- A u)(v + Av) = uv + u Av + v Aw + Au Av, 

Ay = (y + Ay) - y = u Av + v Au + Au Av, 

Ay Av Au . Av 
= u — + v — + Au —. 


Taking limits as A_r —> 0 yields 


dy dv du , _ dv 
Tx~ u Tx + v i; + 0 'Tx' 


which is equivalent to (1). We have used the fact that Au —> 0 as Ajc —» 0. This 
is due to the continuity of w, which follows from the differentiability by the ar¬ 
gument given in Section 2.6. 


Example 1 We first test formula (1) on the factors x 3 and x 4 , whose product we 
already know has derivative lx 6 . We get 


f (* 3 • X 4 ) = X 3 f X 4 + X 4 

ax ax 


d_ 

dx 


x 


3 


= Jt 3 ■ 4 jc 3 + JC 4 ■ 3jc 2 = 7jc 6 , 


as we should. As a more complicated example, we apply our formula to the func¬ 
tion y = (x 3 — Ax)Qx 4 + 2): 

^ = (* 3 - Ax) 4 - Ox 4 + 2) + (3x 4 + 2)~r - Ax) 

dx dx dx 

= (x 3 - 4jc)( 1 2jc 3 ) + (3x 4 + 2)(3 jc 2 - 4) 

= 12jc 6 - 48x 4 + 9x 6 - 12jc 4 + 6x 2 - 8 

= 21x 6 - 60x 4 + 6x 2 - 8. 


Notice that we can also proceed by multiplying the two factors at the beginning 
and then differentiating. This gives 

y = 3jc 7 — 12jc 5 + 2 jc 3 - 8x, 


so 


= 2\x 6 - 60x 4 + 6x 2 - 8, 
dx 

as we expect. Since we can solve this problem without using the product rule, it 
may appear that this rule is unnecessary. This is indeed true when both factors 
are polynomials, because the product of two polynomials is also a polynomial. 
However, in the more complex situations that lie ahead—in which the factors 
are often different types of functions—it will be clear that the product rule is in¬ 
dispensable. 
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6 The quotient rule: 


d I u \ _ vdu/dx — u dvldx 
dx \v) v 2 


at all values of x where v =£ 0. 


(2) 


Most people find it easier to remember the working instructions given by (2) in 
words rather than symbols: The derivative of a quotient is the denominator times 
the derivative of the numerator minus the numerator times the derivative of the 
denominator , all divided by the denominator squared. To prove this, we write 
y = u!v and let x change by an amount At. As before, this produces changes Am, 
Ay, Ay in the variables m, v, y, and we have 


y + Ay = 


u T~ Am 
v + Lv 


u T Am 

v + Ay 


u_ 

V 


Ay = 


uv + v Am — uv — u Ay 
y(y + Ay) 


v Am — u A v 
y(y + Ay) 


Ay _ v Am/At — u A vl At 
At y(y + Ay) 


If we now take limits as At —> 0 we obtain formula (2), 

dy _ v du/dx - u diidx 
dx v 2 


since Ay —> 0 as At —> 0, by the continuity of v (recall that v is continuous be¬ 
cause it is differentiable). 


Example 2 To differentiate the quotient y = (3t 2 — 2)1 {x 1 + 1), we follow the 
verbal prescription, 

dy_ _ (t 2 + \){dldx)Qx 2 - 2) - (3t 2 - 2)(d/dx){x 2 + 1) 
dx (t 2 + 1 ) 2 

= (t 2 + 1 )(6t) - (3t 2 —2)(2t) 

(t 2 + l) 2 

6t 3 + 6t - 6t 3 + 4t = 10t 

(t 2 +1) 2 “(t 2 +1) 2 ‘ 

With practice, calculations like this can be performed very quickly. For instance, 

d 1 (t 2 + 1)(Q) - 1(2t) -2t 

dx T 2 + l (t 2 + 1 ) 2 (t 2 + l ) 2 ’ 

d_ 3t = (t 2 + l)(3) — 3 t(2t) = 3 - 3t 2 
dx T 2 + l (t 2 + l ) 2 (t 2 + l ) 2 ’ 

d 2t + 1 = (3t- l)(2) — (2 t + l)(3) -5 

dx 3t — 1 (3t - l) 2 (3 t — l) 2 ' 


The quotient rule enables us to extend rule 2 of Section 3.1, 

- 7 - x n — nx n ~\ 
dx 


(3) 
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to the case in which n is a negative integer. To make the negative character of n 
more visible, we write n = — m, where m is a positive integer. Now, using (2) 
and the fact that (3) is known to be valid for positive integer exponents, we have 

d d d 1 x m ( 0 ) - 1 ( mx m -') 

dx X dx X dx x m (x m ) 2 


Y m— 1 


~m— 1 — y. Y n— 1 


which proves our statement. Thus, for example, 


- 7 - x 1 = (— 1)jc 2 = — x 2 , - 7 - x 2 = (— 2)x 3 = -2x 3 , 

dx 


d_ 

dx 


etc. 


Since (3) is clearly true for n = 0, we now know that it is valid for all integer 
exponents. 


Example 3 To differentiate 

we write it as 

Then 

which can be rewritten as 


? = 3 * 2 -Jr, 

y = 3x 2 - 2x~ 3 . 


—■ = 6x + 6x 4 , 



if positive exponents are preferred. 


We urge students to memorize the product and quotient rules, and to anchor 
them in their minds by conscientious practice. 


PROBLEMS 

In Problems 1-8, differentiate each function two ways, and 
verify that your answers agree. 

1 (x - \)(x + 1). 

2 (2x - 6)(3x 2 + 9). 

3 (3x 2 + l)(x 3 + 6x). 

4 (x — l)(x 4 4- x 3 + x 2 + x + 1). 

5 (3jc- \)(2x 2 +x). 

6 (jc 3 - 3x)(x 2 + 5). 

7 (4x 5 + x)(3x + 1 ). 

8 (jc 4 + 1)(jc 4 - 1). 


In Problems 9-28, differentiate each function and simplify 
your answer as much as possible. 


9 

x + 1 

10 

1 

X- \' 

x 2 + 2 ' 

11 

2x 3 + 1 

12 

3* + 4 

;t + 2 

7x3- 8 * 

13 

3x 

14 

4x - x 4 

1 + 2x r 

X 3 + 2 ' 

15 

1 -^ 2 

1 +x 2 ' 

16 

2x+ 1 

1 -x 2 ' 
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17 

19 

21 

23 

25 

27 


x - \ x T 

1 


18 4^ 5 - -y. 


2 - ■ 


X 

X — 


x 2 + 2 jc + r 


5__J_ 

X 3 JC 4 

2jc 2 


20 


22 


24 


26 


jc j 

J_ 1 _ 

3jc 4jc 2 ‘ 

3jc 3 + 2jc 2 - 3jc + 7 
2jc -3 


4jc - 


6jc 4 

2 jc jc + 2 


jc - 1 2jc 


28 jc 


1 - 2jc _2 ‘ 

4jc 

jc 3 + 5jc — 3 ’ 
1 


4 _ . 


1 ’ 


In Problems 29-34, find the derivative in two ways, first by 
dividing and then by using the quotient rule, and show that 
your answers agree. 

29 


30 

31 

32 

33 

34 


2x + 6jc 4 —2jc 6 


+ JC 4 


3jc 4 — 4jc 3 + 5jc —4 


jc + 15 


6 — 3jc 2 +. 


35 Find all points on the curve y = 6/x where the tangent is 
parallel to the line 2jc + 3y + 1 = 0. 

36 Find the equations of the tangent lines to the curve y = 
(jc — 1)/(jc + 1) that are parallel to the line jc —2 y = 1. 


37 Sketch the graph of the curve y = jc/(jc + 1). How many 
tangent lines pass through the point (1, 3)7 Find the jc- 
coordinates of the points of tangency of these lines. 

In Problems 38-43, find the equations of the stated lines. 

38 The tangent and normal to y = ^ ^ at (1, 2). 


39 The tangent and normal to y = 


at jc = 2. 


40 The tangent to y = 


JC 3 + JC 


at (2, 10). 


1 — 2 jc + 3jc 2 

41 The normal to y = --- ? -at (0, 1). 

1 + 

i 

42 The tangent and normal to y = 


2x - 


■ at (2, i). 


jc — 2 

43 The tangent and normal to y =-- at (2, 0). 

jc + 1 

44 Show that the tangents to the curves y = (jc 2 + 45)/jc 2 
and y = (jc 2 - 4)/(jc 2 + 1) at jc = 3 are perpendicular to 
each other. 

45 Let P be a point on the first-quadrant part of the curve 
y = 1 /jc. Show that the triangle determined by the jc-axis, 
the tangent at P, and the line from P to the origin is 
isosceles, and find its area. 

46 Use the product rule to verify rule 3 of Section 3.1: If c 
is a constant and u is a function of jc, then 

d du 

— (cm) = c — 
dx dx 

47 Sketch the curve y = 2/(1 + jc 2 ), and find the points on 
it at which the normal passes through the origin. 

48 Verify the location of the high and low points on the 
graph of 


y jc 2 - 3jc + 2 

as stated in Example 4 of Section 1.6. 


3.3 

COMPOSITE 
FUNCTIONS AND 
THE CHAIN RULE 


Let us consider the problem of differentiating the function 

y = U 3 + 2) 5 . (1) 

We can do this with the tools we now have by using the binomial theorem to ex¬ 
pand the function into the polynomial 

y = jc 15 + IOjc 12 + 40jc 9 + 80jc 6 + 80jc 3 + 32. (2) 


It now follows at once that 


= 1 5jc* 4 + 120JC 11 + 360jc 8 + 480jc 5 + 240jc 2 . (3) 

dx 

In this case the work of expansion is bothersome but not too difficult. However, 
few of us would willingly attempt to carry out the same procedure for the func- 
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tion y = (x 3 + 2) 100 . It is much better to develop the chain rule, which enables 
us to differentiate both functions with equal ease—and a host of others as well. 

For this purpose it is important to understand the structure of the function (1). 
We accomplish this by introducing an auxiliary variable u = x 3 + 2, so that (1) 
can be decomposed into simpler pieces as follows: 

y = u 5 where u = x 3 + 2. (4) 

Working in the other direction, we can reconstruct (1) out of these pieces by sub¬ 
stituting the expression for u into y = u 5 . Such a function is called a composite 
function , or often a function of a function. We have already encountered this idea 
in Section 1.5. In general, suppose that y is a function of w, where u in turn is a 
function of x, say 

y = /(«) where u = g(x). ( 5 ) 

The corresponding composite function is the single function 

>’ = f(g(x)\ ( 6 ) 

obtained by substituting u = g(x) into y =f(u). 

Our position now is this. We assume we are confronted by the composite func¬ 
tion (6), and we wish to learn how to differentiate it by decomposing it into the 
simpler functions (5) and using the presumably simpler derivatives of these func¬ 
tions. This is what the chain rule is all about. 

7 The chain rule: Under the circumstances described above , we have 

dy_ _ dy_ du_ 

dx du dx' K) 

As we see, in this form the chain rule has the appearance of a trivial algebraic 
identity; it is easily remembered because the Leibniz fractional notation for 
derivatives suggests that du can be canceled from the two “fractions” on the 
right. Its intuitive content is easy to grasp if we think of derivatives as rates 
of change: 

If y changes a times as fast as u 
and u changes b times as fast as x, 
then y changes ab times as fast as x. 

Or, in everyday terms, if a car travels twice as fast as a bicycle and the bicycle 
is four times as fast as a walking man, then the car travels 2-4 = 8 times as fast 
as the man. 

Before looking into the proof of the chain rule, let us see how it applies to the 
problem just discussed, in which (1) is the given function and (4) is its decom¬ 
position. Formula (7) gives 

T L = T L "7 i = 5« 4 • 3* 2 = 15*V + 2) 4 , (8) 

dx du dx 

where the auxiliary variable u is replaced by x 3 + 2 in the last step. It is not im¬ 
mediately obvious that this result is the same as (3), but the equivalence is easy 
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to establish.* Further, the derivative of y = (.x 3 + 2) 100 can be computed just as 
easily in just the same way: We write 

y = w 100 where w = x 3 + 2 

and use (7) to obtain 


dy _ dy_ du 
dx du dx 


100w" • 3x 2 


= 300x 2 (x 3 + 2)". 


As these examples show, the chain rule is a very powerful tool. 

We begin the proof of (7) with the usual change Ax in the independent vari¬ 
able x; this produces a change A u in the variable w, and this in turn produces a 
change Ay in the variable y. We know that differentiability implies continuity, so 
Aw —> 0 as Ax —> 0. If we look at the definitions of the three derivatives we are 
trying to link together, 


dy Ay dy Ay du A u 

— = lim —, = lim — = lim —— 

dx Ax— »o Ax du Aw— >o Aw dx Ax 


(9) 


then it is natural to try to complete the proof as follows: By simple algebra we 
have 


Ay _ _Ay Aw 
Ax Aw Ax’ 


( 10 ) 


so 


dy .. Ay Ay Aw 

— = lim — = lim —-— 

dx A*->o Ax Ax—>o Aw Ax 


r 

lim —— 
Ajc— >0 Aw 


lim n ^ 

Ax—>0 Ax 


Ay 

lim -7— 

Aw—>0 Aw 


A u 
lim —— 

Ax— >0 Ax 


dy_ 

du 


du 

dx 


( 11 ) 


This reasoning is almost correct, but not quite. The difficulty centers on a pos¬ 
sible division by zero. In computing dy/dx from the definition in (9), we know 
that it is part of the meaning of this formula that the increment Ax is small and 
approaches zero but is never equal to zero. On the other hand, it can happen that 
Ax induces no actual change in w, so that A u = 0, and this possibility invalidates 
(10) and (11). This flaw can be patched up by an ingenious bit of mathematical 
trickery. We give the argument in the footnote below for students who wish to 
examine it. 1 " 

It will become clear as we proceed that the chain rule is indispensable for 
almost all differentiations above the simplest level. An important special case 


*We hope students did not accept the expansion in (2) — and similarly will not accept the stated equiv¬ 
alence of (8) and (3) — without checking the details for themselves. Total skepticism is the recom¬ 
mended state of mind for studying this (or any similar) book: Take nothing on faith; verify all omit¬ 
ted calculations; believe nothing unless you have seen and understood it for yourself. 
f We begin with the definition of the derivative dy/du, which is 

T 1 = lim 4*- 

du Aw—>o Aw 


This is equivalent to 
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has been illustrated in connection with finding the derivatives of (x 3 + 2) 5 and 
(. x 3 + 2) 100 . The general principle here is expressed by the formula 


dx 


( Y = n( r~ l 



where any differentiable function of jc can be inserted in the parentheses. If we 
denote the function by u , the formula can be written as follows. 


8 The power rule: 


\i n = ytu n 1 


dx 


du 

dx' 


( 12 ) 


At this stage of our work we know that the exponent n is allowed to be any pos¬ 
itive or negative integer (or zero). In Section 3.5 we will see that (12) is also 
valid for all fractional exponents. 

Example 1 To differentiate y = (3.x 4 -F l) 7 , we make a routine application of 

( 12 ): 

= 7(3jc 4 + l) 6 4~ (3jc 4 + 1) = 7(3jc 4 + l) 6 • 12 a: 3 . 
dx dx 

But to differentiate y = [(3jc 4 + l ) 7 + l] 5 , we apply (12) twice in succession: 

= 5[(3a: 4 + l) 7 + l] 4 4- [Ox 4 + l) 7 + 1] 
dx dx 

= 5[(3a: 4 + l) 7 + l] 4 • 7(3 a: 4 + l) 6 4~ Ox 4 + 1) 

dx 

= 5[(3x 4 + l) 7 + l] 4 • 7(3x 4 + l) 6 • 12x 3 . 

After this procedure becomes familiar and more or less automatic, it is often pos¬ 
sible to skip the intermediate steps and write down the answer at once. For the 
sake of clarity, in these calculations we have left the various factors in exactly 
the positions where they appear in the successive steps of the work. Normally, 
of course, we tidy things up a bit, and write the first answer, for example, in the 
more compact form 84 jc 3 (3jc 4 + l) 6 . 


Example 2 If y = [(1 — 2jc)/( 1 + 2.x)] 4 , then by (12) and the quotient rule we 
have 


±L=dy- + € 

Am du 
or 

Ay = Am + e Am, 
du 

where e—> 0 as Am —» 0. It is assumed in these equations that Am is a nonzero increment in u, but 
the last equation is valid even when Am = 0. Dividing this by a nonzero increment A* yields 

Ay _ dy_ _Am + Am 
A x du Ax Ax’ 

and on letting Ax —> 0 we obtain the chain rule (7), since e —» 0. 
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dy_ = / 1 ~2x \3 d_ / 1 - 2 jc \ 

\ 1 + 2.x) dx \ 1 + 2xJ 

( \ -2x \3 (1 +2jc)(-2)-(1 -2jcX2) 

4 \ \ +2x) ' (1 + 2a:) 2 

= -16(1 - 2a:) 3 
(1 + 2x) s ' 


Example 3 If y = (x 2 — 1 ) 3 (x 2 + 1) 2 , then by combining (12) with the product 
rule we have 

^- = (x 2 - l ) 3 -f (x 2 + 1 r 2 + (x 2 + IT 2 -f (* 2 - i ) 3 
dx dx dx 

= (x 2 - l ) 3 • (—2)(jc 2 + 1)- 3 (2jc) + (x 2 + 1 r 2 • 3(x 2 -\) 2 (2x). 

To simplify this, we take out the factor 2 jc(jc 2 — l) 2 , get rid of the negative ex¬ 
ponents, and reduce to a common denominator: 


&- = 2x(x 2 - l ) 2 
dx 


~2(x 2 - 1 ) 3 

(jc 2 +1) 3 (jc 2 +1) 2 


= 2x(x 2 - l ) 2 


-2(x 2 - 1 ) + 3(x 2 + 1)" 
(x 2 + l ) 3 


2x(x 2 - 1 ) 2 (jc 2 + 5) 
(x 2 + l ) 3 


In Chapter 4 we will be using derivatives as tools in many concrete problems of 
science and geometry, and it will then be clear that it is worth a little extra ef¬ 
fort to put the derivatives we calculate into their simplest possible forms. 


There are a few concluding remarks that ought to be made. We have not yet 
explained why the expression “chain rule” is appropriate. The reason is this. In 
(7) we are dealing with three variables y, u, and x that are linked together step 
by step in a chain in such a way that each is dependent on the next. We can sug¬ 
gest this relation by writing 

y depends on u depends on x. 

The formula 

dy _ dy_ du 

dx du dx 

tells us how to differentiate the first variable with respect to the last by taking 
into account each individual link in the chain. This formula can easily be ex¬ 
tended to more variables. For instance, if x depends in turn on z, then 

dy _ dy du dx 
dz du dx dz ’ 

if z depends on w, then 

dy _ dy du dx dz 

dw du dx dz dw ’ 

and so on. Each new variable adds a new link to the chain and a new derivative 
to the formula. 
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PROBLEMS 


Find dy/dx in Problems 1-10. 

1 (2 — 5 *) 2 

2 y = (4 + 5*) 4 . 

3 y = (x 2 + 4x - l) 3 . 

A - X ~ 2 

y ~ (3x + 5) 3 ' 

5 y = (5 - X? (4 + x) 5 . 

6 y = [1 + (1 + *) 5 ] 6 . 

- 1 

y (3x + 1 ) 4 ' 

8 y = (5 - 7x 4 )~ 5 . 

9 y = (1 - 6x) 6 . 

2-x 


10 y = 


(3 - a:) 4 ' 


In Problems 11-16, express dy/dx in terms of jc. 

11 y = (1 — u) 4 , Li = 

12 y = \ — ~r, u = 3x - 1. 

u z 

13 y = (1 + w 2 ) 2 , u = (2x + l) 2 . 

1 


14 y = u 7 , u = 


3 - 4jc' 


15 y = u( 1 — u) 4 , u = —j. 


ic U X 

16 v = --, u = --. 

7 1 + U 1 + JC 


In Problems 17-32, find dy/dx. 

17 y = (x 5 - 3x) 4 . 

18 y = (x 2 - 2) 500 . 

19 y = (jc + x 2 — 2x 5 ) 6 . 

20 y = ( \-3x)~K 

21 y = (12 - x 2 r 2 . 

22 y = [1 - (3x- 2) 3 ] 4 . 

23 y = (x 2 + 3jc - 5) 7 . 

24 y = (jc 3 — 7jc) 5 . 

25 y = (3x 2 - 5x + 2 )“ 6 . 

26 ^ (* 3 -5* + l) 5 ' 

27 y = (5 jc + 3) 4 (4jc - 3) 7 . 

28 y = (x 2 - 2 ) 5 (jc 2 + 2 ) 10 . 

29 y — x 2 (9 — x 2 )~ 2 . 

30 y = (\ - 2x)~ 4 (x 2 - x) 2 . 

31 ^ = (2jc - 3) 8 (3jc 2 - jc -h 2) 10 . 

32 y = (5x 2 + 6) 3 (JC 3 _ 3) 4 


In Problems 33-36, find ds/dt. 
( 2 t - l ) 3 


33 5 = 

34 s = 

35 5 = 

36 s = 


(r 2 + 3) 2 ' 
1 

(2 1 - l) 2 ' 
6 

(5 - 4r) 3 ' 
t 4 - 10 r 2 
(r 2 - 6) 2 ■ 


In Problems 37-39, find dy/dx by two methods, and verify 
that your answers agree. 

37 y = (2jc - 1) 5 U + 3 ) 5 = (2jc 2 + 5jc - 3) 5 . 


38 y = 

39 y = 


(1 - 2jc 2 ) 3 
(3x + l ) 4 

a - 2x) 4 ' 


= (i - 2jc 2 r 3 . 

3jc + 1 \ 4 
1 - 2jc 


In Problems 40-42, find dy/dx as a function of jc in two ways, 
first without the power rule and then using the power rule. 

40 y = u 2 , u = x 2 - 3x 4- 2. 

41 y = u 2 - 3u + 2, u = 7 - 5x. 

J_ 

JC 


42 y = w 3 , u = x 


In Problems 43 and 44, find the equation of the tangent line 
to the given curve at the given point. 

43 y = (jc 3 - jc 2 + jc) 8 , (1,1). 


44 y = ■ 


r, (3,-3). 


(8 — x 2 ) 5 ’ 

45 If u is a function of x, express each of the following in 
terms of u and dii/dx\ 

(a) £ « 3 ; 

(b) -f (2u- l) 2 ; 
dx 

(c) -J- (id -If. 
dx 

46 Find a function y = f(x) for which 

(a) = 2( X 2 - 1 ) • 2x\ 

dx 


(b) ^=4(* 2 - 
dx 

(d) & = 3 (* 3 - 


1 ) 2 • 2x\ 
2) ■ 3x 2 ; 
2 ) 2 • 3x 2 . 


46 
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3.4 

SOME TRIGONOMETRIC 
DERIVATIVES 



So far, the only truly fundamental functions we have learned how to differenti¬ 
ate are the simple power functions x n \ 

d 

— x n = nx n . 
dx 

All other functions have been constructed from these by addition, subtraction, 
multiplication, division and by forming a function of a function, and our general 
rules have allowed us to find the derivatives of these combinations. We now ex¬ 
pand our kit of tools beyond elementary algebra by learning how to differentiate 
the basic trigonometric functions sin x and cos jc: 

d 

— sin jc = cos jc (1) 

dx 

and 

d 

— cos x = - sin jc. (2) 


To obtain these formulas we go back to the definition of the derivative of an 
arbitrary function /(jc), 


—/(jc) = lim 
dx J a*—» o 


fix + Ajc) -f{x) 
Ajc 


When we apply this definition to the function /(jc) = sin jc, and use the addition 
formula for the sine [identity (3) in Section 1.7], we obtain 

d . ,, sin (jc + Ajc) - sin jc 

— sin jc = lim -:- 


= lim 

Ax-^0 


sin jc cos Ajc + cos jc sin Ajc — sin jc 
Ajc 


(3) 


An algebraic rearrangement of (3) gives 


— sin jc = lim 
dx Ax-*0 


COS JC 


sin Ajc 
Ax 


- sin x 


1 — cos Ax 
Ax 



sin Ax 


1 

— cos Ax 

lim 

Ax 

— sin x 

lim 

Ax 

Ajc —»0 


Ax —»0 


(4) 


since cos x and sin x are constants with respect to this limit operation. The lim¬ 
its in brackets here are precisely the limits (3) and (4) in Section 2.5, with 6 re¬ 
placed by Ax: 


lim 
A *—0 


sin Ax 
Ax 


and 


lim ■ 
Ax-»0 


- cos Ax 
Ax 


= 0 . 


These facts enable us to write (4) as 


— sin x = cos x • 1 — sin x • 0 
dx 


= COS X, 

which is ( 1 ). 

The proof of (2) is similar, with the difference that we use the addition for¬ 
mula for the cosine [identity (4) in Section 1.7]: 
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d 

— cos 
dx 


x = lim 

Ac—>0 


cos (x + Ax) — COS X 
Ax 


= lim 

Ac^O 


cos x cos Ax — sin x sin Ax - cos x 
Ax 


- lim 

At—>0 


sin Ax 
Ax 


1 - cos Ax 
Ax 




sin Ax 


1 

— cos Ax 

sin x 

lim 

Ax 

— COS X 

lim 


Ac—>0 


Ac—>0 

Ax 


= — sin x ■ 1 — cos x • 0 = —sin x. 


This establishes (2). 

When (1) and (2) are combined with the chain rule, we have the main tools 
of this section, 


and 


d . du 

— sin u = cos u — 
dx dx 


d 

— cos u - —sin u 
dx 


du 
dx ’ 


(5) 


( 6 ) 


where u = u(x) is understood to be any differentiable function of x. Students must 
learn to use these formulas in combination with all previous rules of differenti¬ 
ation. 


Example 1 Find dy/dx if y = sin (5 + 4x 3 ). Here u = 5 4- 4x 3 , so by (5), 

= cos (5 + 4x 3 ) 4~ (5 + Ax 3 ) = I2x 2 cos (5 + 4* 3 ). 
dx dx 


Example 2 Find dyldx if y = cos (sin x). Here u = sin x, so by ( 6 ) and (1), 


— = -sin (sin x) — (sin x) = -cos x • sin (sin x). 
dx dx 


Example 3 Find dyldx if y = sin [(1 — x 2 )/(l + x 2 )]. Here u = (1 — x 2 )/ 
(1 + x 2 ), so by (5) and the quotient rule, 


dy_ = 
dx 


cos 


= cos 


( l ~x 2 \ d_ / I -x 2 \ 

\ 1 + x 2 / dx \ 1 + x 2 / 

/ 1 - x 2 \ (1 + x 2 )(—2x) - (1 - x 2 )2x 

\l+x 2 /‘ (1+x 2 ) 2 


—4x 

(1 +x 2 ) 2 


cos 



Example 4 Find dy/dx if y = cos (1 + sin 5x). Here u = 1 + sin 5x, so finding 
du/dx requires an application of the chain rule, and we have 























THE COMPUTATION OF DERIVATIVES 


= -sin (1 + sin 5 jc) - 7 - (1 + sin 5 jc) 
dx ax 

d 

= —sin (1 + sin 5 jc) * cos 5x * — (5jc) 

dx 

= — 5 cos 5 jc * sin (1 + sin 5 jc). 


In these examples the chain rule acquires additional scope by being used in 
ways not covered by the ideas of the preceding section. 

We remind the reader of the standard notational convention for powers of the 
trigonometric functions: In general sin".* means (sin x) n . However, it must not be 
forgotten that (sin x)~ l is never written sin -1 *. The reason for this is that the lat¬ 
ter notation is reserved for the inverse sine function, which will not play any part 
in our work until Chapter 9 but will be in regular use from that point on. 


Example 5 Find dy/dx if y = sin 5 lx 2 . Here we let w = sin lx 2 . Then y = w 5 
and 


dy_ 

dx 


= 5w 4 -j- = 5w 4 • cos lx 2 • —— {lx 2 ) 
dx dx 


= 5w 4 • cos lx 2 • 14* 


= 70jc sin 4 lx 2 • cos lx 2 . 


In Section 1.7 we stated that in calculus it is preferred to use radian measure 
for angles instead of degree measure. We are now able to explain the reason for 
this. Let sin x° and cos x° denote the sine and cosine of an angle of x degrees. 
Since an angle of x degrees has radian measure 7r*/l 80, we have 


Then 


dx 


sin jr 


7TX d I TJX \ _ TT 7TX 

180 dx U80 j ~ T80 C0S 780’ 


SO 


l sinx ° = iio cos/ 


If we insist on using degrees to measure angles, then we are forced to use this 
formula instead of the simpler formula (1). We therefore use radian measure to 
prevent our calculations from being cluttered up by the repeated occurrence of 
the unwelcome factor 77 /I 8 O. 

The remaining four trigonometric functions can be defined in terms of sin x 
and cos x, as at the end of Section 1.7, and their derivatives are calculated from 
these definitions. Here are the definitions again: 
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1 

sec x =-, 

cos JC 


1 

CSC x = -. 

sin x 


These are the tangent, cotangent, secant, and cosecant functions. All four of these 
functions will be thoroughly developed in Chapter 9 and used extensively there¬ 
after, but for the present we confine our attention to the tangent function and its 
derivative, 


— tan jc = sec 2 jc. ( 8 ) 

ax 


To establish this formula we refer to (7) and use the quotient rule: 

d d sin x cos x • cos x — sin x • (—sin jc) 

— tan x = —-= - 3 - 

dx dx cos jc cos z jc 

cos 2 jc + sin 2 jc 1 0 

=- z -- = — -z — = sec z jc. 

COS z JC COS z JC 


The chain rule extension of ( 8 ) is 

d 0 du 

— tan u = sec z u —. ( 9 ) 

dx dx 


Example 6 Find dyldx if y = tan 5 (3x 2 + 1). If we put w = tan (3x 2 + 1), then 
y = w 5 and by (9), 

•^ = 5 w A ^ = 5w 4 • sec 2 (3jc 2 + \) ■ ~ (3 * 2 + D 
dx dx dx 

= 5vv 4 • sec 2 (3jc 2 + 1) • 6jc 

= 30jc • tan 4 (3x 2 + 1 ) • sec 2 (3jc 2 + 1). 


PROBLEMS 


In Problems 1-32, find dy/dx. 



25 

1 

y = sin (5jc — 2). 

2 

y = cos (jc 5 + 1 ). 

3 

y = sin (cos jc). 

4 

y = sin [sin (sin jc)]. 

27 

5 

y = sin 3 jc. 

6 

y = sin 2 (4jc - 1 ) 3 . 

7 

sin jc 

y — - 

1 + COS JC 

8 

y = cos 3 4jc. 

29 

31 

9 

y = sin jc 3 . 

10 

y = (1 + sin 2 jc) 4 . 

33 

11 

y = tan 5jc. 

12 

y = tan 2 3jc. 


13 

y = tan (sin jc). 

14 

y = tan 3 jc 4 . 


15 

y = (1 + tan 2 * 2 ) 2 . 

16 

y = 1 - cos 3jc. 


17 

y = 5 sin 3jc — 3 cos 5jc. 

18 

y = 3 sin (4jc - 5). 


19 

y = cos 3(5jc - 3) 3 . 

20 

y = cos 7 2jc. 

34 

21 

sin jc 

y — 

1 — sin jc 

22 

y = sin jc cos 2 jc. 

35 

23 

1 — cos 3jc 
^ sin 3jc 

24 

y = jc sin jc. 

36 


COS JC 

y = 

26 

y = sin jc — jc cos jc. 

JC 

3 • 1 

y = jc j sin — 

28 

y = sin 2 jc cos 3jc. 

X 

y = (2 — cos 2 jc) 3 . 

30 

y — sin 2 jc + cos 2 jc. 

y = sin (tan jc). 

32 

y = tan 2 (1 - sin 3 jc). 

Derive formula (2) in 

another way, by using the identi- 

ties 

. / TT \ 

and 

i tt \ 

cos jc = sin 1 —— jc 1 

sin jc = cos 1 — — jc 1. 


Find the values of jc for which the graph of y = x + 
2 sin jc has a horizontal tangent. 

Same as Problem 34 for y = (cos jc)/(2 + sin jc). 

By differentiating the first of the following double-angle 
formulas, obtain the second: 
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sin 2x = 2 sin x cos x, cos 2x = cos 2 x — sin 2 x. 

37 An object moving on a straight line has position 5 = 
A cos kt at time t, where A and k are constants. 

(a) Describe the motion, giving physical meaning to the 
constants A and k. 

(b) Find the velocity v. 

(c) Show that v 2 + k 2 s 2 has the same value at all times. 
Give a physical interpretation. 

(d) Find the acceleration a and show that a is propor¬ 
tional to 5 but oppositely directed. 

Trigonometric identities are often useful in simplifying the 
form of functions or their derivatives. We mention particularly 


the double-angle formulas in Problem 36 and the half-angle 
formulas in Problem 11 of Section 1.7. Show that each of the 
following derivatives can be expressed in the given form. 

38 - 7 - \~rx — 7 — sin 2ax\ = sin 2 ax. 
dx \ 2 4 a ) 


39 - 7 - I + - 7 - sin 2ax ) = cos 2 ax. 
dx \ 2 4 a 


40 — f— sin ax —— sin 3 ax) = cos 3 ax. 
dx \ a 3 a 


41 — —— cos ax —— sin 2ax sin ax) = sin 3 ax. 
dx \ 3a 6 a / 


3.5 

IMPLICIT FUNCTIONS 
AND FRACTIONAL 
EXPONENTS 


Most of the functions we have met so far have been of the form y = f(x\ in which 
y is expressed directly—or explicitly—in terms of x. In contrast to this, it often 
happens that y is defined as a function of x by means of an equation 

F(x,y) = 0, (1) 

which i s not solved for y but in which x and y are more or less entangled with 
each other. When x is given a suitable numerical value, the resulting equation 
usually determines one or more corresponding values of y. In such a case we say 
that equation ( 1 ) determines y as one or more implicit functions of x. 


Example 1 (a) The very simple equation xy = 1 determines one implicit func¬ 

tion of x, which can be written explicitly as 


y = 


]_ 

X 


(b) The equation x 2 + y 2 = 25 determines two implicit functions of x, which 
can be written explicitly as 

y = V25 - x 2 and y = — V25 - x 2 . 



Figure 3.2 


As we know, the graphs of these two functions are the upper and lower halves 
of the circle of radius 5 shown in Fig. 3.2. 

(c) The equation 2x 2 — 2xy = 5 — y 2 also determines two implicit functions. 
If we use the quadratic formula to solve for y, we find that these functions are 

y = x + V5 — x 2 and y = x — V5 — x 2 . 

(d) The equation x 3 + y 3 = 3 axy ( a > 0) determines several implicit func¬ 
tions, but the problem of solving this equation for y is so forbidding that we might 
as well forget it. 


It is rather surprising that we can often calculate the derivative dy/dx of an im¬ 
plicit function without first solving the given equation fory. We start the process 
by differentiating the given equation through with respect to x, using the chain 
rule (or power rule) and consciously thinking of y as a function of x wherever it 
appears. Thus, for example, y 3 is treated as the cube of a function of x and its 
derivative is 
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dy. 

dx' 


and x 3 y 4 is thought of as the product of two functions of x and its derivative is 

- 7 - (jc 3 y 4 ) = x 3 • 4 y 3 y- + y 4 • 3x 2 . 
dx dx 

To complete the process, we solve the resulting equation for dy/dx as the un¬ 
known. This method is called implicit differentiation. We show how it works by 
applying it to the equations in Example 1. 


Example 2 (a) We can think of the equation xy = 1 as stating that two func¬ 

tions of x (namely, xy and 1) are equal. It follows that the derivatives of these 
functions are equal, so 


x — + y = 0 or 
dx 


dy_ = _y_ 

dx x 


In this case it is possible to solve the original equation for y and check our re¬ 
sult: Since y = 1/jc, the formula we have just obtained becomes 

dy _ _y_ _ _J_ __1_ J_ 1_ 

dx x x y xx x 2 ' 


and differentiating y = \lx directly also yields 


dy _ 1 

dx x 2 ' 


(b) From the equation x 1 4- y 2 = 25 we get 


lx + 2y = 0 
dx 


or 


dy _ _x_ 
dx y‘ 


This gives the correct result whichever of the two implicit functions we are think¬ 
ing about. Thus, at the point (4, 3) on the upper curve in Fig. 3.2, the value of 
dy/dx is — 7 , and at (4, -3) on the lower curve, its value is 7 . 

(c) If we apply this process of implicit differentiation to the equation 2x 2 — 
2 xy = 5 — y 2 , we obtain 


4x - 2x 


dy 

dx 


2 y = -2 y 


dy_ 

dx 


dy 2x - y 

or —- =-. 

dx x - y 


(d) In Example 1 (d) the derivative dy/dx is clearly beyond direct calculation. 
However, it is easily found by our present method: Since x 3 + y 3 = 3 axy, we 
have 


3jc 2 + 3 y 2 — 3 ax y + 3 ay 

dx dx 


dy _ ay - x 2 
dx y 2 — ax' 


It is apparent that implicit differentiation usually gives an expression for dy/dx 
in terms of both jc and y, instead of in terms of jc alone. However, in many cases 
this is not a real disadvantage. For instance, if we want the slope of the tangent 
to the graph of the equation at a point (xo, yo)> all we h ave to do is substitute xo 
and y 0 for x and y in the formula for dy/dx. This is illustrated in Example 2(b) 
for the points (4, 3) and (4, —3). 
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We now use implicit differentiation to show that the vital formula 

- 7 - x n = nx n ~ x ( 2 ) 

dx 

is valid for all fractional exponents n = plq* 

For the sake of convenience, we begin the proof of (2) for fractional exponents 
by introducing y as the dependent variable, 

y = x p,q . 

Raising both sides of this to the q\h power yields 

y? = x p - 

and by differentiating implicitly with respect to t and using the known validity 
of the power rule for integral exponents, we obtain 

qyq ~'^ =pxP ~' 


or 

dy _ p x p ~ x 
dx q y q ~ x ' 

But y q ~ 1 = yQ/y = x p lx plq , so 

dy_ = p^ xP^_ = p_x^y_ xp/q = £ xplq _^ 

dx q y q 1 q x p q 

and the proof is complete. 


Example 3 We immediately have 


jL y \/2 — _L y- M2 A. r —2/3 
dx 2 ’ dx 



A- r 5/4 _ A yl/4 

dx 4 X 


The first of these derivatives is often used in the form 

1 


— Vx = 

dx 


2V7 


This formula was established directly from the definition of the derivative in Ex¬ 
ample 3 of Section 2.3. 


Example 4 By the chain rule, the power rule of Section 3.3 is now known to be 
valid for all fractional exponents. Accordingly, 

4- (4 - x 2 r 5/2 = ~ (4 - x 2 r 112 4- (4 - * 2 ) 

dx 2 dx 

= -f(4-^)-(-2x)=^ ¥I . 


^Students who are comfortable with fractional exponents should ignore this footnote. However, for 
those who have forgotten the meaning of these exponents, we offer a brief review. We begin by re¬ 
calling that the square root Vx, the cube root Ax, and more generally the gth root Ax, where q is 
any positive integer, are all defined for jc > 0; if q is odd. Ax is also defined for x < 0. The defini¬ 
tion of fractional exponents now proceeds in two stages: First, x l,q is defined for q > 0 by x llq = 
Ax\ and second, if p/q is in lowest terms and q > 0 , x plq is defined by x plq = (x l/q ) p . It is sometimes 
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Example 5 In differentiating expressions containing radicals, it is necessary to 
begin by replacing all radical signs by fractional exponents. Thus, 

-f , * = -j- x(x 2 - 1)- 1/2 =* f(x 2 - i)-» 2 (2x) + (X 2 -1 r m 

ax \ x 2 _ j ax \ 2 / 

-jc 2 1 -x 2 + (x 2 - 1) _ -1 

“ (x 2 - 1) 3/2 (x 2 - 1 ) 1/2 “ (x 2 - l) 3 ' 2 (x 2 - 1) 3/2 ' 


For convenience of reference, we list together all the differentiation rules de¬ 
veloped in this chapter. 


1 

2 

3 

4 


T x c = a 


— x n = nx n 1 (n any integer or fraction). 
ax 

d du 

— (cm) = C —. 

dx dx 

d du dv 

- ( m+^)--+—. 


5 

6 

7 

8 
9 

10 


, . d / dv du 

The product rule. — ( uv ) = u — + v —. 

dx dx dx 

v dufdx — u dvldx 


The quotient rule. — — = 

dx \ vJ u 

The chain rule: ■ ^r- 

dx du dx 

The power rule. -7 - u n = nu n ~ x ~~ ( n any integer or fraction). 
dx dx 

d . du 

— sin u — cos u —. 

dx dx 

d . du 

— cos u = -sin u —. 

dx dx 


These rules will be used in many ways in almost everything we do from this 
point on. We therefore urge students who have not already done so to commit 
them to memory and practice them until their use becomes almost automatic. The 
eminent philosopher A. N. Whitehead might well have had these rules in mind 
when he said, “Civilization advances by extending the number of important op¬ 
erations which we can perform without thinking about them.” 

It is worth pointing out that most mistakes in differentiation come from mis¬ 
using the power rule or the quotient rule. For instance, in applying the power rule 
it is easy to forget the essential final factor du/dx: 

Common mistake Right answer 

(1 + 6x 2 ) 4 = 4(1 + 6x 2 ) 3 4(1 + 6x 2 ) 3 • I2x 

-7- (1 + 2 *) 1/3 = ^(1 + 2*r 2/3 yd + 2x )“ 2/3 • 2 


useful to know (and it is not difficult to prove) that (x p ) llq = (.x xtq ) p if jc > 0. For example, 8 2/3 is 
easy to evaluate both ways, since 8 2/3 = (8 2 ) 1/3 = 64 1/3 = 4 and S 2 *' 3 = (8 1/3 ) 2 = 2 2 = 4; but 32 3/5 = 
(32 3 ) 1/5 is hard, while 32 3/5 = (32 1/5 ) 3 = 2 3 = 8 is easy. 
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The difficulty with the quotient rule lies in remembering the order of subtraction 
in the numerator. If we forget, one way of quickly recalling the correct order is 
to use the product rule as follows: 


d_ 

dx 



d . _i. , 1N _ 2 dv , du 

—r~ (uV = U ' \)V 2 —- V V —— 

dx dx dx 


J_ du _ u chy _ v du/dx — u dv/dx 
v dx v 2 dx v 2 



Remark The equation in Example 1(d) has a long history and deserves a bit of 
further comment. Its graph is called th e folium of Descartes and is shown in Fig. 
3.3. If we consider the simplest case by putting a — 1, the equation becomes 

jc 3 + y 3 = 3ry, (3) 

and the problem of solving this for y in terms of x—which we airily dismissed 
above—is not absolutely out of the question. And thereby hangs a tale of con¬ 
siderable historical interest. 

In 1545 the boisterous Italian physician-mathematician-astrologer Girolamo 
Cardano (1501-1576) discovered a formula for solving any cubic equation by 
means of radicals.* * This formula resembles the familiar quadratic formula but is 
much more complicated. If Cardano’s formula is used to solve equation (3) for 
y, the three solution functions that arise are t 


y i = 







3 


and 


.V 2 , y3 = -\yi 


— tV-3 I 


4 + 




The method of implicit differentiation as carried out in Example 2(d) is clearly 
preferable to the task of directly differentiating horrors like these. Furthermore, 
implicit differentiation works just as easily for equations such as 

x 5 + 5jt 4 y 2 + 3xy 3 + y 5 = 1, 

which are actually impossible to solve for y in terms of xf 


*At one point in his turbulent life Cardano was imprisoned for heresy: his offense was that he pub¬ 
lished a horoscope for Jesus. 

f These formulas can be obtained from the ideas in Chapter X of H. Tietze, Famous Problems of 
Mathematics (Graylock Press, 1965). 

*In 1824 the Norwegian mathematician Niels Henrik Abel (1802-1829) proved that no general for¬ 
mula exists for solving a fifth-degree equation by means of radicals, as is possible for equations of 
lower degree. This young man made many profound discoveries in his short life, and it has been said 
that he was the greatest genius produced by the Scandinavian countries. 
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PROBLEMS 


In Problems 1-10, find dy/dx by implicit differentiation. 

1 3jc 3 + 4y 3 + 8 = 0. 

2 xy 2 - x 2 y + x 2 + 2 y 2 = 0. 

3 x — y - y 7 . 

4 jc 4 y 3 — 3jcy = 60. 

5 jc 3 — y 3 = 4 xy. 

6 1 + 1=1. 

y 

7 Vjc + Vy = 6. 


x-y 

9 x 2/3 + y 2/3 = 1. 

10 vTy + 4 = y. 

In Problems 11-18, find dy/dx by implicit differentiation and 
also by solving fory and then differentiating, and verify that 
your two answers are equivalent. 

11 3xy + 2 = 0. 

12 x 2 T y 2 = 9. 

13 y 2 = 3x — 1. 

14 2x 2 + 3jc + y 2 = 12. 


16 x 2 + 5x + xy = 3. 

17 9jc 2 + 4y 2 = 36. 

18 x 2 + y - y 2 = 5. 

In Problems 19-28, find the derivative of each function. 


19 

yt/5. 

20 

*5/6. 

21 

*~ 3/4 . 

22 

X~ VU 

23 

3^. 

24 

(1 + x m ) y2 . 

25 

/ X 3 + 8\3/4 

26 

Vi + Vi +x. 


V -« 2 ) 



27 

(x + 2\3« 

28 

Ji +3 - 


U - 1/ ■ 


> X 2 - 3 

29 

Find the equation of 




(a) the tangent to y = (5 - 3 jc) 1/3 at (-1, 2); 

(b) the tangent to jc 4 + 16y 4 = 32 at (2, 1); 

(c) the normal to y = jcV9 + jc 2 at the origin; 

(d) the normal to y 2 - 4 xy = 12 at (1, 6). 

30 Show that the curves jc 2 + 3y 2 = 12 and 3 jc 2 - y 2 = 6 
intersect at right angles at the point (V3, V3). 

31 Show that for the “curve” jc(jc + 6) + y 2 - 4y + 15 = 0, 
implicit differentiation gives 

dy _ x + 3 

dx 2 — y ‘ 

Show further that this result is completely meaningless, 
because there are no points on this “curve.” 

32 Verify that the normal at any point (jcq, To) on the circle 
jc 2 + y 2 = a 2 passes through the center. 

33 Find a function y =/(;c) for which 

(a) & = 3V * ; (b) * = 5xV ^- 

34 Show that the curve x> j3 + x 3 y = 4 has no horizontal tan¬ 
gent. 

35 Find the highest point on the loop of the folium of 
Descartes (Fig. 3.3) whose equation is (3). 

In Problems 36-40, find dy/dx by implicit differentiation. 

36 tan y = x. 37 y 3 + y 2 = sin x. 

38 cos y = x. 39 sin y = xy. 

40 cos xy = x 2 + y 2 . 


In Problems 41-48, find the derivative of each function. 


41 

sin Vx. 

42 

COS JC 




Vjc 

43 

tan 2 Vjc. 

44 

sin 3 (1 — 5jc) 4/3 . 

45 

V6 — 5 cos 2.v. 

46 

/ sin 2 jc 




V 1 + cos jc' 

47 

tan (3x - 1)' 1/2 . 

48 

[tan (3x ~ 1)]“ 1/2 


The derivative of y = x 4 is clearly y f = Ax 3 . But Ax 3 can also be differentiated, 
yielding 12x 2 . It is natural to denote this function by y" and call it the second 
derivative of the original function. By differentiating the second derivative y" = 
\2x 2 we obtain the third derivative y" = 24x, and so on indefinitely. Several no¬ 
tations are in common use for these higher-order derivatives, and students should 
become familiar with all of them. The successive derivatives of a function y = 
f(x) can be written as follows: 


3.6 

DERIVATIVES OF 
HIGHER ORDER 


First derivative /'(jc) y' 

Second derivative /"(jc) y" 


dy_ 

dx 




d 2 y 

~dx 2 


- l?™ 
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Third derivative /"'(*) y'" 




A few remarks about these notations are perhaps in order. The entries in the 
first column are read “/prime of jc,” “/double prime of jc,” “f triple prime of jc,” 
“/ upper n of jc”; similarly, those in the second column are read “y prime,” “y 
double prime,” and so on. The prime notation quickly becomes unwieldy and is 
not often used beyond the third order. It is sometimes convenient to think of the 
original function as the zeroth-order derivative and to write /(jc) = / (0) (jc). The 
seemingly strange position of the superscripts in the third column can be under¬ 
stood if we remember that the second derivative is the derivative of the first de¬ 
rivative, 



On the left side of this, the superscript 2 is attached to the d on top and to the 
dx on the bottom, and this is consistent with the way these symbols are written 
on the right. 

What are the uses of these higher derivatives? In geometry, as we will see in 
Chapter 4, the sign of/"(jc) tells us whether the curve y =f(x) is concave up or 
concave down. Also, in a later chapter this qualitative interpretation of the sec¬ 
ond derivative will be refined into a quantitative formula for the curvature of the 
curve. 

In physics, second derivatives are of very great importance. If s = f(t) gives 
the position of a moving body at time t, then we know that the first and second 
derivatives of this position function, 


dv _ d 2 s 
dt dt 2 ’ 


ds 

v = —— and 

dt 


are the velocity and acceleration of the body at time t. The central role of accel¬ 
eration arises from Newton’s second law of motion, which states that the accel¬ 
eration of a moving body is proportional to the force acting on it. The basic prob¬ 
lem of Newtonian dynamics is to use calculus to deduce the nature of the motion 
from the given force. We shall begin examining problems of this kind in Chap¬ 
ter 5. 

Derivatives of higher order than the second do not have any such fundamen¬ 
tal geometric or physical interpretations. However, as we shall see later, these de¬ 
rivatives have their uses too, mainly in connection with expanding functions into 
infinite series. 

All these applications will be discussed in detail at the proper time. Mean¬ 
while, our present task is to develop proficiency at performing the calculations. 

Example 1 It is easy to find all the derivatives of y = jc 5 : 

y' = 5jc 4 , y" — 20jc 3 , y'" = 60 x 2 , 

y ( 4 ) = 120.x, y (5) = 120, y (A1) = 0 for n > 5. 
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The following notation will often be useful. For any positive integer n, the sym¬ 
bol n \ (read factorial”) is defined to be the product of all the positive integers 
from 1 up to /i: 

n\ — 1 • 2 • 3 • • • n. 

Thus, 1! = 1, 2! = 1 • 2 = 2, 3! = 1 • 2 ■ 3 = 6, 4! = 1 • 2 • 3 • 4 = 24, etc. If 
we differentiate y = x n repeatedly we clearly get 

y' — nx n ~\ 
y" = n(n ~ l)*" -2 , 
y m = n(n ~ 1 ){n ~ 7)x n ~ 3 , . . . , 
yin) — n ( n — - 2 ) • • • 2 • 1 = n\, 

yik) = o for k > n. 


Example 2 To discover a formula for the nth derivative of y = \lx = x ’, we 
compute until a pattern emerges: 

y' = — x~ 2 , 

y" = 2x~ 3 , 

y m = -2 • 3jT 4 = -3\x~ 4 , 
yi4) = 2 • 3 • 4x~ 5 = 4 ! jc ^ 5 , 
y (5) = -2 ■ 3 • 4 • 5x~ 6 = ~5\x~ 6 . 

From the evidence so far and the way the process of differentiation works, it is 
clear that except for sign y (n) is n\x~^ n+] \ A convenient way of expressing the 
alternating sign is provided by the number (—1)", which equals —1 if n is odd 
and 1 if n is even. We therefore have 

y<") = (-lyVi!*-^ 1 ) 

for every positive integer n. 


Example 3 Implicit differentiation can be used to find a simple formula for y" 
on the circle x 2 + y 2 = a 2 . To begin the process, we differentiate and obtain 


2x + 2yy' = 0 


, x 
or y = —. 

y 


(1) 


Differentiating again by the quotient rule and remembering that y is a function 
of x , we get 


/ 



xy' 

2 


When (1) is substituted into this, the formula becomes 

„ _ — x(—x/y) _ y 2 + x 2 _ a 2 

^ y 2 y 3 y 3 

which should be simple enough for anyone. 
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Example 4 Repeated differentiation enables us to give a relatively easy proof of 
the binomial theorem. For any positive integer n, we consider the function 

(1 + x) n = (1 +*)(1 + *)■ ■ - (1 +*). 

It is obvious that this function is a polynomial of degree n , that is, 

(1 + x) n = an + a\x + aix 2 + a^x 3 + ■ ■ ■ + a n x n , ( 2 ) 

and our problem is to find out what the coefficients are. If we put x = 0, we im¬ 
mediately obtain ao = 1. Next, differentiating both sides of (2) repeatedly yields 

n( 1 + x)" -1 = a\ + 2 ci 2 X + 3a^x 2 + • ■ • + na rl x n ~\ 

n(n — 1)(1 + x) n ~ 2 = la 2 + 3 • 2a^x + ■ • ■ + n{n — 1 )a n .x n ~ 2 , 

n{n - 1 ){n - 2)(1 + x ) n ~ 3 = 3 ■ 2 a 3 + ■ • • + n(n — J )(n - 2)a n x n ~ ?> , 

and so on. These equations hold for all values of x, so we can put x = 0 in each 
of them. This procedure gives the following expressions for the coefficients a j, 
<32, < 33 , ... : 

n(n — 1) n(n — 1 )(n — 2) 

a\ — n, a 2 — ^ , <33 — ^ . 3 » 

n(n — 1 )(m — 2 ) • • • (n — k + 1 ) 


With these coefficients, equation (2) takes the form 

~ 1 ) x 2 + »(» ~ [ X” ~ 2 ) ^3 + 


(1 + x) n = 1 + nx + ■ 


1-2-3 

n(n - 1 )(n — 2) • • • (m — k + 1) 
1 - 2 - 3 - - - A: 


x* + ■ • ■ + *", (3) 


and this is the binomial theorem.* 


To obtain the equivalent version given in the footnote of Section 3.1, substitute x = bla in equation 

(3) and then multiply by a n . 


PROBLEMS 


1 


2 


3 


Find the first four derivatives of 

(a) 8* - 3; 

(b) 8* 2 - 11*+ 2; 

(c) 8* 3 + lx 2 - x + 9; 

(d ) x 4 — 1 3jc 3 + 5* 2 + 3* — 2; 

(e) x 5/2 . 


Calculate the indicated derivative in each case: 

(a) y" if .v = (b) v" ify = x 2 -\, 

1 - X ' x z 


(c) 


dx 2 


1 -x 
1 + x) ’ 


d 2 

(d) ^ 


* 3 + 



< e ) ^500 (*‘ 31 _ 3x79 + 4 )' 

Find a general formula for y (n) in each case: 


4 


5 


6 


(a) 

(c) 



v = 


1 + X 


(b) y = 


1 

1 + 3jc ’ 


Use implicit differentiation to find a simple formula for 
y" in each case: 

(a) b 2 x 2 + a 2 y 2 = a 2 b 2 \ (b) y 2 = 4 px\ 

(c) x ]/1 + y ,/2 = a x,2 \ (d) * 3 + y 3 = c?\ 

(e) x 4 + y 4 = a 4 . 

Find a simple formula for y" on the curve x n + y n = a n 
and show that your results in parts (c), (d), and (e) of 
Problem 4 are all special cases of this formula. 

Find the values of y\ y", and y'" at the point (4, 3) on 
the circle x 2 + y 2 = 25. 
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7 If 5 is the position of a moving body at time t, find the 
time, position, and velocity at each moment when the ac¬ 
celeration is zero: 

(a) i = 8r 2 - y (r > 0); 

(b) i = 12 t m + t m (t > 0); 

8 (a) What is the 23rd derivative of 

x 22 - 501x 17 + fx 6 - 7T 3 X 2 ? 

(b) What is the 22nd derivative? 

9 If f(x) = x 3 — 2x 2 — x, for what values of x is /'(x) = 

rw? 

10 Show the following: 

(a) if y' is proportional to x 2 , then y" is proportional to 
x; 

(b) if y' is proportional to y 2 , then y" is proportional to 


11 It is natural to expect from the chain rule that the for¬ 
mula 

d 2 y _ d 2 y d 2 u 
dx 2 du 2 dx 2 

might be true. Disprove this guess by considering y = 
y/ii where u = x 2 + 1. Prove that in fact 

d 2 y _ d 2 y /du\ 2 d 2 u dy 
dx 2 du 2 \<ix/ dx 2 du ’ 

and verify this for the given functions. 

12 If u and fare functions of x, and y = uv, show that 

y" = uv + 2 u v' + uv”. 

Find a similar formula for 

13 For each of the functions sin x and cos x, obtain 

(a) the 4th derivative; 

(b) the 10th derivative; 

(c) the 100th derivative; 

(d) the 159th derivative. 


CHAPTER 3 REVIEW: CONCEPTS, FORMULAS, METHODS 


Memorize and learn to use, or think through the following. 

1 Derivative of a function. 

2 Binomial theorem. 

3 Derivative of a polynomial. 

4 Reflection property of parabolas. 

5 Normal line to a curve. 

6 Product rule. 

7 Quotient rule. 


ADDITIONAL PROBLEMS FOR CHAPTER 3 

SECTION 3.1 

1 Find the points on the curve y = x 3 ~ 3x 2 - 9x + 5 at 
which the tangent is horizontal. 

2 Find the points on the curve y = x 3 — x 2 at which the 
tangent has slope I. 

3 Find the points on the curve y = x 3 + x at which the tan¬ 
gent has slope 4. What is the smallest value the slope of 
the tangent to this curve can have, and where on the curve 
does the slope of the tangent have this smallest value? 

4 At what points on the curve y = x 3 — x 2 + x is the tan¬ 
gent parallel to the line 2x — y — 7 = 0? 

5 Find the slope of the tangent to the curve y = x 4 — 2x 2 + 

2 at any point. For what values of x is the tangent hori¬ 
zontal? For what values of x does the tangent point up¬ 
ward to the right? 


8 Composite function. 

9 Chain rule. 

10 Power rule. 

11 Derivatives of trigonometric functions. 

12 Implicit function. 

13 Implicit differentiation. 

14 Second derivative and higher derivatives. 


6 The curve y = ax 2 + bx + 2 is tangent to the line 8x + 
y = 14 at the point (2, —2). Find a and b. 

7 Find the constants a, b, and c if the curve y = ax 2 + 
bx + c passes through the point (— 1, 0) and is tangent 
to the line y = x at the origin. 

8 If the curve y = ax 2 + bx + c passes through the point 
(-1,0) and has the line 3x + y = 5 as its tangent at the 
point (1, 2), what values must the constants a, b, and c 
have? 

9 The curves y = x 2 + ax + b and y = x 3 - c have the 
same tangent at the point (1, 2). What are the values of 
a, b, and cl 

10 Find the equations of the tangents to the curve y = x 2 - 
4x that pass through the point (I, -4). 

11 If a =£ 0, show that the tangent to the curve y = x 3 at 
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(a, a 3 ) intersects the curve a second time at the point 
where x = —2a. 

12 Show that the tangents to the curve y = x 2 at the points 
(a, a 2 ) and (a + 2, (a + 2) 2 ) intersect on the curve y = 
x 2 — 1. 

13 Find the values of a, b, c, and d if the curve y = ax 3 + 
bx 2 + cx + d is tangent to the line y = x — 1 at the point 
(1,0) and is tangent to the line y = 6jc - 9 at the point 
(2, 3). 

14 Use the reflection property of parabolas to show that the 
two tangents to a parabola at the ends of a chord through 
the focus are perpendicular to each other. 

15 Show that the tangent to the curve y = x 3 - 2jc 2 - 3jc + 
8 at the point (2, 2) is one of the normals of y = x 2 — 
3x + 3. 

16 There is only one normal to the parabola x 2 = 2y that 
passes through the point (4, 1). Find its equation. 

17 The point P = (6, 9) lies on the parabola x 2 = 4 y. Find 
all points Q on this parabola with the property that the 
normal at Q passes through P. 


SECTION 3.2 

18 Differentiate each of the following functions two ways 
and verify that your answers agree: 

(a) (jc 2 - 1)(jc 3 - 1); (b) 3 jc 4 (jc 2 + 2jc); 

(c) (x 2 - 3)(x - 1); (d) (x + IX* 2 - lx - 3). 

19 Differentiate each of the following functions and sim¬ 
plify your answer as much as possible: 


(a) 

* + *" 1 

(b) 

x 2 + 2x + 1 

x — x~ ] ’ 

x 2 ~2x+ 1 

(c) 

X 2 

(d) 

2x+ 3 

X 3 + 2’ 

jc 2 + jc — 4 ’ 

(e) 

X 3 

(f) 

1 - X. 

1 -* 2 ’ 

1 +*’ 

(g) 

6x 4 + 9 

JC — 1 ’ 

(h) 

X 2 +6x +9 
x 2 — 4jc + 4 


20 


21 


22 


Find dy/dx two ways, first by dividing and then by us¬ 
ing the quotient rule, and show that your answers agree: 
9 - x 3 . 5 - 3x , v x 3 — 6x 


(a) 


(b) 


(c) 


JC x~* x* 

Prove the quotient rule from the product rule as follows: 
Write y = u/v 'm the form yv = u, differentiate this with 
respect to jc by the product rule, and solve the resulting 
equation for dy/dx. 

Extend the product rule to a product of three functions 
by showing that 


d du dv dw 

— ( uvw) = vw — + uw — + uv—. 
dx dx dx dx 


Flint: Treat uvw as a product ( uv)w of two factors. (No¬ 
tice that the right-hand side of this extended product rule 
is the sum of all terms in which the derivative of one fac¬ 


tor is multiplied by the other factors unchanged. This pat¬ 
tern persists for products of more than three factors.) 

23 Use Problem 22 to differentiate 

(a) (*+ 1 )(jc + 2)(x + 3); 

(b) (jc 2 + 2jc)(jc 3 + 3jc 2 )(jc 4 + 4). 

24 Use Problem 22 to show that (d/dx) u 3 = 3u 2 du/dx, and 
apply this formula to calculate 

- 7 - ( 6* 11 + 9x 5 -3) 3 . 
dx 

25 Sketch the curve y = 10V5/O + x 2 ) and find the points 
on it at which the normal passes through the origin. 

26 Consider the curve y = a/(\ + jc 2 ), where a is a positive 
constant. For what values of a does there exist a point 
P = Uo, yo) on the first-quadrant part of the curve at 
which the normal passes through the origin? If the nor¬ 
mal at the point for which jc () = 2 passes through the ori¬ 
gin, what must be the value of al 

27 There are two points on the curve y = (x + 4)/(jc — 5) at 
which the tangent passes through the origin. Sketch the 
curve and Find these points. 


SECTION 3.3 

28 Find dy/dx in each case: 

(a) y = (4x 2 - 2) 12 ; (b) y = (jc 4 + l) 125 ; 

(c) y = (x 4 - x 8 ) 16 ; (d) y = (x -1 - x~ 2 )~ 3 \ 

(e) y = (4jc 2 + 5) -1 ; (f) y = (x + x 2 + x 3 + x 4 ) 5 . 


29 


30 


31 


32 


Find dy/dx in each case: 

(a) y = (1 + 2x) 3 (4 - 5 jc) 6 ; 

(b) y = (x 2 + l) 10 (jt 2 - l) 15 ; 

(c) y = (jc 2 - 1)(16 + x 2 )~ 3 \ 

(d) y = (4x 3 - 9x 2 ) 2 (3x ~2x 2 ) 3 . 


Find dx/dt in each case: 
(/ + 3/ 2 ) 2 


(a) 5 = 


(c) 5 = 


t + 1 

(f 2 + l ) 4 


(b) 5 = 


(d) 5 = 


C t 2 - I) 3, 

Find a function y = f(x) for which 


(/ 3 “ I ) 5 ’ 

(1 + 2 1 2 ) 5 
(1 -3/ 3 ) 4 ' 


(a) = \2x 3 (x 4 + 

dx 


l) 2 


(b) ^ = 72* 5 (* 6 + l) 5 . 
dx 

Prove the power rule for positive integral exponents n by 
writing y = u n , expanding Ay = (u + A u) n — u n by the 
binomial theorem, and then dividing by Ajc. Use the quo¬ 
tient rule to extend this result to negative integral expo¬ 
nents. 


SECTION 3.4 

33 Find dy/dx for each of the following functions: 

(a) y = cos (1 — 3jc); (b) y = sin (1 - x 7 )\ 

(c) y = cos (cos x)\ (d) y = cos [sin (cos jc)]; 

(e) v = cos 4 jc; (f) y = cos 5 (1 - 3jc 2 ) 3 ; 
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(g) y = ■ 


COS JC 

— sin jc’ 

v4. 


(h) y = sin 5 3jc; 


34 


(i) y = cos jc 4 ; 

(k) y = tan (1 — 3jc); 

(m) y = cos (tan jc); 

(o) y = tan 4 jc 5 . 

Differentiate each of the given functions: 


(j) y = (1 - cos 5 x) 3 ; 

(1) y = tan 4 (1 - 2a: 3 ); 
(n) y = sin [cos (tan jc)]; 


(a) y = sin (IOjc - 1 ); 


(c) y = 5 cos (7 - 
y = 4 cos 4 (1 


(e) 

(g) 


2jc); 

-x)- 


cos J x sin^ jc; 


(b) y = cos 2 jc; 

(d) y = sin 5 jc 5 ; 

,*x 1 - cos x 

(f)>, = TT^7 ; 

(h) y- 


3 cos 2jc’ 


35 

36 

37 

38 


(i) 

y ~- 

- x cos jc; 

a) 

V = 

X 




4 ■ 1 

= jc sin —; 


sin 

jc’ 


(k) 

y ~- 

(i) 

y = 


1 


JC 


sin 

JC + 

cos jc’ 

(m) 

y - 

= (1 + sin jc) 4 ; 

(n) 

y = 

sin- 

S JC- 

■ cos 3 jc; 

(o) 

y - 

= cos 2 jc — sin 2 jc; 

(P) 

y = 

tan 

(sin 

5jc); 

(q) 

y 

= sin 2 (tan 2 jc). 







Find the values of jc for which the graph of y = 2 sin jc + 
sin 2 jc has a horizontal tangent. 

Same as Problem 35 for y = sin 2jc — 2 sin jc. 
Assuming that the derivative of sin jc is known to be 
cos jc, find the derivative of cos jc by differentiating the 
identity sin 2 jc + cos 2 jc = 1 . 

Consider the function defined by 


/« = 


1 


x sin — 

JC 


0 


if jc ^ 0, 


if jc = 0. 


(a) The graph of this function is shown in Fig. 3.4. Make 
a careful verification of the correctness of the gen¬ 
eral features of this graph. What happens to y when 
jc is large? 

(b) Show that /(jc) is continuous at jc = 0. 

(c) Find/'(jc) for jc =£ 0. 

(d) Show that/'(0) does not exist. 



39 Consider the function defined by 


fix) 


x 2 sin — if jc ^ 0, 

JC 

0 if jc = 0. 


(a) The graph of this function is shown in Fig. 3.5. Ex¬ 
amine it carefully and make sure that it correctly re¬ 
flects the main characteristics of the function. What 
happens to y when jc is large? 

(b) Show that/(jc) is continuous at jc = 0. 

(c) Find/'(jc) for jc =£ 0. 

(d) Find/'(()). 

(e) Show that/'(jc) is not continuous at jc = 0. 



SECTION 3.5 

40 Find dy/dx by implicit differentiation: 

(a) jc 4 + 2jcy 3 + 2y 4 = 4; (b) ~ - 2jc = y; 

(c) y 2 = j 2 ~ 2 ; =x4 + /; 

(e) \ / xy + 2 y = Vjc. 

41 Find dy/dx by implicit differentiation and also by solv¬ 
ing for y and then differentiating, and verify that your 
two answers are equivalent: 


(a) y 3 = 3x 2 + 5x — 1; 

(b) y 5 = jc 2 ; 

(c) 4y 2 = 3xy + x 2 : 

(d) jc 3/2 + y 3/2 : 

Find the derivative in each case: 

(a) x 3 ' 2 - x 3 >\ 

(c) Vjc + Vx 5 ; 

(b) (x 2 + 2) 4/9 ; 

(e) + -E ; 

(f) lx 2 - 1; 


(h) V2 + VT- 

Find the equation of 
(a) the tangent to jc 3 + y 3 

= 2jcy + 5 at (2, 


Figure 3.4 
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(b) the tangent to y = — 


2jc 


at (3, 3); 


48 Find a general formula for y (/1) if 


V x - I 

(c) the normal tojc 3 + 3jcy 3 - jcy 2 = xy + 10 at (2, 1); 

(d) the normal to jc 2/3 + y 2/3 = 5 at (—8, 1). 

(a) y = x 
49 Show that 

- 2 Jc ’ 


(b) y = , . 

ci T ox 

44 Show that the sum of the jc- and y-intercepts of any line 
tangent to the curve Vx + Vy = Va is equal to a. 

d n 

1 

— n 1 

r c- d" i ] 

dx n 

41 -x) 

— ni 

x n + \ (l-^)«+l 


45 The curve jc 2/3 + y 213 = a 213 is called a hypocycloid of 
four cusps . Sketch it and show that the tangent at (jco, To) 
is jco _!/3 jc + Vo -1/3 T = 0 2/3 - Use this equation to show 
that the segment cut from the tangent by the axes has 
constant length a, so that a segment of length a with its 
ends sliding along the axes always touches the curve. 

46 Find the derivatives of the given functions: 


50 Consider the function /(jc) defined by 

^ f x 2 if jc > 0, 
y —x 2 if x < 0. 

Sketch the graph, show that/'(jc) = 2|jc|, and conclude 
that/"(0) does not exist. 


(a) cos Vjc; 

(c) VsinVr, 

(e) 1 + sin Vjc; 


(g) 


Vc 


(b) Vx sin Vx; 

(d) cos 3 Vjc 4 + 1; 
(f) tan 1/3 (1 + 3jc); 
(h) VtanCVsin jc). 


SECTION 3.6 
47 Calculate y" if 

(a) y = (1 + 3jc) 1/3 ; 


(c) y = jc 4/5 ; 

(e) y = Vx + — l —; 

Vx 


(b) y : 


vTTT 


(d) y = jc 3 Vx — 7jc; 
(f) y = (jc 2 + 4) 5/2 . 


51 For each of the following functions, find f"(x) and then 
calculate the limit 


lim 


f(x+2bx) ~ Zf(x±Ax)+f(x) 

( Aj :) 2 


and notice that they are equal: 

(a) f(x) = x\ (b) fix) = \lx. 

52 Solve Problem 51 after replacing the limit given there by 


lim 
Aa—> o 


fix + Ax) - 2 f(x) + fix - Ax) 
(Ajc) 2 


53 Find the given derivatives by calculating the first few de¬ 
rivatives and noticing a pattern: 

(a) the 20th derivative of jc sin jc; 

(b) the 62nd derivative of sin 3jc. 
























APPLICATIONS 
OF DERIVATIVES 


In this chapter we begin to justify the effort we have spent on learning how to 
calculate derivatives. 

Our first applications are based on the interpretation of the derivative as the 
slope of the tangent line to a curve. The purpose of this work is to enable us to 
use the derivative as a tool for quickly discovering the most important features 
of a function and sketching its graph. This art of curve sketching is essential in 
the physical sciences. It is also one of the most useful skills that calculus can 
provide for those who need to use mathematics in their study of economics or 
biology or psychology. 

A function f(x) is said to be increasing on a certain interval of the x-axis if on 
this interval x\ < X 2 implies/(xi) </(^j- In geometric language, this means that 
the graph is rising as the point that traces it moves from left to right. Similarly, 
the function is said to be decreasing (the graph is falling) if x\ < X 2 implies 
f(x i) >f(x 2 ). These concepts are illustrated in Fig. 4.1. 

In sketching the graph of a function, it is important to know the intervals on 
which it is increasing and those on which it is decreasing. The sign of the de¬ 
rivative gives us this information: 


4.1 

INCREASING AND 
DECREASING 
FUNCTIONS. MAXIMA 
AND MINIMA 


A function f(x) is increasing on any interval in which f\x) > 0, and it is decreasing 
on any interval in which f\x) < 0. 

This is geometrically evident if we keep in mind the fact that a straight line points 




s 

* 


*1 


x 


Figure 4.1 Increasing and decreasing 
functions. 
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upward to the right if its slope is positive and downward to the right if its slope 
is negative (Fig. 4.2). 

It is clear that a smooth curve can make the transition from rising to falling 
only by passing over a peak where the slope is zero. Similarly, it can change 
from falling to rising only by going through a trough where the slope is zero. At 
such points we have a maximum or minimum value of the function. We locate 
these values by finding the critical points of the function, which are the solutions 
of the equation f'(x) = 0 ; that is, we force the tangent to be horizontal by equat¬ 
ing the derivative to zero, and we then solve the equation f\x) = 0 to discover 
where this happens. In Fig. 4.2 the critical points are x\, X 2 , x 3 , and the corre¬ 
sponding critical values are the values of the function at these points, that is, 
f(x\),f{x 2 ),f(x 3 ). 

It is important to understand that a critical value is not necessarily either a 
maximum or a minimum. This is shown by fix 3 ) in Fig. 4.2; at the critical point 
.*3 the graph does not pass either over a peak or through a trough, but instead 
merely flattens out momentarily between two intervals on each of which the de¬ 
rivative is positive. 

It should also be pointed out that we are discussing the so-called relative (or 
local) maximum or minimum values. These are values that are maximal or min¬ 
imal compared only with nearby points on the curve. In Fig. 4.2, for instance, 
fix 1 ) is a maximum even though there are many higher points on the curve, off 
to the right. If we seek the absolute maximum of a function, we must compare 
its relative maxima with one another to determine which (if any) is larger than 
any other value assumed by the function. 

Example 1 To sketch the graph of the polynomial 

y =f(x) = 2x 3 - 3x 2 - 12* + 12, 

we begin by computing the derivative and factoring this derivative as completely 
as possible: 


f{x) = 6x 2 — 6x — 12 = 6 (* -I- 1 )(jc — 2 ). 







4.1 INCREASING AND DECREASING FUNCTIONS. MAXIMA AND MINIMA 



The critical points are evidently x = — 1 and x = 2, and by substituting -1 and 
2 in f(x), we see that the corresponding critical values are y — 19 and y = — 8 . 
We now examine the three intervals into which the critical points divide the 
jc-axis, for on each of these intervals f(x) has constant sign. When x < —{, 
x + 1 and x — 2 are both negative, so their product is positive and f'(x) > 0 . 
When —1 < x < 2, x + lis positive and jc — 2 is negative, so their product is 
negative and f(x) < 0. When x > 2, * + 1 and x - 2 are both positive, so their 
product is positive and f(x) > 0. These results are displayed in Fig. 4.3, where 
the slanted lines give a schematic suggestion of the direction of the graph in each 
interval. In Fig. 4.4 we now plot the points ( — 1, 19) and (2, — 8 ) and sketch a 
smooth curve through these points, using the information in Fig. 4.3 provided by 
the sign of the derivative; that is ,f(x) is increasing when jc < — 1 , decreasing when 
— 1 < x < 2, and increasing when x > 2. Notice that in Fig. 4.4 we use different 
units of length on the two axes, as a matter of convenience in drawing a picture 
of reasonable size.* It is clear that our function has a maximum at x = -1 and 
a minimum at x = 2 , and also that no absolute maximum or minimum exists. 

The zeros of a function are always valuable aids in curve sketching when they 
can be found, but finding them can be quite difficult. We have plotted a few 
additional points in Fig. 4.4 to suggest that the zeros of this particular function 
are approximately —2.2, 0.9, and 2.9. As a matter of fact, we sometimes sketch 
the graph of a function to help us estimate the approximate location of its zeros, 
just as we have done here, as a first step toward the numerical calculation of these 
zeros to any desired degree of accuracy. In Section 4.6 we describe a standard 
method for carrying out such calculations. 1 " 


Example 2 The rational function 


y = 


x 

x 2 + 1 


*The basic idea of a graph as a visual aid displaying the qualitative nature of the function does not 
require the use of equal units on the two axes. It is only when we work with certain quantitative as¬ 
pects of the geometry of the plane, such as distances between points, areas of regions, or angles be¬ 
tween lines, that it is necessary to use equal units on both axes. 

+ Those students who have a graphing calculator will enjoy plotting the graph of the polynomial in 
this example, with the vertical scale changed by a factor of 10 or more. 
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was discussed in Example 6 of Section 1.6, and we explained there why the graph 
has the shape it does (Fig. 4.5). To find the precise location of the indicated max¬ 
imum and minimum, we calculate the derivative and equate it to zero: 

, ( x 2 -F 1)*1 — jc-2jc 1 — jc 2 

y = (jc 2 + l ) 2 ~ (x 2 + l ) 2 = °’ 

The roots of this equation (the critical points) are jc = 1 and jc = — 1, so the max¬ 
imum and minimum occur at jc = 1 and jc = — 1 , respectively. The actual maxi¬ 
mum and minimum values are y = \ and y = With these facts and our ini¬ 
tial awareness of the overall shape of the graph, it is obvious that this function 
increases on the interval — 1 < x < 1 and decreases for jc < -1 and jc > 1 . 
However, these conclusions can also be drawn directly from the sign of the de¬ 
rivative, which is clearly positive for — 1 < jc < 1 and negative for jc < — 1 
and jc > 1 . 


Figure 4.6 Endpoints, cusp, and 
corner. 


These examples, as well as our past experience, suggest a few informal rules 
that are useful in sketching the graph of a function /(jc). If possible and conve¬ 
nient, we should determine 


1 The critical points of /(jc). 

2 The critical values of /(jc). 

3 The sign of /'(jc) between critical points. 

4 The zeros of/(jc). 

5 The behavior of /(jc) as jc —* oo and as jc —» — °o. 

6 The behavior of /(jc) near points at which the function is not defined. 

However, perhaps the most important rule of all is this: Don’t be a slave to any 
rule, be flexible, use common sense. Remember the old Hungarian proverb: “All 
fixed ideas are wrong, including this one.” 


Remark 1 Maxima and minima can occur in three ways not covered by the pre¬ 
ceding discussion: at endpoints, cusps, and corners. As examples we consider 
the three functions 

jc = V 1 — x 2 , v = jc 2/3 , y = 1 — V^c 2 = 1 — |jc|. 


Their graphs are shown in Fig. 4.6. The first function has the closed interval 
- 1 < jc < 1 as its domain, and at the endpoints it has minima that are not re¬ 
vealed by equating the derivative to zero. The second function has a minimum 
at jc = 0 that is a cusp, because its derivative 


- 2-1/3 - 
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is negative to the left of 0 and positive to the right of 0 , and becomes infinite 
near 0. The third function has a maximum at x = 0, and this maximum is called 
a corner for obvious reasons. In seeking the maxima and minima of functions, 
equate the derivative to zero by all means, but do so carefully, keeping these three 
possibilities in mind as well. 

Remark 2 Among other things, mathematicians are professional skeptics. On 
one side of their nature they have trained themselves to attack loose arguments 
and to accept only those statements that they find impossible to doubt, in the 
hope that Ultimate Certainty will reward their efforts. Our statements about in¬ 
creasing and decreasing functions and maxima and minima are supported only 
by geometric plausibility arguments. The statements themselves are true, and 
when examined more carefully they fall into a field of study called Analysis, 
where the foundations of calculus are fully investigated. However, this book is 
for students, not mathematicians, and our main concern is with the use of the 
tools rather than the tools themselves. We gave a few preliminary discussions in 
Section 2.6, and we remind any reader who might be interested that fully rigor¬ 
ous proofs can be found in Appendices A.3 and A.4 at the back of the book. 


PROBLEMS 


Sketch the graphs of the following functions by using the first 
derivative and the methods of this section; in particular, find 
the intervals on which each function is increasing and those 
on which it is decreasing, and locate any maximum or mini¬ 
mum values it may have. 


1 

y = jc 2 — 2 jc. 

2 

y = 2 + x - x 2 . 

3 

y = jc 2 - 6 jc + 9. 

4 

y = x 2 — 4x + 5. 

5 

y = 2 jc 3 - 3jc 2 + 1. 

6 

y = x 3 - 3x 2 + 3x - \ . 

7 

1 

H 

II 

8 

y = x 4 - 2x 2 + 1 . 

9 

y = 3jc 4 + 4jc 3 . 

10 

y = 3x 5 - 20x 3 . 

11 

y = x + — . 

JC 

12 

y = 2x + -y- 
x z 

13 

1 

14 

X 

y ~ x 2 T- jc - 

(X- l) 2 ' 

15 

y = jc V 3 - x. 

*16 

y = 5* 2/3 - x il3 - 

17 

The function /(x) = 

X 3 + X 

— 1, being a third-degree 


18 


19 


polynomial, obviously crosses the x-axis (why?) and 
therefore has at least one zero. By examining/'(x), show 
that this function has only one zero. Show similarly that 
/(x) = 7x 131 + 1 lx 73 + x — 500 has one and only one 
zero. 

Consider the function y = x m (l - x) n , where m and n are 
positive integers, and show that 

(a) if m is even, y has a minimum at x = 0; 

(b) if n is even, y has a minimum at x = 1; 

(c) y has a maximum at jc = m/{m + n) regardless of 
whether m and n are even or not. 

Sketch the graph of a function /(x) defined for x > 0 
and having the properties 


/(l) = 0 and /'(*) = J (all * > 0). 

20 Sketch the graph of a function /(x) with the properties 

f'(x) < 0 for x < 2 and /'(x) > 0 for x > 2 

(a) if /'(x) is continuous at x = 2; 

(b) if/'(x) —» — 1 as x —> 2 and /'(x) —» 1 as x—> 2 + . 

21 In each case, sketch the graph of a function with all the 
stated properties: 

(a) /(1) = 1, /'(x) > 0 for jc < 1, /'(x) < 0 for 
jc > 1; 

(b) /(-1) = 2 and f{2) = -1 , /'(x) > 0 for jc < -1 

and jc > 2,/'(jc) < 0 for — 1 < jc < 2; 

(c) /(-1) = 1 and/'(-l) = 0,/'(x) < 0 for jc < -1 

and — 1 < jc < 2,/'(jc) > 0 for jc > 2; 

(d) /'(jc) < 0 for — 2 < jc < Oandjc > l,/'(x) > 0 
for jc < —2 and 0 < jc < l,/'(— 2) = /'(0) = 0, 
/'(1) does not exist. 

22 Construct a formula for a function /(jc) with a maximum 
at jc = —2 and a minimum at jc = 1 . 

23 Find the critical points and corresponding critical values 
for the function y = cos 2jc + 2 cos jc on the interval 
[0, 27 t]. Sketch the graph. 

24 Find a > 0 so that the curves y = sin ax and y = cos ax 
intersect at right angles. 

25 Show that the largest possible value of y = sin jc - cos jc 

is V2. 

26 Find the largest possible value of each of the following 
functions: 
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(a) y = sin jc + cos jc; 

(b) y = sin jc + cos 2 jc; 

(c) y = sin 2 jc + cos 2 jc. 

27 Find the maximum and minimum values of y = 
2 sin 2jc + sin 4jc on the interval [0, 7r], and state where 
these values occur. Sketch the graph. 

28 Find the maximum and minimum values of each of the 
following functions on the interval [0 , 27t], and sketch 
the graphs: 


(a) y = sin(cosjc); (b) y = cos (sin jc). 

29 Show that y = 27/(sin jc) + 64/(cos jc) has a minimum 
value but no maximum value on the interval 0 < jc < 
7 t/ 2 , and find this minimum value. 

30 Find all critical points for each of the following func¬ 
tions, use the first derivative to decide whether each of 
the corresponding critical values is a maximum, a mini¬ 
mum, or neither, and sketch the graphs: 

(a) y = jc + sin jc; (b) y = sin 2 2jc. 


4.2 

CONCAVITY AND 
POINTS OF 
INFLECTION 


One of the most distinctive features of a graph is the direction in which it curves 
or bends. The graph on the left in Fig. 4.7 curves upward as the point that traces 
it moves from left to right, and the graph on the right curves downward. The sign 
of the second derivative gives us this information, as follows. 

A positive second derivative, f\x) > 0, tells us that the slope /'(jc) is an in¬ 
creasing function of x. This means that the tangent turns counterclockwise as we 
move along the curve from left to right, as shown on the left side of Fig. 4.8. 
The curve is said to be concave up (the concave side of a curve is its hollow 
side). Such a curve lies above its tangent except at the point of tangency. Simi¬ 
larly, if the second derivative is negative, f\x) < 0, then the slope f\x) is a de¬ 
creasing function, and the tangent turns clockwise as we move to the right (see 
the right side of Fig. 4.8). Under these circumstances the curve is concave down; 
it lies below its tangent except at the point of tangency. 

Most curves are concave up on some intervals and concave down on other in¬ 
tervals. A point like P in Fig. 4.8, across which the direction of concavity changes, 
is called a point of inflection.* If f\x) is continuous and has opposite signs on 
each side of P, then it must have a zero at P itself. The search for points of in¬ 
flection is mainly a matter of solving the equation f\x) = 0 and checking the di¬ 
rection of concavity on both sides of each root. 


Example 1 To investigate the function 

y =/(jc) = 2jc 3 - 1 2jc 2 + 1 8jc - 2 


*The word “inflection” comes from the Latin inflectere, meaning “to bend” 




Figure 4.7 


Figure 4.8 
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for concavity and points of inflection, we calculate 

f'(x) = 6x 2 — 24x + 18 = 6 (jc - l)(x — 3) 


and 


f"{x)= \2x- 24 = 12(jc — 2). 

The critical points [the roots of/'(x) = 0] are clearly x = 1 and x = 3, and the 
corresponding critical values are y = 6 and y = -2. We have a possible point of 
inflection atx = 2, since this is the only root of/"(x) = 0. It is evident that f"(x) 
is negative for jc < 2 and positive for jc > 2, so the graph is concave down on 
the left of x = 2 and concave up on the right. This tells us that we really have a 
point of inflection at jc = 2, as indicated in Fig. 4.9. 


Example 2 The rational function 


y = 


1 

jc 2 + 


1 



is very easy to sketch by inspection if we notice the following clues: it is sym¬ 
metric about the y-axis because the exponent is an even number, its values are 
all positive, it has a maximum at jc = 0 because this yields the smallest denom¬ 
inator, and y —» 0 as |jc| —> «>. It is therefore intuitively clear that the graph has 
the shape shown in Fig. 4.10. There are evidently two points of inflection, and 
the only question is, What are their precise locations? To discover this, we com¬ 
pute 

, — 2jc 

y ( x 2 +\) 2 


and 


(jc 2 + 1)2 • (-2) + 2x • 2(jc 2 + 1) • 2jc 
(jc 2 + l) 4 



_ (jc 2 + 1) • (-2) + 8jc 2 _ 2(3jc 2 - 1) 

(jc 2 + l) 3 “ (jc 2 + l) 3 ‘ 

Equating y" to zero and solving gives jc = ± 1 A/3, which locates the points of 
inflection. If we wish, we can verify our first impression about the direction of 
concavity on various parts of the curve, as shown in Fig. 4.10, by observing 
that y" < 0 when x 2 < y and y" > 0 when jc 2 > y. These facts tell us that 
the graph is concave down for —1/V3 < jc < 1/V3 and concave up for 
jc < — 1/V3andjc > 1/V3. 


Remark 1 As we have tried to suggest in these examples, knowing that/"(jc 0 ) = 
0 is not enough to guarantee that jc = jco furnishes a point of inflection. We must 
also know that the graph is concave up on one side of xo and concave down on 
the other. The simplest function that shows this difficulty is y = /(x) = x 4 (Fig. 
4.11). Here/'(x) = 4x 3 and f"{x) = 12x 2 , so f'(x) = 0 at x = 0. However,/"(x) 
is clearly positive on both sides of the point x = 0 , and therefore—as we already 
know from the graph—this point corresponds to a minimum, not a point of in¬ 
flection. The function y = x 5 — 5x 4 provides a more complicated example of the 
same phenomenon. Here 
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y' = 5x 4 — 20x 3 and y" = 20x 3 - 60x 2 = 20x 2 (x — 3). 

The roots of y" = 0 are x = 0 and x = 3. However, y" does not change sign at 
x = 0, so the only point of inflection is at x = 3. The graph is concave down on 
the left of this point and concave up on the right. 

Remark 2 The graph of y = x ,/3 = is easy to sketch and has an obvious 
point of inflection at x = 0 (Fig. 4.12). We can also discover this by inspecting 
the second derivative. We have 

y'=\x-™ 

and 




2—5/3 = 


~2 

9 V* 5 ’ 


so y" is positive if x < 0 and negative if x > 0. However, y" does not exist at 
x = 0. In searching for points of inflection, we must therefore consider not only 
points at which y" = 0 , but also points (if there are any) at which y" does not ex¬ 
ist. 


Remark 3 In the so-called second derivative test —which we state informally 
in Fig. 4.13 — the sign of the second derivative is used to decide whether a crit¬ 
ical point furnishes a maximum or a minimum value. This test is sometimes use¬ 
ful, but its importance is often exaggerated. We will see in the next two sections 
that in most applied problems it is clear from the context whether we have a max¬ 
imum or minimum value, and no further testing is necessary. 


Figure 4.13 The second derivative 
test. 




A maximum, if A minimum, if 

/'(x 0 ) = 0 and /"(* 0 ) < 0 f\x 0 ) = 0 and /"(x 0 ) > 0 


PROBLEMS 


For each of the following, locate the points of inflection, find 
the intervals on which the curve is concave up and those on 
which it is concave down, and sketch. 


1 y = (x — a) 3 + b. 
3 y = x 3 + 3x 2 + 4. 
5 y = x 4 + 2x 3 + 1. 
7 y = x 4 — 2x 3 . 


2 y = x 3 - 6x 2 . 

4 y = 2x 3 + 3x 2 - 12x. 
6 y = x 4 - 6x 2 
8 y = 3x 5 - 5x 4 . 


9 

11 


y = 


y = 


x 2 + 9* 
4x 2 

x 2 + 3‘ 


10 

12 


y = 


ax 

x 2 + b 2 


(, a , b > 0). 



11 

x 


13 y = x - —. 

x 

14 In each part of this problem, use the given formula for 
the second derivative of a function to locate the points 
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of inflection, the intervals on which the graph is concave 
up, and the intervals on which the graph is concave down: 
(a) y" = 8 jc 2 + 32jc; (b) /' = 15 jc 3 + 39jc; 

(c) y" = 3 jc 4 - 27jc 2 ; (d) y" = {x + 2)(jc 2 - 4). 

15 Sketch the graph of a function f(x) defined for all jc such 
that 

(a) f(x) > 0,/'(*) > 0, and f"{x) > 0; 

(b) f'{x) < 0 and f'\x) < 0 . 

16 Is it possible for a function /(jc) defined for all jc to have 
the three properties /(jc) > 0 , /'(jc) < 0, and /"(jc) < 
0? Explain. 

17 (a) By sketching, show thaty = jc 2 + a/x has a minimum 

but no maximum for every value of the constant a. 
Also, verify this by calculation. 

(b) Find the point of inflection of y = jc 2 - 8/jc. 

18 Starting from jc 2 + y 2 = a 2 , calculate d 2 yfdx 2 by implicit 
differentiation and state why its sign should be opposite 
to the sign of y. 

19 Find the value of a that makes y = jc 3 - ax 2 + 1 have a 
point of inflection at jc = 1 . 

20 Find a and b such that y = a\Tx + bl\Tx has (1, 4) as a 
point of inflection. 

21 If k is a positive number =£ 1, show that the first quad¬ 
rant part of the curve y = jc* is 

(a) concave up if k > 1 ; 

(b) concave down if k < 1 . 

22 If k is a positive number =£ 1 and y = x k - kx, show that 

(a) if k < 1 , y has a maximum at jc = 1 ; 

(b) if k > 1 , y has a minimum at jc = 1 . 

23 Show that the graph of a quadratic function y = ax 2 + 
bx + c has no points of inflection. Give a condition un¬ 
der which the graph is (a) concave up; (b) concave down. 


24 Show that the general cubic curve y = ax 3 + bx 2 + 
cjc + d has a single point of inflection and three possi¬ 
ble shapes depending on whether b 2 > 3ac, b 2 = 3 ac, 
or b 2 < 3 ac. Sketch these shapes. 

25 In each of the following, sketch the graph of a function 
with all the stated properties: 

(a) /(0) = 2,/(2) = 0,/'(0)=/'(2) = 0,/'(jc) > 0 for 
\x 11 > 1,/'(jc) < 0 for |jc — 1| < 1 ./"(jc) < 0 
for jc < 1,/"(jc) > 0 for jc > 1; 

(b) /( —2) = 6, /(l) = 2, /(3) = 4, /'(l) =/'(3) = 0, 
/"(jc) < 0 for |jc - 21 > 1 ,/'(*) > 0 for \x — 2| < 
1 1 /”(■*) < 0 for jc > 2 or |jc + 11 < 1 , /"(jc) > 0 
for |jc — 11 < 1 or jc < -2; 

(c) /(0) = 0, /(2) =/(-2) = 1, /'(0) = 0, /'(jc) > 0 
for jc > 0, /'(jc) < 0 for jc < 0, /"(jc) > 0 for 
|jc| < 2, /"(jc) < 0 for |jc| > 2, linWoo/(jc) = 2, 
lirn^-oo/M = 2; 

(d) /(2) = 4,/'(x) > Ofor* < 2,/'(jc) < Oforjc > 
2, fix) > 0 for x =£ 2, lim^ 2 |/'M| = °°> lim*-^ 
f(x) = 2, lhiWoo f(x) = 2. 

26 Show that the graph of y = sin jc is concave down when 
it is above the jc-axis and concave up when it is below 
the jc-axis. 

27 For each of the following functions defined on the in¬ 
terval 0 < jc < 277, find all points of inflection and de¬ 
termine the intervals on which the graph is concave up 
and those on which the graph is concave down, and 
sketch the graph: 

(a) y = jc — 2 sin jc; 

(b) y = sin 2 jc; 

(c) y = jc + sin jc. 


Among the most striking applications of calculus are those that depend on find¬ 
ing the maximum or minimum values of functions. 

Practical everyday life is filled with such problems, and it is natural that math¬ 
ematicians and others should find them interesting and important. A busi¬ 
nessperson seeks to maximize profits and minimize costs. An engineer design¬ 
ing a new automobile wishes to maximize its efficiency. An airline pilot tries to 
minimize flight times and fuel consumption. In science, we often find that na¬ 
ture acts in a way that maximizes or minimizes a certain quantity. For example, 
a ray of light traverses a system of lenses along a path that minimizes its total 
time of travel, and a flexible hanging chain assumes a shape that minimizes its 
potential energy due to gravity. 

Whenever we use such words as “largest,” “smallest,” “most,” “least,” “best,” 
and so on, it is a reasonable guess that some kind of maximum or minimum prob¬ 
lem is lurking nearby. If this problem can be expressed in terms of variables and 
functions—which is not always possible by any means—then the methods of 
calculus stand ready to help us understand it and solve it. 

Many of our examples and problems deal with geometric ideas, because max¬ 
imum and minimum values often appear with particular vividness in geometric 


4.3 

APPLIED MAXIMUM 
AND MINIMUM 
PROBLEMS 


APPLICATIONS OF DERIVATIVES 


124 


settings. In order to be ready for this work, students should make sure they know 
the formulas for areas and volumes given in Fig. 1.22 of Chapter 1. 

We begin with a fairly simple example about numbers. 


Example 1 Find two positive numbers whose sum is 16 and whose product is 
as large as possible. 


Solution If x and y are two variable positive numbers whose sum is 1 6, so that 

x + y = 16, (1) 

then we are asked to find the particular values of jc and y that maximize their 
product 

P = xy. (2) 

Our initial difficulty is that P depends on two variables, whereas our calculus 
of derivatives works only for functions of a single independent variable. Equa¬ 
tion (1) gets us over this difficulty. It enables us to express y in terms of jc, y = 
16 x, and thereby to express P as a function of jc alone, 

P = x( \6 — x) = \6x — x 2 . (3) 



In Fig. 4.14 we give a rough sketch of the graph of (3). Our only purpose here 
is to provide visual emphasis for the following obvious facts about this function: 
that P — 0 for jc = 0 and jc = 1 6 , that P > 0 for 0 < jc < 1 6 , and that there¬ 
fore the highest point (where P has its largest value) is characterized by the con¬ 
dition dP/dx = 0, since this condition means that the tangent is horizontal. To 
solve the problem we compute this derivative from (3), 

= 16 - 2x\ 
dx 


we equate this derivative to zero, 

16 — 2jc = 0; 

and we see that jc = 8 is the solution of this equation. This is the value of jc that 
maximizes P y and by (1) the corresponding value of y is also 8. It is quite clear 
from Fig. 4.14 that jc = 8 actually does maximize P\ but if we wish to verify this, 
we can do so by computing the second derivative. 


and by recalling that a negative second derivative implies that the curve is con¬ 
cave down and therefore we have a maximum—which we already knew from 
Fig. 4.14. The related problem of making the product P as small as possible 
within the stated restrictions has no solution, because the restriction that jc and y 
must be positive numbers means that jc must belong to the open interval 0 < 
jc < 16 , and this part of the graph has no lowest point. 


Example 2 A rectangular garden 450 ft 2 in area is to be fenced off against rab¬ 
bits. If one side of the garden is already protected by a barn wall, what dimen¬ 
sions will require the shortest length of fence? 
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Solution We begin by drawing a sketch and introducing notation that will make 
it convenient to deal with the area of the garden and the total length of the fence 
(Fig. 4.15). If L denotes the length of the fence, we are to minimize 


L = 2x + y (4) 

subject to the restriction that 

xy * * 450. (5) 


By using (5), L can be written as a function of jc alone, 


X 

^ Bam 

l 

450 ft 2 


1 


Figure 4.15 


L 


lx + 


450 


( 6 ) 


A quick sketch (Fig. 4.16) helps us to visualize this function and feel comfort¬ 
able with its properties, especially the fact that it has a minimum and no maxi¬ 
mum (we are only interested in positive values of x). Our next steps are to com¬ 
pute the derivative of ( 6 ), 

dL_ 450 

dx Jt 2 ’ 

and then to equate this derivative to zero and solve the resulting equation, 

450 

2 - = 0, x 2 — 225, x = 15. 

xr 



(We ignore the root x— —15 for the reason stated.) By (5), the corresponding 
value of y is y = 30, so the garden with the shortest fence is 15 by 30, or twice 
as long as it is wide. 


Example 3 Find the dimensions of the rectangle of greatest area that can be in¬ 
scribed in a semicircle of radius a. 


Solution If we take our semicircle to be the top half of the circle x 2 + y 2 = a 2 
(Fig. 4.17, left), then our notation is ready and waiting: We must maximize 

A = 2 xy (7) 


with the restriction that 

* 2 + y 2 = a 2 . ( 8 ) 

Since (8) yields y = Va 2 — x 2 = ( a 2 — x 2 ) 1/2 , (7) becomes 

4 = 2 x{a 2 - x 2 ) m . (9) 




Figure 4.17 
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It is clear that jc lies in the interval 0 < jc < a. On the right in Fig. 4.17 we 
imagine the extreme cases: When x is close to 0, the rectangle is tall and thin, 
and when x is close to a , it is short and wide—and in each case the area is small, 
so somewhere in between we have a maximum area. To locate this maximum, 
we compute dAJdx from (9), equate it to zero, and solve: 

2x • j(a 2 — x 2 )~ 1/2 • (~2x) + 2(a 2 — x 2 ) l/2 = 0, — * = S/a 2 — jc 2 , 

V^T 2 


x 2 = a 2 — x 2 , 2x 2 = a 2 , x = —= jV2a. 

V2 

Since y = Vo 2 — jc 2 , we see that the corresponding value of y is also \s/2a, so 
the dimensions of the largest inscribed rectangle are 2jc = \fla and y = \sfla, 
and this rectangle is twice as long as it is wide. 

There is a more efficient way of solving this problem if we don’t care about 
the actual dimensions of the largest rectangle but only about its shape. The first 
step is to notice that (8) determines y as an implicit function of jc, so implicit dif¬ 
ferentiation with respect to jc yields 



2x + 2y & = o 
dx 


dy _ _x_ 
dx y' 


( 10 ) 


Next, by differentiating (7) with respect to x and using the fact that dAJdx = 0 
at the maximum, we obtain 


2x -y- + 2y = 0 
dx 


dy , n 
x Tx +y = 0 - 


(ID 


When (10) is inserted i n (11), the result is 

jc j + y = 0, -x 2 + y 2 = 0, y 2 = x 2 , or y = x, 

where the last equation expresses the shape of the rectangle with the largest area. 
We can also describe this shape by saying that the ratio of the height of the rec¬ 
tangle to its base is 


y _ x _ I 
2jc 2jc 2’ 


Example 4 A wire of length L is to be cut into two pieces, one being bent to 
form a square and the other to form a circle. How should the wire be cut if the 
sum of the areas enclosed by the two pieces is to be (a) a maximum? (b) a min¬ 
imum? 

Solution If jc denotes the side of the square and r the radius of the circle, as 
shown on the left in Fig. 4.18, then the sum of the areas is 

A = jc 2 + 7rr 2 (12) 

where jc and r are related by 

4c + 2t rr = L. (13) 


We solve (13) for r in terms of jc, 
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A 



r = -^-(L- 4x), 

2 77 

and use this to express A in terms of jc alone, 

A=xi+ir-^(L-4xj> 

= x 2 + -L (L - Ax) 2 . (14) 

477 

Now we notice that when x = 0 all the wire is used for the circle, and when jc = 
\L all the wire is used for the square. To solve the problem, we must fully un¬ 
derstand the behavior of the function (14) on the interval 0 < jc < {L. Its values 
at x = 0 and x = \L are clearly L 2 /4tt and L 2 /16, and the first of these values is 
the larger because 16 > 477. This is indicated on the right in Fig. 4.18. Since 
(14) shows that the graph is a parabola that opens up (jc 2 has a positive coeffi¬ 
cient), all that is needed to verify the shape of the graph shown in the figure is 
to find the location of the low point. For this we calculate the derivative of (14), 

dA 1 

- = 2x + — -2(L-Ax)-(-A) 
ax 47 t 

= 2 x- — (L- 4x). 

77 

On setting this equal to zero and solving the resulting equation, we get 

jc —— (L - 4jc) = 0, 77 jc = L — 4jc, x = - f 

This number lies between {L and \ L , so the graph shown in the figure is correct 
and we complete the solution of the problem as follows. 

The highest point of the graph is at the left end, and therefore to maximize A 

we must choose jc = 0 and use all the wire for the circle. If we insist that the wire 

must actually be cut, then (a) has no answer; for no matter how little of the wire 
is used for the square, we can always increase the total area by using still less. 

For (b), the total area is minimized when x = L/(4 + 7r). Accordingly, the 
length of wire used for the square is 4x = 4L/(4 + 77 ) and the length used for the 
circle is 


Figure 4.18 












128 


APPLICATIONS OF DERIVATIVES 


L- 4x = 


L - 


4 L 

4+77 


ttL 

4+77* 


We also notice that the minimal area is attained when the diameter of the circle 
equals the side of the square, since 


2 r = — (L 
77 


4jc) = 


ttL 


4 + 77 4 + 77 * 


Example 5 At a price of $1.50, a door-to-door salesperson can sell 500 potato 
peelers that cost 70 cents each. For every cent that the salesperson lowers the 
price, the number sold can be increased by 25. What selling price will maximize 
the total profit? 

Solution If x denotes the number of cents the salesperson lowers the price, then 
the profit on each peeler is 80 — x cents and the number sold is 500 + 25x The 
total profit (in cents) is therefore 

P = (80 - x)(500 + 25x) = 40,000 + 1 500x - 25x * 1 2 . 

We maximize this function by setting the derivative equal to zero and solving the 
resulting equation, 

1500 - 50*, 1500 - 50* = 0 50* = 1500, * = 30. 

dx 

The most advantageous selling price is therefore $1.20. 


As these examples show, the mathematical techniques required in most max¬ 
imum-minimum problems are relatively simple. The hardest part of such a prob¬ 
lem is usually “setting it up” in a convenient form. This is the analytical, think¬ 
ing part of the problem, as opposed to the computational part. We emphasize this 
because it is clear that calculus is unlikely to be of much value as a tool in the 
sciences unless one learns how to understand what a problem is about and how 
to translate its words into appropriate mathematical language. This is what “word 
problems” or “story problems” are for—to help students learn these critically 
important skills. 

No set of rules for problem solving really works, because the essential ingre¬ 
dients are imagination and intelligence, which cannot be taught. However, the 
following general suggestions may be helpful. They don’t guarantee success, but 
without them progress is unlikely. 

STRATEGY FOR SOLVING MAXIMUM-MINIMUM PROBLEMS 

1 Understand the problem. Begin by reading the problem carefully, several 
times if necessary, until it is fully understood. It is a sad fact of life that stu¬ 
dents often seem driven to start working on a problem before they have any 
clear idea of what it is about. Take your time and make your efforts count. 

2 If geometry is involved—as it often is— make a fairly careful sketch of rea¬ 
sonable size. Show the general configuration. For instance, if a problem is 
about a general triangle, don’t mislead yourself by drawing one that looks 
like a right triangle or an isosceles triangle. Don’t be hasty or sloppy. You 
hope your sketch will be a source of fruitful ideas, so treat it with respect. 
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3 Label your figure carefully , making sure you understand which quantities are 
constant and which are allowed to vary. If convenient, use initial letters to 
suggest the quantities they represent, as A for area, Vfor volume, h for height. 
Be aware of geometric relations among the quantities in your figure, espe¬ 
cially those involving right triangles and similar triangles. Write these rela¬ 
tions down in the form of equations and be prepared to use them if needed. 

4 If Q is the quantity to be maximized or minimized, write it down in terms of 
the quantities in the figure, and try to use the relations in Step 3 to express 
Q as a function of a single variable. Draw a quick informal graph of this 
function on a suitable interval, perform little thought experiments in which 
you visualize the extreme cases, and use derivatives to discover details and 
thereby solve your problem.* 


*Serious mathematical problem solving is mental activity on the highest level. Even exceptionally 
able students, who are confident they have the necessary imagination and intelligence, may derive 
additional comfort from the possession of a method. Such a method was distilled by George Polya 
(1887-1985) out of his own vast experience as an eminent creative mathematician (250 research pa¬ 
pers and 10 books) and the foremost mathematics teacher of his generation. Polya’s method consists 
of four simple principles that will be recognized at once as only common sense: (I) understand the 
problem; (2) devise a plan; (3) carry out your plan; (4) look back on your work and learn. The strat¬ 
egy given above is nothing but Polya’s principles (I), (2), and (3) adapted to the special circum¬ 
stances of maximum-minimum problems. Polya developed his ideas in a series of great and delight¬ 
ful books that should be required reading for every undergraduate mathematics major: How To Solve 
It (Princeton Press, 1945, 2nd ed., 1957); Mathematics and Plausible Reasoning (2 vols., Princeton 
Press, 1954); and Mathematical Discovery (2 vols., Wiley, 1962). His discussion of the following 
problem in How To Solve It , under the heading Working Backwards, is only one of many unforget¬ 
table gems: How can you bring up from the river exactly 6 quarts of water when you have only two 
containers, a 4-quart pail and a 9-quart pail (Fig. 4.19), to measure with? Figure 4.19 


PROBLEMS 


1 Find the positive number that exceeds its square by the 
largest amount. Why would you expect this number to 
be in the open interval (0, 1)? 

2 Express the number 18 as the sum of two positive num¬ 
bers in such a way that the product of the first and the 
square of the second is as large as possible. 

3 Show that the rectangle with maximum area for a given 
perimeter is a square. 1 " 

4 Show that the rectangle with minimum perimeter for a 
given area is a square. 

5 Show that the square has the largest area among all rec¬ 
tangles inscribed in a given fixed circle x 2 + y 2 - a 2 . 

6 If we maximize the perimeter of the rectangle instead 
of the area in Problem 5, show that the solution is still 
a square. 

7 An east-west and a north-south road intersect at a point 
O. A diagonal road is to be constructed from a point A 
east of O to a point B north of O, passing through a 
town C which is a miles east and b miles north of O. 
Find the ratio of OA to OB if the triangular area OAB 


+ This was the earliest maximum-minimum problem solved by the 
methods of calculus (by Fermat, about 1629). 


is as small as possible. Show that this minimal area is 
attained when C bisects the segment AB. 

8 A certain poster requires 96 in 2 for the printed message 
and must have 3-in margins at the top and bottom and 
a 2-in margin on each side. Find the overall dimensions 
of the poster if the amount of paper used is a minimum. 

9 A university bookstore can get the book Courtship Rit¬ 
uals of the American College Student at a cost of $4 a 
copy from the publisher. The bookstore manager esti¬ 
mates that she can sell 180 copies at a price of $10 and 
that each 50-cent reduction in the price will increase the 
sales by 30 copies. What should be the price of the book 
to maximize the bookstore’s total profit? 

10 A new branch bank is to have a floor area of 3500 ft 2 . 
It is to be a rectangle with three solid brick walls and 
a decorative glass front. The glass costs 1.8 times as 
much as the brick wall per linear foot. What dimensions 
of the building will minimize the cost of materials for 
the walls and front? 

11 At noon a ship A is 50 mi north of a ship B and is steam¬ 
ing south at 16 mi/h. Ship B is headed west at 12 mi/h. 
At what time are they closest together, and what is the 
minimal distance between them? 

12 Express the number 8 as the sum of two nonnegative 
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numbers in such a way that the sum of the square of 
the first and the cube of the second is as small as pos¬ 
sible. Also solve the problem if this sum is to be as large 
as possible. 

13 Find two positive numbers whose product is 16 and 
whose sum is as small as possible. 

14 A triangle of base b and height h has acute base angles. 
A rectangle is inscribed in the triangle with one side on 
the base of the triangle. Show that the largest such rec¬ 
tangle has base b/2 and height hi 2, so that its area is 
one-half the area of the triangle. 

15 Find the area of the largest rectangle with lower base 
on the x-axis and upper vertices on the parabola y = 
27 - x 2 . 

16 An isosceles triangle has its vertex at the origin, its base 
parallel to and above the x-axis, and the vertices of its 
base on the parabola 9y = 27 — x 2 . Find the area of the 
largest such triangle. 

17 If a rectangle has an area of 32 in 2 , what are its di¬ 
mensions if the distance from one corner to the mid¬ 
point of a nonadjacent side is as small as possible? 

18 If the cost per hour of running a small riverboat is pro¬ 
portional to the cube of its speed through the water, find 
the speed at which it should be run against a current of 
a miles per hour to minimize the cost of an upstream 
journey over a distance of b miles. 

19 A Norman window has the shape of a rectangle sur¬ 
mounted by a semicircle. If the total perimeter is fixed, 
find the proportions of the window (i.e., the ratio of the 
height of the window to its base) that will admit the 
most light. 

20 Solve the Norman window problem in Problem 19 if 
the semicircular part is made of stained glass that trans¬ 
mits only half as much light per unit area as does the 
clear glass in the rectangular part. 

21 A trough is to be made from three planks, each 12 in 
wide. If the cross section has the shape of a trapezoid, 
how far apart should the tops of the sides be placed to 
give the trough maximum carrying capacity? 

22 Solve Problem 21 if there is one 12-in plank and two 
6-in planks. 

23 The strength of a rectangular beam is jointly propor¬ 
tional to its width and the cube of its depth. 1- Find the 
proportions (ratio of depth to width) of the strongest 
beam that can be cut from a given cylindrical log. 

24 Among all isosceles triangles with fixed perimeter, 
show that the triangle of greatest area is equilateral. 

25 An isosceles triangle is inscribed in the circle x 2 + y 2 = 
a 2 with its base parallel to the x-axis and one vertex at 
the point (0, a). Find the height of the triangle with max- 

f This means that if x is the width and y is the depth, then the strength 
S is given by the formula S = cxy 3 , where c is a constant of propor¬ 
tionality. 


imum area and show that this triangle is equilateral. 
(Can you show by geometric reasoning alone that the 
largest triangle inscribed in the circle is necessarily 
equilateral?) 

26 A wire of length L is to be cut into two pieces, one bent 
to form a square and the other to form an equilateral 
triangle. How should the wire be cut if the sum of the 
areas enclosed by the two pieces is to be (a) a maxi¬ 
mum? (b) a minimum? Show that case (b) occurs when 
the side of the square is f the height of the triangle. 

*27 A man 6 ft tall wants to construct a greenhouse of length 
L and width 18ft against the outer wall of his house by 
building a sloping glass roof of slant height y from the 
ground to the wall, as shown in Fig. 4.20. He consid- 



Figure 4.20 


ers space in the greenhouse to be usable if he can stand 
upright without bumping his head. If the cost of build¬ 
ing the roof is proportional to y, find the slope of the 
roof that minimizes the cost per square foot of usable 
space. Hint: Notice that this amounts to minimizing y/x. 

*28 (The fireman’s problem) A fence a feet high is b feet 
from a high burning building. Find the length of the 
shortest ladder that will reach from the ground across 
the top of the fence to the building. 

*29 A corridor of width a is at right angles to a second cor¬ 
ridor of width b. A long, thin, heavy rod is to be pushed 
along the floor from the first corridor into the second. 
What is the length of the longest rod that can get around 
the corner? 

*30 A long sheet of paper is a units wide. One comer of the 
paper is folded over as shown in Fig. 4.21. Find the 
value of x that minimizes (a) the area of the triangle 
ABC; (b) the length of the crease AC. 

31 The speed v of a wave on the surface of a still body of 
liquid depends as follows on its wavelength A: 


v = 



lira 
<5A ’ 
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Figure 4.21 


B 


34 Water has the very unusual property among common 
liquids of having a temperature above its freezing point 
at which its density is a maximum. (This property is 
important for the survival of life in ponds and lakes: the 
denser water sinks and prevents the bottom water from 
freezing.) If 1 liter of water at 0°C occupies a volume 
of 


V = 


6.42 8.51 , 6.79 

10 5 1 0 6 1 0 8 


liters at T°C, find the temperature at which the density 
is greatest. 


where the constants are the acceleration g due to grav¬ 
ity, the surface tension <x of the liquid, and the density 
8 of the liquid. Find the minimum speed of a wave and 
the corresponding wavelength. 

32 The illumination provided on a flat surface by a light 
source is inversely proportional to the square of the dis¬ 
tance from the source and directly proportional to the 
sine of the angle of incidence. How high should a light 
be placed on a pole to maximize the illumination on the 
ground 50 ft from the pole? 

33 Find the maximum area of a rectangle that can be cir¬ 
cumscribed about a given rectangle with base B and 
height H (Fig. 4.22). 



We continue to develop the basic ideas of Section 4.3 by means of additional ex¬ 
amples. 

Example 1 A manufacturer of cylindrical soup cans receives a very large order 
for cans of a specified volume Vq. What dimensions will minimize the total sur¬ 
face area of such a can, and therefore the amount of metal needed to manufac¬ 
ture it? 


4.4 

MORE MAXIMUM- 
MINIMUM PROBLEMS. 
REFLECTION AND 
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Solution If r and h are the radius of the base and the height of a cylindrical can 
(Fig. 4.23, left), then the volume is 

V 0 = irr 2 h (1) 

and the total surface area is 

A = 2irr 2 + 2irrh. (2) 

We must minimize A, which is a function of two variables, by using the fact that 
equation (1) relates these variables to one another. We therefore solve (1) for h, 
h — VqItti 2 , and use this to express A as a function of r alone, 
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A = 2irr 2 + 2irr • —y 
7 rr z 


= 2tjt 2 + 



(3) 


The graph of this function (Fig. 4.23, right) shows that A is large when r is small 
and also when r is large, with a minimum somewhere in between. As usual, to 
discover the precise location of this minimum, we differentiate (3), equate the 
derivative to zero, and solve, 

dA 2 Vo . 2Vq 3 

—— = 4 nr -y, 47rr-y = 0, 47rr 3 = 2 Vo, 

dr r A r z 

277T 3 = V 0 . (4) 


If we want the actual dimensions of the most efficient can, we can solve equa¬ 
tion (4) for r and then use this to calculate h, 


1^ h = 


_Vo_ = Vo / 2ttA 2/3 = 


Vo) 


= 2 . 


Vfl 

2n' 


from which we observe that h = 2 r. Or, if we are interested primarily in the 
shape, we can replace V 0 in (4) by ij^h and immediately obtain 

27tt 3 = nr 2 h or 2 r = h. 


From the point of view of lowering costs for raw materials—an extremely seri¬ 
ous matter for manufacturers — this remarkable result tells us that the “best” shape 
for a cylindrical can is that in which the height equals the diameter of the base. 


Example 2 Find the ratio of the height to the diameter of the base for the cylin¬ 
der of maximum volume that can be inscribed in a sphere of radius R . 

Solution If we sketch a cylinder inscribed in the sphere and label it as shown 
on the left in Fig. 4.24, then we see that 

V = 2irx 2 y (5) 

where 


x 2 + y 2 = R 2 . 


(6) 
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Figure 4.24 


Visualizing the extreme cases (Fig. 4.24, right) tells us that V is small when x is 
near zero and also when x is near R, so between these extremes there is a shape 
of maximum volume. To find it, we use (6) to write (5) as 

V = 2iry(R 2 - y 2 ) - 2ir(R 2 y - y 3 ), 


from which we obtain 

= 27 t(R 2 - 3 y 2 ). 
dy 

Setting this equal to zero to find y and then using (6) to find x gives 

y = —^=7 and x = Jr 2 — \R 2 = ~~ R. 

V3 ' V3 


The ratio of the height to the diameter of the base for the largest cylinder is there¬ 
fore 


2y = y 
2 x x 


-L = iV2 = 0.707. 

V2 2 


This result can be obtained more efficiently by the method of implicit differ¬ 
entiation. If x is taken as the independent variable and y is thought of as a func¬ 
tion of x, then (6) yields 


2 x + 2 y = 0 
dx 


or 


From (5) we find that 


dy _ x_ 
dx y’ 


dV 

dx 


= 2 tt j x 2 + 2xy 1 = 27 r 


= 27 T 


dx 


2 


+ 2 xy 


— x 3 + 2xy 2 \ _ 277x 


■ (2y 2 - x 2 ). 


It therefore follows that dV/dx = 0 when 


2y 2 = x 2 or — = —73 = -7- V2, 
x V 2 2 ’ 


as before. 


Example 3 If a ray of light travels from a point A to a point P on a flat mirror 
and is then reflected to a point B, as shown in Fig. 4.25, then the most careful 
measurements show that the incident ray and the reflected ray make equal an- 
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Figure 4.25 Reflection of light. 


gles with the mirror: a = f3. Assume that the ray of light takes the shortest path 
from A to B by way of the mirror, and prove this law of reflection by showing 
that the path APB is shortest when a = f3. 

Solution If we think of the point P as assuming various positions on the mir¬ 
ror, with each position determined by a value of x , then we wish to consider the 
length L of the path as a function of x. From Fig. 4.25 this function is clearly 

L = Va 2 + x 2 + V & 2 + (c — x ) 1 

= ( a 2 + x 2 ) 1/2 + [ b 2 + (c - x) 2 ] 1/2 , 


and differentiation yields 


= j(a 2 + x 2 ) 1/2 ■ (2x) + \[b 2 + (c - x) 2 ] 1/2 • 2(c — x) • (-1) 


Jt_ c — x 

V a 2 + x 2 V/? 2 + (c — x) 2 

If we minimize L by equating this derivative to zero, we get 


(7) 


V a 2 + x 2 S/b 2 + (c - x) 2 


and this equation can be changed in form as follows: 



jc c — x' 


The equation last written can easily be solved for jc. However, there is no need 
to do this, because the equation as it stands tells us what we want to know: For 
the angles a and (3 in the two right triangles shown in the figure, the ratios of 
the opposite side to the adjacent side are equal, so a and (3 are equal. 

It is fairly clear on intuitive grounds that we have minimized L. If we wish to 
verify this by the second derivative test, we can use (7) to compute 

d 2 L _ a 2 _ b 2 _ 

dx 2 (a 2 + x 2 ) m [b 2 + (c - x) 2 ] 3/2 

(we omit the details of the computation), and all that remains is to notice that 
this quantity is positive. 



Figure 4.26 


Remark 1 The reasoning in Example 3 can be made simpler if we recall from 
Section 1.7 the definition of the cosine of a positive acute angle A. If we think 
of A as one of the acute angles of a right triangle (Fig. 4.26), then by definition 

b adjacent side 

cos A = — = —--. 

c hypotenuse 

Using this, the minimizing condition (8) can be written as 

cos a = cos /3, 

so a = (3. For use in the next example, we also recall the definition of the sine 
of A, 
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a opposite side 

sin A = — = -r 11 -. 

c hypotenuse 

Remark 2 The law of reflection discussed in Example 3 was known to the an¬ 
cient Greeks. However, the fact that a reflected ray of light follows the shortest 
path was discovered much later, by Heron of Alexandria in the first century A.D. 
Heron’s geometric proof is simple but ingenious. The argument goes as follows. 
Let A and B be the same points as before (Fig. 4.27), and let B' be the mirror 
image of B, so that the surface of the mirror is the perpendicular bisector of BB 
The segment AB' intersects the mirror at a point P y and this is the point where a 
ray of light is reflected in passing from A to B; for a = y and y = /3, so a = (3. 
The total length of the path is AP + PB = AP 4- PB ' = AB For any other point 
P' on the mirror the total length of the path is AP' 4- P'B = AP' + P'B\ and 
this is greater than AB' because the sum of two sides of a triangle is greater than 
the third side. This shows that the actual path of our reflected ray of light is the 
shortest possible path from A to B by way of the mirror. 



Example 4 The reflected ray of light previously discussed travels in a single 
medium at a constant speed. However, in different media (air, water, glass) light 
travels at different speeds. If a ray of light passes from air into water as shown 
in Fig. 4.28, it is refracted (bent) toward the perpendicular at the interface. The 
path APB is clearly no longer the shortest path from A to B. What law determines 
it? In 1621 the Dutch scientist Snell discovered empirically that the actual path 
of the ray is that for which 

sin a 

——— = a constant, (9) 

sin p 

where this constant is independent of the positions of A and B. This fact is now 
called Snell's law of refraction* Prove Snell’s law by assuming that the ray takes 
the path from A to B that minimizes the total time of travel. 

Solution If the speed of light in air is v a and in water is v w , then the total time 
of travel T is the time in air plus the time in water, 

\/a 2 + x 2 Vb 2 + (c - x ) 2 
T =-1---— 

V a V w 



Figure 4.28 Refraction of light. 


= — (a 2 + x 2 Y 12 + — [b 2 + ( c - x ) 2 ] m . 


If we compute the derivative of this function and notice its meaning in terms of 
Fig. 4.28, we obtain 


dT = J_ x _ J_ c x 
dx v a Va 2 + x 2 VV + (c - *) 2 

_ sin a sin /3 

V a V w 


( 10 ) 


If we now minimize T by equating this to zero, the result is 


*Willebrord Snell (1591-1626) was an astronomer and mathematician. At the age of 22 he succeeded 
his father as professor of mathematics at Leiden. His fame rests mainly on his discovery of the law 
of refraction (9). 
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sin a sin B sin a v cl 

-=- or —r =—. (11) 

v a v w sin p v w 

This is a more revealing form of Snell’s law, because it tells us the physical mean¬ 
ing of the constant on the right side of (9): It is the ratio of the speed of light in 
air to the (smaller) speed of light in water. This constant is called the index of 
refraction of water. If the water in this experiment is replaced by any other translu¬ 
cent medium, such as alcohol, glycerin, or glass, then this constant has a differ¬ 
ent numerical value, which is the index of refraction of the medium in question. 

As in Example 3, we can verify that the configuration (11) actually minimizes 
T by computing the second derivative and noting that this quantity is positive: 

£L = A- _ — _+ J__ £ _ >0 . 

dx 2 v a ( a 2 + x 2 ) 312 v w [b 2 + (c — x ) 2 ] 3/2 

But there is another method that is worth mentioning. We begin by observing 
that dT/dx as given by (1 0) is a difference of two terms. As x increases from 0 
to c, the first term, (sin a)lv a , increases from 0 to some positive value. The sec¬ 
ond term, (sin fi)/v w , decreases from some positive value to 0. This shows that 
dT/dx is negative at x = 0 and increases to a positive value at x = c. The mini¬ 
mum value of T therefore occurs at the only x for which dT/dx = 0, and this is 
precisely the configuration described by (11). 


Remark 3 The ideas of Example 4 were discovered in 1657 by Fermat (Section 
1.4), and for this reason the statement that a ray of light traverses an optical sys¬ 
tem along the path that minimizes its total time of travel is called Fermat's prin¬ 
ciple of least time. (It should be noticed that when a ray of light travels in a sin¬ 
gle uniform medium, “shortest path’’ is equivalent to “least time,’’ so Example 3 
falls under the same principle.) During the next two centuries Fermat’s ideas stim¬ 
ulated a broad development of the general theory of maxima and minima, lead¬ 
ing first to Euler’s creation of the calculus of variations and then to Hamilton’s 
principle of least action, which has turned out to be one of the deepest unifying 
principles of physical science. Euler expressed his enthusiasm in the following 
memorable words: “Since the fabric of the world is the most perfect and was es¬ 
tablished by the wisest Creator, nothing happens in this world in which some rea¬ 
son of maximum or minimum would not come to light.’’* 

Remark 4 Snell’s sine law (9) was first published by Descartes in 1637 (with¬ 
out any mention of Snell), and he purported to prove it in an incorrect form equiv¬ 
alent to 

sin a _ Vw_ 

sin (3 v a * 

Descartes based his argument on a fanciful model and on the metaphysically based 
opinion that light travels faster in a denser medium. Fermat rejected both the opin¬ 
ion (“shocking to common sense’’) and the argument (“demonstrations which do not 
force belief cannot bear this name’’). After many years of passive but exasper¬ 
ated skepticism he at last actively confronted the problem and proved the correct 
law himself in 1657, creating the necessary calculus techniques as he went along. 


Tor a brief account of the great Swiss mathematician Euler (pronounced “OIL-er”) see Section 8.4. 
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PROBLEMS 

1 A closed rectangular box with a square base is to be 
made out of plywood. If the volume is given, find the 
shape (ratio of height to side of base) that minimizes 
the total number of square feet of plywood that are 
needed. 

2 Solve Problem 1 if the box is open on top. 

3 Find the radius of the cylinder of maximum volume that 
can be inscribed in a cone of height H and radius of 
base R. 

4 Find the height of the cone of maximum volume that 
can be inscribed in a sphere of radius R . 

5 A square piece of tin 24 in on each side is to be made 
into an open-top box by cutting a small square from 
each comer and bending up the flaps to form the sides. 
How large a square should be cut from each comer to 
make the volume of the box as large as possible? 

6 Solve Problem 5 if the given piece of tin is a rectangle 
15 by 24 in. 

7 A cylindrical can without a top is to be made from a 
specified weight of sheet metal. Find the ratio of the 
height to the diameter of the base when the volume of 
the can is greatest. 

8 A cylindrical tank without a top is to have a specified 
volume. If the cost of the material used for the bottom 
is three times the cost of that used for the curved lat¬ 
eral part, find the ratio of the height to the diameter of 
the base for which the total cost is least. 

9 Draw a reasonably good sketch of y = Vx and mark 
the point on this graph that seems to be closest to the 
point (j, 0). Then calculate the coordinates of this clos¬ 
est point. Hint: Minimize the square of the distance from 
the point (y, 0) to the point (x, Vjc). 

19 Generalize Problem 9 by finding the point on the graph 
of y = Vjc that is closest to the point (a, 0) for any 
a > 0. 

11 A spy climbs out of a submarine into a rubber boat 2 
mi east of a point P on a straight north-south shoreline. 
He wants to get to a house on the shore 6 mi north of 
P. He can row 3 mi/h and walk 5 mi/h, and he intends 
to row directly to a point somewhere north of P and 
then walk the rest of the way. 

(a) How far north of P should he land in order to get 
to the house in the shortest possible time? 

(b) How long does the trip take? 

(c) How much longer will it take if he rows directly to 
P and then walks to the house? 

12 Show that the answer to part (a) of Problem 11 does 
not change if the house is 8 mi north of P. 

13 If the rubber boat in Problem 11 has a small outboard 
motor and can go 5 mi/h, then it is obvious by common 
sense that the fastest route is entirely by boat. What is 


the slowest speed for which the fastest route is still en¬ 
tirely by boat? 

*14 The intensity of illumination at a point P due to a light 
source is directly proportional to the strength of the 
source and inversely proportional to the square of the 
distance from P to the source. Two light sources of 
strengths a and b are a distance L apart. What point 
on the line segment joining these sources receives the 
least total illumination? If a is 8 times as large as 
b y where is this point? (Assume that the intensity 
at any point is the sum of the intensities from the two 
sources.) 

*15 Two towns, A and B, lie on the same side of a straight 
highway. Their distance apart is c, and their distances 
from the highway are a and b. Show that the length of 
the shortest road that goes from A to the highway and 
then on to B is Vc 2 + 4 ab 

(a) by using calculus; 

(b) without calculus. Hint: Introduce the “mirror im¬ 
age” of B on the other side of the highway. 

16 Find the minimum vertical distance between the curves 
y = 16x 2 and y = — \/x 2 . 

17 An isosceles triangle is circumscribed about a circle of 
radius R. If jc is the height of the triangle, show that its 
area A is least when jc = 3 R. Hint: Minimize A 2 . 

18 If the figure in Problem 17 is revolved about the alti¬ 
tude of the triangle, the result is a cone circumscribed 
about a sphere of radius R. Show that the volume of the 
cone is least when jc = 4 R, and that this least volume is 
twice the volume of the sphere. 

19 A silo has cylindrical walls, a flat circular floor, and a 
hemispherical top. For a given volume, find the ratio of 
the total height to the diameter of the base that mini¬ 
mizes the total surface area. 

20 In Problem 19, if the cost of construction per square 
foot is twice as great for the hemispherical top as for 
the walls and the floor, find the ratio of the total height 
to the diameter of the base that minimizes the total cost 
of construction. 

21 What is the smallest value of the constant a for which 
the inequality ax + \/x > 2N/2 is valid for all positive 
numbers jc? 

*22 There is a refinery at a point A on a straight highway 
and an oil well at a point B which can be reached by 
traveling 5 mi along the highway to a point C and then 
12 mi across country perpendicular to the highway. If 
a pipeline is built from A to B, it costs k times as much 
per mile to build it across country as along the high¬ 
way, because of the difficult terrain. The line will be 
built either directly from A to B or along the highway 
to a point P part of the way toward C and then across 
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country to B, whichever is cheaper. Decide on the 
cheapest route (a) if k = 3; (b) if k = 2. (c) What is the 
largest value of k for which it is cheapest to build the 
pipeline directly from A to B1 

23 A circular ring of radius a is uniformly charged with 
electricity, the total charge being Q. The force exerted 
by this charge on a unit charge located at a distance jc 
from the center of the ring, in a direction perpendicu¬ 
lar to the plane of the ring, is given by F = Qx(x 2 + 
a 2 )~ yi . Sketch the graph of this function and find the 
value of x that maximizes F 

24 A cylindrical hole of radius x is bored through a sphere 
of radius R in such a way that the axis of the hole passes 
through the center of the sphere. Find the value of x that 
maximizes the complete surface area of the remaining 
solid. Hint: The area of a segment of height /i on a 
sphere of radius R is lirRh. 

*25 The sum of the surface areas of a cube and a sphere is 
given. What is the ratio of the edge of the cube to the 
diameter of the sphere when (a) the sum of their vol¬ 
umes is a maximum? (b) the sum of their volumes is a 
minimum? 

*26 Consider two spheres of radii 1 and 2 whose centers are 
6 units apart. At what point on the line joining their cen¬ 
ters will an observer be able to see the most total sur¬ 
face area? (See the hint for Problem 24.) 

*27 Find the point on the parabola y = x 2 that is closest to 
the point (6, 3). 

28 A man at point A on the shore of a circular lake of ra¬ 
dius 1 mi wants to reach the opposite point C as soon 
as possible (Fig. 4.29). He can walk 6 mi/h and row his 
boat 3 mi/h. At what angle 6 to the diameter AC should 
he row? 



29 Find the maximum possible area A of a trapezoid in¬ 
scribed in a semicircle of radius 1 (Fig. 4.30) 

(a) by expressing A as a function of 6 ; 

(b) by expressing A as a function of <£. 



Figure 4.30 


30 A laboratory scientist performs an experiment n times 
to measure a physical quantity jc and obtains the n 
results x\, X2, . . . , x n . These measurements deviate 
from the “true value” of jc because of unavoidable 
environmental factors of many kinds, such as fluctu¬ 
ations in temperature or air pressure. She decides to 
use an estimate x for jc based on the method of least 
squares. This means that x is chosen to minimize the 
quantity 

S = (x - X \) 2 + (x- X 2 ) 2 +•••+(*- Xn) 1 , 

which is the sum of the squares of the deviations of the 
estimate x from the measured values. Show that this es¬ 
timate 3c is the average of the measured values: 

X] + X 2 + ’ • ’ + X n 
X = -. 

n 

31 A spider at a comer S of the ceiling of a cubic room 
8 ft on each side wishes to catch a bug at the oppo¬ 
site comer B of the floor (Fig. 4.31). The spider, 
who must walk on the ceiling, the walls, or the 
floor, wishes to find the shortest path to the bug. Find 
a shortest path 

(a) by using calculus; 

(b) without calculus, by merely thinking. 



Figure 4.31 
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If a tank is being filled with water, then the water level rises. To describe how 
fast the water level rises, we speak of the rate of change of the water level or, 
equivalently, the rate of change of the depth. If the depth is denoted by h , and t 
is the time measured from some convenient moment, then the derivative dh/dt is 
the rate of change of the depth. Further, the volume V of water in the tank is also 
changing, and dV/dt is the rate of change of this volume. 

Similarly, any geometric or physical quantity Q that changes with time is a 
function of time , say Q = Q(t), and its derivative dQ/dt is the rate of change of 
the quantity. The problems that we now consider are based on the fact that if two 
changing quantities are related to one another, then their rates of change are also 
related. 


4.5 

RELATED RATES 


Example 1 Gas is being pumped into a large spherical rubber balloon at the con¬ 
stant rate of 8 ft 3 /min. Find how fast the radius r of the balloon is increasing (a) 
when r = 2 ft; (b) when r = 4 ft. 


Solution The volume of the balloon (Fig. 4.32) is given by the formula for the 
volume of a sphere, 

V=V 3 . (1) 

From the statement of the problem we know that dV/dt = 8, and we must find 
dr/dt for two specific values of r. It is essential to understand the background 
of this situation, namely, the fact that V and r are both dependent variables 
with the time t as the underlying independent variable. With this in mind, it is 
natural to introduce the rates of change of V and r by differentiating (1) with 
respect to t , 



dV_ 

dt 


4 

3 


0 j dr j dr 

77 • 3r 2 — = 477r 2 — 
dt dt 


( 2 ) 


where the chain rule is needed in the calculation. It follows from (2) that 


dr = 1 dV = 2 

dt 4777 2 dt 777 2 ’ 


since dV/dt = 8. In case (a) we therefore have 


dr 

dt 


~~ =0.16 ft/min, 
277 


and in case (b), 

T = 0.04 ft/min. 

dt 87t 

These conclusions confirm our commonsense awareness that since the volume 
of the balloon is increasing at a constant rate, the radius increases more and more 
slowly as the volume grows larger. 


Example 2 A ladder 13 ft long is leaning against a wall. The bottom of the lad¬ 
der is being pulled away from the wall at the constant rate of 6 ft/min. How fast 
is the top of the ladder moving down the wall when the bottom of the ladder is 
5 ft from the wall? 
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Figure 4.34 


Solution The first thing we do is draw a diagram of the situation and label it, 
being careful to use letters to represent quantities that are changing (Fig. 4.33). 
In terms of this figure, we can clarify our thoughts by stating what is known and 
what we are trying to find: 

= 6, --y- = ? when jc = 5. 

dt dt 

(The use of the minus sign here can best be understood by thinking of dy/dt as 
the rate at which y is increasing and —dy/dt as the rate at which y is decreasing. 
The problem asks for the latter.) Roughly speaking, we know one time deriva¬ 
tive and we want to find the other. We therefore seek an equation connecting x 
and y which we can differentiate with respect to t to obtain a second equation 
connecting their rates of change. It is clear from the figure that our starting point 
must be the Pythagorean theorem, 

jc 2 + y 2 = 169. (3) 


When this is differentiated with respect to t, we get 


2*4 + 2>4 = 0 


dt 


dt 


dy _ x_ dx_ 
dt y dt 


or 


and therefore 


dy _ x dx 
dt y dt' 


dy _ 6x 
dt y 


(4) 


since dx/dt = 6. Finally, equation (3) tells us that y = 12 when jc = 5, so (4) 
yields our conclusion, 

dy 6-5 _ . . 

— — = — = 2y ft/min when x = 5. 


Warning: Don’t substitute the values x — 5, y — 12 prematurely. The essence of 
the problem is the fact that x and y are variables; if we pin them down to spe¬ 
cific values too soon, as is done in Fig. 4.34, then this makes it impossible to un¬ 
derstand or solve the problem. In other words, preserve the fluidity of the situa¬ 
tion until the last possible moment. 


4 ft 3 /min 



Example 3 A conical tank with its vertex down is 12 ft high and 12 ft in di¬ 
ameter at the top. Water is being pumped in at the rate of 4 ft 3 /min. Find the rate 
at which the water level is rising (a) when the water is 2 ft deep; (b) when the 
water is 8 ft deep. 

Solution As before, we begin by drawing and labeling a diagram (Fig. 4.35), 
with the purpose of visualizing the situation and establishing notation. Our next 
step is to use this notation to state as follows what is given and what we are try¬ 
ing to find: 

dV dx n , _ 

= 4, —— = ? when x — 2 and jc = 8. 

dt dt 

The changing volume V of water in the tank has the shape of a cone, so our start¬ 
ing point is the formula for the volume of a cone, 
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V = j77 y 2 x. 


(5) 


The only dependent variables we care about are V and jc, so we wish to elimi¬ 
nate the superfluous variable y. By examining Fig. 4.35 and using similar trian¬ 
gles, we see that 


2 = A 

x 12 


2 


or y = jx. 


( 6 ) 


and substituting this in (5) gives 


V = 



(7) 


We are now in a position to introduce the rates of change by differentiating (7) 
with respect to t, which yields 


dV 7T 7 dx 
dt A X dt 


( 8 ) 


dx _ 4 dV = 16 

dt 7TX 2 dt 7TX 2 ’ 


since dV/dt = 4. This formula tells us that when x = 2, 

= — = 1.27 ft/min, 

Jr 7r 


and when x = 8, 

= 0.08 ft/min, 

Jr 477 


and the solution is complete. 


It may be helpful to students if we now summarize the method that emerges 

from these examples: 

STRATEGY FOR SOLVING RELATED RATE PROBLEMS 

1 Read the problem carefully, several times if necessary, until it is fully un¬ 
derstood. 

2 Draw a careful sketch of the situation being considered. Add to the sketch all 
numerical quantities that remain constant throughout the problem. Now add 
as letters all quantities—the dependent variables—that are functions of time. 

3 Write down the given rate of change and the required rate of change in terms 
of derivatives. 

4 Find an equation that connects the two dependent variables in Step 3, using 
geometry if necessary to eliminate any superfluous dependent variable. Use 
the chain rule to differentiate both sides of this equation with respect to r. 

5 Substitute the given rate of change from Step 3 into the differentiated equa¬ 
tion obtained in Step 4, and solve for the required rate of change. 

Warning: Don’t give the dependent variables numerical values too soon. This 

should be done only after the differentiation in Step 4. 
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PROBLEMS 


1 A stone dropped into a pond sends out a series of con¬ 
centric ripples. If the radius r of the outer ripple in¬ 
creases steadily at the rate of 6 ft/s, find the rate at which 
the area of disturbed water is increasing (a) when 
r = 10 ft, and (b) when r — 20 ft. 

2 A large spherical snowball is melting at the rate of 27r 
ft 3 /h. At the moment when it is 30 inches in diameter, 
determine (a) how fast the radius is changing, and (b) 
how fast the surface area is changing. 

3 Sand is being poured onto a conical pile at the constant 
rate of 50 ft 3 /min. Frictional forces in the sand are such 
that the height of the pile is always equal to the radius 
of its base. How fast is the height of the pile increas¬ 
ing when the sand is 5 ft deep? 

4 A girl 5 ft tall is running at the rate of 12 ft/s and passes 
under a street light 20 ft above the ground. Find how 
rapidly the tip of her shadow is moving when she is (a) 
20 ft past the street light, and (b) 50 ft past the street 
light. 

5 In Problem 4, find how rapidly the length of the girl’s 
shadow is increasing at each of the stated moments. 

6 A light is at the top of a pole 80 ft high. A ball is dropped 
from the same height from a point 20 ft away from the 
light. Find how fast the shadow of the ball is moving 
along the ground (a) 1 second later; (b) 2 seconds later. 
(Assume that the ball falls 5 = 16 1 2 feet in t seconds.) 

7 A woman raises a bucket of cement to a platform 40 ft 
above her head by means of a rope 80 ft long that passes 
over a pulley on the platform. If she holds her end of 
the rope firmly at head level and walks away at 5 ft/s, 
how fast is the bucket rising when she is 30 ft away 
from the spot directly below the pulley? 

8 A boy is flying a kite at a height of 80 ft, and the wind 
is blowing the kite horizontally away from the boy at 
the rate of 20 ft/s. How fast is the boy paying out string 
when the kite is 100 ft away from him? 

9 A boat is being pulled in to a dock by means of a rope 
with one end tied to the bow of the boat and the other 
end passing through a ring attached to the dock at a 
point 5 ft higher than the bow of the boat. If the rope 
is being pulled in at the rate of 4 ft/s, how fast is the 
boat moving through the water when 13 ft of rope are 
out? 

10 A trough is 10 ft long and has a cross section in the 
shape of an equilateral triangle 2 ft on each side. If wa¬ 
ter is being pumped in at the rate of 20 ft 3 /min, how 
fast is the water level rising when the water is 1 ft deep? 

11 A spherical meteorite enters the earth’s atmosphere and 
bums up at a rate proportional to its surface area. Show 
that its radius decreases at a constant rate. 

12 A point moves around the circle x 2 + y 2 = a 2 in such 


a way that the x-component of its velocity is given by 
dx/dt = —y. Find dy/dt and decide whether the direc¬ 
tion of the motion is clockwise or counterclockwise. 

13 A car moving at 60 mi/h along a straight road passes 
under a weather balloon rising vertically at 20 mi/h. If 
the balloon is 1 mi up when the car is directly beneath 
it, how fast is the distance between the car and the bal¬ 
loon increasing 1 minute later? 

14 Most gases obey Boyle’s law: If a sample of the gas is 
held at a constant temperature while being compressed 
by a piston in a cylinder, then its pressure p and vol¬ 
ume V are related by the equation pV = c , where c is a 
constant. Find dp/dt in terms of p and dV/dt. 

15 At a certain moment a sample of gas obeying Boyle’s 
law (Problem 14) occupies a volume of 1000 in 3 at a 
pressure of 10 lb/in 2 . If this gas is being compressed 
isothermally at the rate of 12 in 3 /min, find the rate at 
which the pressure is increasing at the instant when the 
volume is 600 in 3 . 

*16 A ladder 20 ft long is leaning against a wall 12 ft high, 
with its top projecting over the wall. Its bottom is be¬ 
ing pulled away from the wall at the constant rate of 5 
ft/min. Find how rapidly the top of the ladder is ap¬ 
proaching the ground (a) when 5 ft of the ladder pro¬ 
jects over the wall; (b) when the top of the ladder 
reaches the top of the wall. 

17 A conical party hat made of cardboard has a radius of 
4 in and a height of 12 in. When filled with beer, it leaks 
at the rate of 4 in 3 /min. At what rate is the level of beer 
falling (a) when the beer is 6 in deep? (b) when the hat 
is half empty? 

18 A hemispherical bowl of radius 8 in is being filled with 
water at a constant rate. If the water level is rising at 
the rate of j in/s at the instant when the water is 6 in 
deep, find how fast the water is flowing in 

(a) by using the fact that a segment of a sphere has vol¬ 
ume 

v= 17/12 ( a “f) 

where a is the radius of the sphere and h is the height 
of the segment; 

(b) by using the fact that if V is the volume of the wa¬ 
ter at time t, then 

dV 2 dh 
dt dt 

where r is the radius of the surface and h is the depth. 

19 Water is being poured into a hemispherical bowl of ra¬ 
dius 3 in at the rate of 1 in 3 /s. How fast is the water 
level rising when the water is 1 in deep? 
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*20 In Problem 19, suppose that the bowl contains a lead 
ball 2 inches in diameter, and find how fast the water 
level is rising when the ball is half submerged. 

21 Assume that a snowball melts in such a way that its vol¬ 
ume decreases at a rate proportional to its surface area. 
If half the original snowball has melted away after 2 
hours, how much longer will it take for the snowball to 
disappear completely? 

22 A man in a hot air balloon is rising at the rate of 20 ft/s. 
How fast is the distance to the horizon increasing when 
the balloon is 2000 ft high? Assume that the earth is a 
sphere of radius 4000 mi. 


23 A drawbridge with two 20-ft spans is being raised at 
the rate of 2 radians/min (Fig. 4.36). How fast is the 
distance between the ends of the spans increasing when 
they are elevated 7r/4 radians? 


\ / 
N / 

N / 



Figure 4.36 


Consider the cubic equation 

Jt 3 -3;t —5 = 0. (1) 

It is possible to solve this equation by exact methods, that is, by formulas yield¬ 
ing a solution in terms of radicals in the same sense that the quadratic formula 

—b ± \Zb 2 — 4 ac 


4.6 

NEWTON’S METHOD 
FOR SOLVING 
EQUATIONS 


provides an exact solution of the quadratic equation ax 2 + bx + c = 0. However, 
if what we need is a numerical solution of (1) that is accurate to a reasonable 
number of decimal places, then it is more convenient to find this solution by the 
approximation method to be described here than to try to use the exact solution. 
Furthermore, while formulas that yield exact solutions in terms of radicals for 
equations of degree 2, 3, and 4 do exist, it is known to be impossible to solve 
the general equation of degree 5 or more in terms of radicals. Therefore, in or¬ 
der to solve a fifth-degree equation like x 5 - 3x 2 + 9x — 11 = 0, we would be 
forced to use an approximation method, since no other method is available. 

Returning to equation (1), if we denote x 3 — 3x — 5 by f(x), then we can eas¬ 
ily calculate the following values: 

/(—2) = -7, /(-1)= -3, /(0)= -5, 

/(1)= -7, f(2) = —3, f(3) = 13. 

The pair of values/(2) = — 3 and/(3) = 13 suggests that as x varies continu¬ 
ously from x = 2tox = 3 ,f(x) varies continuously from -3 to 13, and that con¬ 
sequently there is some intermediate value of x wher ef(x) = 0. This is true, but 
even though it is intuitively obvious, it is quite difficult to give a rigorous proof. 
We do not attempt such a proof here, but instead take it for granted that if a con¬ 
tinuous function/(x) has values f(a) and f(b) with opposite signs, then there is 
at least one root of the equation/(x) = 0 between a and b.* This tells us that (1) 
has a root between x = 2 and x = 3, and we can take either of these numbers as 
a first approximation to this root. The approximation x = 2 would seem to be the 
better choice, since —3 is closer to 0 than 13 is. 


This property of continuous functions is discussed in Section 2.6 under the heading The Intermedi¬ 
ate Value Theorem. 
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In general, suppose we have a first approximation jc = x\ to a root r of an equa¬ 
tion/( jc) = 0. This root is a point where the curve y = fix) crosses the x-axis, as 
shown in Fig. 4.37; and the idea of Newton’s method is to use the tangent line 
to the curve at the point where x = x\ as a stepping-stone to a better approxi¬ 
mation x = x 2 . Beginning with the approximation x = x\, we draw the tangent 
line to the curve at the point (x\,f(x\)). This line intersects the x-axis at the point 
x = * 2 , which seems to be a better approximation than x\ (see Fig. 4.37). Re¬ 
peating the process, we use the tangent line at (x 2 , f(x 2 )) to get to the point 
x = JC 3 , which is a still better approximation. Figure 4.37 illustrates the idea as a 
geometric procedure, but to apply it in calculations we need a formula. This for¬ 
mula is easily derived as follows. 

The slope of the first tangent line is f\x\). If we consider this line to be de¬ 
termined by the points (* 2 , 0 ) and (x\,f(x\)), then the slope is also 


0 -/(*i) 

x 2 ~ x { 


so 


0 ~/(*l) 

*2 ~ *1 


= /'(*!)• 


This equation yields 

-fix i) = (x 2 ~ x\)f (*i) or x 2 - xi = “TTr^r 

f Ui) 

so 


X2 =X\ 


fix i) 

/'Ui)’ 


( 2 ) 


In this way our first approximation x\ leads to a second approximation *2 given 
by ( 2 ); this in turn leads to a third approximation * 3 , given by 


and so on indefinitely. 


*3 = *2 - 


f(x 2 ) , 
f\x 2 y 


Example 


1 On applying this method 

fix) = jc 3 — 3jc — 5, 

f(x\) = -3, f\x x ) = 9, 


to equation ( 1 ), we have 

f'(x) = 3jc 2 - 3, X] = 2, 


x 2 = X[ 


fM _ -3 

fix 1 ) 9 


= 2i 


By writing x 2 in decimal form as x 2 = 2.333333, correct to six decimal places, 
and continuing with a good calculator, we get 


*3 = x 2 ” 


fix 2 ) 

fix 2 ) 


2.333333 


0.703699 

13.333329 


2.280556, 


rounding off to six decimal places. Since the burden of computation is on our 
calculator—and calculators are cheap labor—we shall continue working to six 
decimal places. When two successive approximations are equal in their first six 
decimal places, we shall consider this as evidence of accuracy. Thus, in the case 
of equation (1) we obtained *3 = 2.280556 after two applications of the proce¬ 
dure. Repeating this procedure yields 


*4 = 2.279020, 
*5 = 2.279019, 
x 6 = 2.279019. 
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We therefore conclude that x = 2.279019 is the desired solution of equation (1) 
accurate to six decimal places. 

Newton’s method is not restricted to the solution of polynomial equations like 
(1), but can also be applied to any equation containing functions whose deriva¬ 
tives we can calculate. 


Example 2 To illustrate this remark we consider the equation 

x = cos x, (3) 


where jc on the right side is understood to be measured in radians. The best way 
to begin thinking is to graph the two functions y = jc and y = cos jc on the same 
set of axes, as shown in Fig. 4.38. It is then easy to understand that these curves 
intersect at only one point and the jc-coordinate of this point is the solution of 
(3), because at the point of intersection the two y’s are equal. By inspecting Fig. 
4.38 we are able to give a good first approximation to this solution: 

X] = 0.7. 

To apply Newton’s method we write (3) in the form 

JC “ COS A' = 0 

and put /(jc) = jc — cos jc so that /'(jc) =14- sin jc. Now, setting our calculator to 
the radians mode, we find that 



x 2 = *1 


44*1 = 0.739436, 
/ (Jti) 


x 3 = * 2 - 4r4 = 0.739085, 
fix 2 ) 


Figure 4.38 


X4 = X3 — 


/(* 3) 

/'(*: 3) 


0.739085. 


This finishes the calculation and gives us the desired solution jc = 0.739085 cor¬ 
rect to six decimal places. 


Remark In some cases, the sequence of approximations produced by Newton’s 
method may fail to converge to the desired root. For example, Fig. 4.39 shows a 
function for which the approximation x\ leads to jc 2 and jc 2 leads back to jcj, so 
repetitions of the process do not bring us any closer to the root r than our initial 
guess. Specific examples of this behavior—and worse—are given in the prob¬ 
lems. The mathematical theory providing conditions under which Newton’s 
method is guaranteed to succeed can be found in books on numerical analysis. 


y. 



Figure 4.39 
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PROBLEMS 


1 


2 


3 


4 


5 


6 


7 


By sketching the graph of y = f(x) = x 3 - 3x - 5, show 
that equation (1) has only one real root. Hint: Use the 
derivative f'(x) = 3x 2 — 3 = 3(x 2 — 1) to locate the 
maxima and minima of the function and to learn where 
it is increasing and decreasing. 

(a) Show that x 3 + 3x 2 -6 = 0 has only one real root, 
and calculate it to six decimal places of accuracy. 

(b) Show that x 3 + 3x = 8 has only one real root, and 
calculate it to six decimal places of accuracy. 

Use Newton’s method to calculate the positive root of 
x 2 + x - 1 = 0 to six decimal places of accuracy. 
Calculate V5 to six decimal places of accuracy by solv¬ 
ing the equation x 2 — 5 = 0, and use this result in the 
quadratic formula to check the answer to Problem 3. 
Use Newton’s method to calculate N/To to six decimal 
places of accuracy. 

Consider a spherical shell 1 ft thick whose volume equals 
the volume of the hollow space inside it. Use Newton’s 
method to calculate the shell’s outer radius to six deci¬ 
mal places of accuracy. 

A hollow spherical buoy of radius 2 ft has specific grav¬ 
ity so it floats on water in such a way as to displace ^ 
its own volume. Show that the depth x to which it is sub¬ 
merged is a root of the equation x 3 — 6 x 2 + 8 = 0 , and 
use Newton’s method to calculate this root to six deci¬ 
mal places of accuracy. Hint: The volume of a spherical 
segment of height h cut from a sphere of radius a is 
7 rh 2 (a — h! 3). 


8 

9 


Suppose that by good luck our first approximation x\ 
happens to be the root of f(x) = 0 that we are seeking. 
What does this imply about X 2 , * 3 , etc.? 

Show that the function y =f(x) defined by 


fix) 


Vx — r x > r. 

— \r — x x < r. 


has the property illustrated in Fig. 4.39; that is, for any 
positive number a, if x\ = r + a , then X 2 = r — a, and if 
x\ = r — a, then X 2 = r + a. 

10 Show that Newton’s method applied to the function y = 
f(x) = 'Vx leads to x 2 = ~2x\, and is therefore useless 
— for finding where/(x) = 0. Sketch the situation. 

»"l 11 In Example 1 of Section 4.1 we saw from its graph that 
the function y =/(x) = 2x 3 - 3x 2 - 12x + 12 has pos¬ 
itive zeros close to x = 0.9 and x = 2.9. Use Newton’s 
method to calculate these zeros to six decimal places of 
—accuracy. 

12 Find a solution of 2x = cos x correct to six decimal 
_ places. 

»»1 13 Find the smallest positive solution of each of the fol¬ 
lowing equations, correct to six decimal places: 

(a) 4(x — 1) = sin x; 

(b) x 2 = sin x. 

14 How many solutions does the equation 


x = sin x 


have? Why? 


4.7 

(OPTIONAL) 
APPLICATIONS TO 
ECONOMICS. 
MARGINAL ANALYSIS 


Ever since its beginning, calculus has served primarily as a tool for the physical 
sciences. The uses of mathematics in the social sciences have arisen more re¬ 
cently. In this section we discuss several applications of calculus to microeco¬ 
nomics , the branch of economics that studies the economic decisions of individ¬ 
ual businesses or industries. More precisely, we focus our attention on the 
production and marketing of a single commodity by a single firm. 

The most important management decisions in a particular firm usually depend 
on the costs and revenues involved. We shall examine applications of derivatives 
to the cost and revenue functions. 


COST, MARGINAL COST, AND AVERAGE COST 

The total cost to a firm of producing x units of a given commodity is a certain 
function of x called the cost function and denoted by C(x). Here x can be the 
number of pieces produced, or the number of pounds, or the number of bushels, 
and so on. The cost C(x) can be measured in dollars, in thousands of dollars, in 
French francs, or in any other monetary unit. 

To determine the cost function C(x) is a difficult task for experts in book¬ 
keeping and accounting. Here, however, we take this function as given. We shall 
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Figure 4.40 


assume, for the sake of definiteness, that jc is the number of pieces or units pro¬ 
duced, and therefore a nonnegative integer, and also that the cost CQt) is mea¬ 
sured in dollars. For most commodities, such as TV sets or calculators, x can 
only be a nonnegative integer, so the graph of C(x) might look like the sequence 
of dots in Fig. 4.40. However, economists usually assume that these dots are con¬ 
nected by a smooth curve as shown in the figure. Accordingly, C(jc) is understood 
to be defined for all nonnegative values of x, not just for nonnegative integers. 

Many components make up the total cost. Some, like capital expenditures for 
buildings and machinery, are fixed and do not depend on x. Others, like wages 
and the cost of raw materials, are roughly proportional to the amount jc produced. 
If this were all, then the cost function would have the very simple form 

C(j c) = a + bx, 

where a is the fixed cost and b is the constant running cost per unit. 

But this is not all, and most cost functions are not as simple as this. The es¬ 
sential point is the fact that a time restriction is present, and that CQc) is the cost 
of producing x units of the product in a given time interval , say 1 week. There 
will then be a fixed cost of a dollars per week, as before, but the variable part 
of the cost will probably increase more than proportionally to jc as the weekly 
production x increases, because of overtime wages, the need to use older ma¬ 
chinery that breaks down more frequently, and other inefficiencies that arise from 
forcing production to higher and higher levels. The cost function C(jc) might then 
have the form 

a + bx + cx 2 or a + bx + cx 2 T dx 3 , 


or it might be a function even more complicated than these. The general nature 
of such a cost function is suggested in Fig. 4.40. 

The derivative C'(x) of the cost function is called the marginal cost. This de¬ 
rivative is, or course, the rate of change of cost with respect to the production 
level jc. The economic meaning of this important concept will become clearer as 
we proceed. 

As a first step in this direction we point out that it is a good approximation to 
think of the marginal cost C\x), at a given production level jc, as the extra cost 
of producing one more unit. To see this we recall the definition of the derivative, 


C'(jc) = lim 

\ x ^>0 


C(jc + Ac) - C(x) 
Ac 
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We therefore have the approximation 

„„ . _ C{x + Ax) - C(x) 

CW=-£-, 

where this approximation is good if Ajc is “suitably small.” It is customary in 
economics to assume that Ajc = 1 meets the requirement of being suitably small. 
Therefore we have 


C'(jc) = C(* + 1) - C(jc), 

approximately. In words, the marginal cost at each level of production jc is the 
extra cost required to produce the next unit of output [the (x + l)st unit]. 


Example 1 Suppose a company has estimated that the cost (in dollars) of pro¬ 
ducing jc units is 

C(x) = 5000 + lx + 0.02* 2 . 

Then the marginal cost is 

C'(x) = 7 + 0.04*. 

The marginal cost at the production level of 1000 units is 
C # (1000) = 7 + 0.04(1000) = $47/unit. 


The exact cost of producing the 1001st unit is 


C(1001) - C(I000) = [5000 + 7(1001) + 0.02(1001) 2 ] 

- [5000 + 7(1000) + 0.02(1000) 2 ] 


= $47.02. 


The difference between the marginal cost for * = 1000 and the exact cost of pro¬ 
ducing the 1001st unit is clearly negligible. 


The graph of a typical cost function is shown in Fig. 4.41. This cost function 
is increasing because it costs more to produce more. The marginal cost C'(;c) is 
the slope of the tangent to the cost curve. The cost curve is initially concave down 
(the marginal cost is decreasing), because it costs more to produce the first piece 
than to produce one more piece when many are being produced; this reflects the 
more efficient use of the fixed costs of production. At a certain production level 
*o there is a point of inflection Pq and the cost curve becomes concave up (the 
marginal cost is increasing), because when we produce almost as much as we 
can, it becomes more expensive to increase production by even a small amount. 
As we suggested earlier, reasons for this might include greater overtime costs or 
more frequent breakdowns of the equipment as we strain our productive capac¬ 
ity. 

It is a reasonable view that the most efficient production level for a manufac¬ 
turer is that which minimizes the average cost 

CM 

* 

which, of course, is the cost per unit when * units are produced. We sketch a typ¬ 
ical average cost curve in Fig. 4.42 by noticing that C(x)/x is the slope of the line 
joining the origin to the point P in Fig. 4.41. We know that some cost is un- 
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average cost. 


avoidable even before a single unit is produced—for instance, the capital ex¬ 
penditures mentioned earlier, utilities, insurance, and so on—so C(0) > 0. This 
shows that C{x)fx has the limit +°o as jc —» 0+. We are particularly interested in 
the minimum that C(x)lx appears to have. To locate this minimum we find the 
critical point x\ of the function C{x)!x by calculating the derivative by means of 
the quotient rule, 

_d_ ( C(x) \ = xC'jx) - C(x) 

dx \ X ) X 2 


This derivative must be zero at the critical point, so xC (jc) — C(jc) = 0 or 


C'W 


Of) 

x 


(1) 


We therefore have the following basic law of economics: 


If the average cost is a minimum, then 
marginal cost = average cost. 


In other words, at the peak of operating efficiency the marginal cost equals the 
average cost. We see from this that the graphs of marginal cost and average cost 
intersect at the point of minimum average cost, as shown in Fig. 4.42. Like other 
principles of economics, this is usually established by extensive verbal discus¬ 
sions supported by tables and graphs. However, the calculus derivation is brief 
and clear. 

Equation (1) has an interesting geometric interpretation for the cost function 
shown in Fig. 4.41: At the production level x = x\ where C(jc)/jc has a minimum, 
the line from the origin to the point P\ is tangent to the graph. We can see the 
reason for this by noticing that the average cost C(jc)/jc decreases as P moves to 
the right along the curve toward P i, and then increases as P moves beyond P\. 
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Example 2 A firm estimates that the cost (in dollars) of producing * units is 
C(x) = 3400 + 4x + 0.002* 2 . 

(a) Find the cost, marginal cost, and average cost of producing 500 units, 
1000 units, 1500 units, and 2000 units. 

(b) What is the minimum average cost, and at what production level is this 
achieved? 


Solution (a) The marginal cost is 

C'(*) = 4 + 0.004*. 


The average cost is 


CM 

* 


3400 

* 


+ 4 + 0.002*. 


We use these formulas to calculate the entries in the following table, giving all 
amounts in dollars (or dollars per unit) rounded to the nearest cent. 


* 

CM 

C\x) 

c(*y* 

500 

5,900 

6 

11.80 

1000 

9,400 

8 

9.40 

1500 

13,900 

10 

9.27 

2000 

19,400 

12 

9.70 


(b) When average cost is a minimum, we must have 
marginal cost = average cost, 


C\x) 


CM 

* 


4 + 0.004* = ^2 + 4 + 0.002*. 
* 


This equation simplifies to 


0 . 002 * 


3400 

* 


so 

* 2 = 1,700,000 


and 

* = V 1,700,000 = 1304. 


To verify that this production level actually gives a minimum for C(*)/*, we ob¬ 
serve that the second derivative (C(*)/*)" = 6800/* 3 > 0, so the graph of C(*)/* 
is concave up for all * > 0 and we have a minimum. Finally, the minimum av¬ 
erage cost is 


C( 1304) 
1304 


3400 

1304 


+ 4 


+ 0.002(1304) = $9.22. 


REVENUE, PROFIT, AND DEMAND 

It is clearly important for a manager to know all about the cost function, but this 
is not enough. The overall purpose of the firm is to make a profit, and for this it 
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is essential to consider the income from sales, or the revenue, as economists call 
it. And this requires us to bring the consumers (buyers) of the product into the 
picture. 

The revenue function R(x) is the total revenue (or income) derived from pro¬ 
ducing and selling x units of the product. The marginal revenue is the derivative 
R\x) of this function. By the same type of interpretation as used above, the mar¬ 
ginal revenue can be thought of as the extra revenue received from the sale of 
one more unit, 

R'(x)= R(x + 1) - R(x\ 

approximately. 

Example 3 Many business decisions are based on an analysis of the costs and 
revenues “at the margin,” or at the edge—hence the expression marginal analy¬ 
sis for this kind of thinking. 

To understand this, let us suppose we are running a taxi company in New York 
City and are trying to decide whether to add one more cab to our large fleet. If 
it will make money for the company then we add it, otherwise not. Clearly we 
need to consider the costs and revenues involved. Since the choice is between 
adding this cab and leaving the fleet the same size, the crucial question is whether 
the additional revenue generated by one more cab is greater or smaller than the 
additional cost incurred. This additional revenue and cost are precisely the mar¬ 
ginal revenue and marginal cost. Therefore, if the marginal revenue is greater 
than the marginal cost, then we should clearly add the cab and increase our profit. 
This is nothing but simple common sense expressed in the economists’ language 
of “marginal this” and “marginal that.” 


The total profit derived from producing and selling jc units is 

P(x) = R(x) - C(x). 


This is called the profit function; it is what is left over from the revenue after the 
cost is deducted. A firm will lose money when production is too low, because of 
fixed costs, and also when production is too high, because of high marginal costs. 
Unless the firm can operate profitably at some in-between level of production, 
the business will fail, so we can assume that the profit curve looks something 
like Fig. 4.43. 

The marginal profit is the derivative F (x) of the profit function. In order to 
maximize profit we look for the critical points of P(x), that is, the points where 
the marginal profit is zero. But if 

P'(x) =R\x) - C\x) = 0 

then 

R\x) = C\x). 

This gives another basic law of economics: 



Figure 4.43 A profit function. 


If the profit is a maximum, then 
marginal revenue = marginal cost. 


To satisfy ourselves that this condition gives a maximum and not a minimum, 
we can use the second derivative test, 
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P"(x) = R\x) - C"(x) < 0 


or 

R"(x) < C'\x). 

Thus the profit will be a maximum if 

R\x) = C'(jc) and R!\x) < C'\x). 

In order to put teeth into these generalities, we must consider the nature of the 
consumers who constitute the market. Normally, the higher the price of a com¬ 
modity, the smaller the number jc that will be sold. Thus jc, the number “de¬ 
manded,” is a decreasing function of the price p of a unit, and this function is 
usually determined by market research. The demand curve (Fig. 4.44) displays 
this dependence, and under these circumstances the variable jc is called the de¬ 
mand and the function x = x(p) is the demand function. For the sake of conve¬ 
nience in comparing the demand curve with the cost function, economists usu¬ 
ally interchange the axes and consider p as a function of jc, p = p(x), as shown 
in Fig. 4.45. This function is called the price function. 

When x units of a commodity are sold at a price of p(x) dollars per unit, then 
the revenue R(x) is evidently the product of the price per unit and the number of 
units sold, R(x) = xp(x), and the profit is 

P(x) = xp(x) - C{x). 

If both the price function p(x) and the cost function C(x) are known, then the law 
stated above can be used to find the value of jc that maximizes profit. It is clear 
that this value of jc need not be the one that minimizes the average cost, for the 
latter depends only on the cost function C(jc). That is, profit depends on the whims 
of the marketplace, while efficiency is an internal matter. 

Example 4 What production level will maximize profit for a firm with cost func¬ 
tion 

C(jc) = 2400 + 9jc + 0.002jc 2 
and demand function jc = 12,000 - 500/? ? 
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Solution First, we point out that the economic meaning of the demand function 
is that no units will be sold (.x = 0) at a price of $24 per unit, but for every dol¬ 
lar decrease in price, 500 more units will be sold. If we solve for p, then we ob¬ 
tain the price function 

P(X) = 24 " 500 X 
The revenue function is therefore 


R(x) = xp(x) = 24x - 


1 

500 


x 


2 


so the marginal revenue is 


*' W = 24 ~ 250 * 


and the marginal cost is 


C'(x) = 9 + 0.004* = 9 + 



When the profit is a maximum, then marginal revenue equals marginal cost, that 
is, 



= 9 + 


250 


x, 


and solving yields 



or * = 1875. 


To verify that this gives a maximum we calculate the second derivatives, 


/?"(*) = 


250’ 


C"(x) = 


250' 


Since R"(x) < C"(x) for all x, the production level x = 1875 maximizes profit. 
The corresponding price is p(1875) = $20.25. 


ELASTICITY OF DEMAND 

The nature of the demand curve in Fig. 4.44 depends on the particular product 
under consideration. It is relatively flat (or inelastic) for bread and motor oil, 
since people tend to buy what they need without much regard for the price, and 
relatively steep (or elastic ) for candy, since no one really needs it but more peo¬ 
ple buy more of it when the price is low. 

The elasticity of demand is an important concept of quantitative economics. 
To introduce it in a precise way, let (/?, x) be an arbitrary point on the demand 
curve in Fig. 4.44. If p increases by a small amount A p and —Ax is the corre¬ 
sponding decrease in x, then the ratio of the percentage decrease in x to the per¬ 
centage increase in p is 

100(—Ax/x) _ p_ Ax 
100(A p/p) x A p' 

The elasticity of demand E(p) at the price level p is now defined by 
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E(p) = lim 

A />—>0 



p dx 
x dp' 


The demand is said to be elastic if E(p) > 1 and inelastic if E(p) < 1. The pos¬ 
itive function E(p) is a useful tool of economic analysis because it measures the 
responsiveness of the demand to changes in the price p: it is small when the de¬ 
mand curve is relatively flat, so that changes in p induce relatively smaller changes 
in x, and large when this curve is relatively steep. It also has the merit of being 
independent of the units of measurement used for p and x This is a great con¬ 
venience in many economic and business situations. For example, changing the 
units of p from dollars to French francs, say, and the units of x from pounds to 
kilograms would leave the value of the elasticity E(p) unchanged, because this 
quantity involves only the percentage changes in p and x. 


Example 5 In Example 4 the demand function is jc = 12,000 — 500 p. Find E(p). 
At what price p is the demand elastic? Inelastic? 


Solution 


From the definition we have 
p dx _ 


E(p) 


x dp 


12,000 - 500p 
500p _ _p 


(-500) 


12,000 - 500p 24 -p 

From Example 4 we understand that 0 < p < 24, so the condition E(p) > 1 is 
equivalent to 


P 

24 -p 


> l 


or p > 24 - p or 2p > 24 or p > 12, 


so the demand is elastic forp > 12. Similarly, the demand is inelastic for p < 12. 
To understand what this means, we observe that when the revenue is expressed 
as a function of p (instead of x) we have R(p) = p( 12,000 - 500p) = 500p • 
(24 — p). It is easy to see from this that revenue is maximized for p = 12. There¬ 
fore to maximize revenue, the price must be lowered if the demand is elastic and 
raised if the demand is inelastic. These conclusions are valid for any decreasing 
demand function, whether it is linear or not (see Problem 26). 


The commonsense interpretation of all this is clear: If the demand is elastic at 
a given price, then a price decrease by a certain percentage causes a proportion¬ 
ally larger increase in sales, so the revenue, which is the product of price and 
sales, is increased. Similarly, if the demand is inelastic, then a price increase by 
a certain percentage causes a proportionally smaller decrease in sales, so again 
the revenue is increased. 

The discussions of this section suggest several ways in which derivatives can 
be used in economics. The most influential contribution to this subject in the 
twentieth century was perhaps Keynes’s General Theory of Employment, Inter¬ 
est and Money, which has been characterized as “an endless desert of econom¬ 
ics, algebra and abstraction, with trackless wastes of differential calculus, and 
only an oasis here and there of delightfully refreshing prose.”* This may be some- 


*Chapter IX of The Wordly Philosophers , by Robert L. Heilbroner. 
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what exaggerated for the sake of its juicy phrases, but nevertheless the general 
impression is valid—that modern economics makes extensive use of many kinds 
of mathematics, especially calculus. 


PROBLEMS 


J In Problems 1-4 a cost function (in dollars) is given for pro¬ 
ducing jc units of a certain commodity. In each case find the 
marginal cost at the production level of 500 units and also the 
actual cost of producing the 501st unit. 

1 C(x) = 15,000 + 13* + 0.03jc 2 . 

v2 


22 


io ' Too' 

C(x) = 5000 + 15jc - O.OIjc 2 + 0.0001jc 3 . 
C(x) = 3000 + 1 OOVx 


2 C(x) = 400 + — + 77 ^-. 

3 

4 


I For each of the cost functions in Problems 5-10, find the min¬ 
imum average cost and the production level at which this is 
achieved. 

5 C(jc) = 8000 + 1 5jc + jc 2 . 

6 CM = 2400 + 3jc + 0.02jc 2 . 

7 C(j c) = 60 + { jc + 95q jc 2 . 

8 CM = 5000 + 2jc + 0.00 Ijc 3 . 

9 C(,) = 2V^ + ^. 

10 CM = 10,000 + 8jc + 4jc 3/2 . 

For each of the cost and price functions in Problems 11-16, 
find the production level that maximizes profit. 

11 CM = 1240 + 8jc + 0.02jc 2 , pM = 16. 

C(jc) = 1240 + 8jc + 0.02jc 2 , pM = 16 - ^x. 


12 

13 

14 

115 
16 

17 


23 


C(jc) = 900 + 35jc + O.OOIjc 2 , p(x) = 65 - ^jc. 

CM = 750 + 140jc - 0.2jc 2 + ^jc 3 , 
p(x) = 300 — {jc. 

CM = 4500 + 50jc - jc 2 + 0.002jc 3 , 
p(x) = 80 — 0.0 Ijc. 

jc 2 

CM = 6000 + \5x - — + O.OOIjc 3 , 
p(x) = 120 — 0.05jc. 

A Broadway theater has seats for 2000 playgoers. With 
the ticket price at $50, the average attendance at a mod¬ 
erately successful play has been 1200. When the ticket 
price was lowered to $40, the average attendance rose to 
1400. 

(a) Find the price function, assuming that it is linear. 

(b) What should the ticket price be to maximize revenue? 


In Problems 18-21 use the given demand function to find the 
selling price p that maximizes the revenue. 

18 jc= 1200 - 20 p. 

19 jc = 800 — 2.5 p. 

20 jc = 160 — p 3/2 . 

21 jc = 768 — p 2 . 


24 


25 


26 


A perfect competitor is an enterprise that has such a small 
share of the market that it cannot influence the price of 
its product andean sell as much as it produces at the pre¬ 
vailing market price p. Figure 4.46 shows the cost and 
revenue curves of a certain perfect competitor. Sketch the 
profit curve. 



The daily cost to a small company of producing jc hand 
calculators is C(jc) = 1560 + 50jc - 8jc 2 4- {jc 3 dollars. 
The market price of this calculator is $130. What is the 
maximum daily profit, and what is the daily output jc that 
yields this profit? 

A small company with fixed costs (overhead) of a dol¬ 
lars produces jc units of a commodity which it sells at a 
fixed price of p dollars per unit. If it costs b dollars to 
produce each unit, where b < p, at what output level does 
the company break even, and what is the graphical in¬ 
terpretation of this break-even point? 

Suppose the company in Problem 24 produces trout fish¬ 
ing instructional videotapes for $8 that it sells for $30. 
If the overhead is $14,000, how many tapes must be sold 
to break even? 

Consider a demand curve jc = jc (p), where x(p) is any de¬ 
creasing function. 

(a) If E(p) > 1, show that the revenue R = px is in¬ 
creased by lowering the price. 

(b) If E(p) < 1, show that the revenue is increased by 
raising the price. 

(c) Establish the formula 
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and use this to deduce that E(p) = 1 at a point on 
the demand curve where the revenue is a maximum. 
27 Verify the conclusion in part (c) of Problem 26 for each 
of the demand curves in Problems 18-21: 


(a) x = 1200 - 20 p\ 

(b) jc = 800 - 2.5 p\ 

(c) jc = 160 - p 3/2 ; 

(d) jc = 768 - p 2 . 


CHAPTER 4 REVIEW: CONCEPTS, METHODS 


Define, state , or think through the following. 

1 Increasing and decreasing functions. 

2 Critical points and critical values. 

3 Relative (or local) maximum and minimum values. 

4 Absolute maximum and minimum values. 

5 Procedure for curve sketching. 

6 Endpoints, cusps, and comers. 

7 Concave up and concave down. 

8 Point of inflection. 


9 Second derivative test. 

10 Strategy for applied maximum-minimum problems. 

11 Law of reflection. 

12 Snell’s law of refraction. 

13 Fermat’s principle of least time. 

14 Rate of change. 

15 Strategy for related rate problems. 

16 Newton’s method. 


ADDITIONAL PROBLEMS FOR CHAPTER 4 


SECTION 4.1 

Sketch the graphs of the following functions by using the first 
derivative and the methods of Section 4.1; in particular, find 
the intervals on which each function is increasing and those 
on which it is decreasing, and locate any maximum or mini¬ 
mum values it may have. 

1 y = ^x 3 -^-2x + i 

2 y = x 3 + 6x 2 + I2x + 8. 

3 y = -x 3 + 3x + 2. 

4 y = x 3 + 3x — 2. 

5 y = x 4 — 6x 2 + 8x. 

6 y = (x + 2) 3 (x - 4) 3 . 

7 y = x 4 — 4* 3 + 16. 


*21 y = x 2 (x — 4) 2/3 . 

22 y = \fx H - — 2 V 2 . 

Vr 


SECTION 4.2 

For each of the following, locate the points of inflection, find 
the intervals on which the curve is concave up and those on 
which it is concave down, and sketch. 

23 y = JC 3 + X. 

24 y = jc 3 + 3jc 2 + 6jc + 7 . 

25 y = X 3 - 12 jc + 2 . 

26 y — jc 4 — 2jc 2 . 

27 y = x 4 + 4jc 3 . 

28 y = (jc + 2)(JC - 2) 3 . 

29 y — x 4 — 4jc 3 - 2x 2 + 12jc - 1 . 

30 y = *31 y = — — (a > 0 ). 


8 

y = 3* 5 — 1 Ojc 3 + 15* + 3. 



9 

y = x 2 {x+ I) 2 . 



*32 

10 

y = x 3 (x — l) 2 . 

11 

y = x 2 (4 - x 2 ). 


*12 

* 3 

13 

X 

*34 

y — 

y ~ (X+ l) 2 ' 


7 * +1 



14 

y = fx 3 + 

15 

4(* 2 — 1) 

y = 4 

36 


X 


JC 

37 

16 

4(*-l) 

^ * 2 ' 

17 

, 16 
y = x 2 -\ — 2 - 

X 2 


18 

,3 * 

1 ' 

II 

5S 

19 

5jc 2 + 2 
y= * 2 + i • 


20 

5* 2 - 20* + 21 




jc 2 - 4jc + 5 





JC 

*31 


X 3 

V* 2 + 1 

y — 

x 2 + 3 a 2 

1 

33 


5 

* 3 + 1 ■ 

y = 

3jc 4 + 5' 

<N 

m 

^ 1 

35 

y = 

_8__ 2 

X 3 JC* 

-+4- 

JC x z 





the second derivative of a function to locate the points 
of inflection, the intervals on which the graph is con¬ 
cave up, and the intervals on which the graph is con¬ 
cave down: 

(a) y” = jc 2 (jc — 1 )(jc — 2) 2 ; 

(b) y" = (jc 2 + 2)(x + 2) 2 (jc - 1)(jc - 2); 

(c) y" = x(x — 1)(jc 2 — 4)(jc — 3). 

38 If f(x) = (jc — a)(jc - b)(x — c), find the jc-coordinate 
of the point of inflection. Hint: See Additional Prob¬ 
lem 22 in Chapter 3. 















ADDITIONAL PROBLEMS FOR CHAPTER 4 


157 


39 Find the value of a that makes /(x) = ax 2 + \lx 2 have 
a point of inflection at x = 1. 

*40 Consider the general cubic curve y = ax 3 + bx 2 + 
cx + d. 

(a) Show that the curve has one and only one point of 
inflection, 


/ = 



where k = 


2 b 3 
21a 2 


be 
3 a 


+ d. 


(b) Show that the curve has one maximum point and 
one minimum point if and only if b 2 — 3ac > 0. 

(c) When the curve has a maximum point P and a min¬ 
imum point Q, show that the abscissa (x-coordi- 
nate) of 7 is the average of the abscissas of P and 
Q. Hint: Recall how to find the sum of the roots 
of a quadratic equation from its coefficients. 

(d) Part (c) suggests that our general cubic curve might 
be symmetric with respect to its point of inflec¬ 
tion 7. Prove this by (1) introducing a new X-axis 
and F-axis by means of 


X = 



and 


Y = y — k. 


so that the origin of the AT-system is the point 7; 
(2) showing that the equation of our curve in the 
AT-system is 


F = 


aX X 2 - 


b 2 - 3 ac\ 
3a 2 f 


and (3) observing that this transformed equation 
is symmetric with respect to the origin of the XF- 
system. 


SECTION 4.3 

41 Find the positive number that exceeds its cube by the 
largest amount. 

42 Find two positive numbers x and y such that their sum 
is 30 and the product xy A is a maximum. 

43 Find two positive numbers x and y such that their sum 
is 56 and the product x 3 y 5 is a maximum. 

44 (Generalization of the preceding problems) Let m and 
n be given positive integers. If x and y are positive 
numbers such that x + y = S, where 5 is a constant, 
show that the maximum value of the product P = x m y n 
is attained when 

mS nS 

x =- and y = - . 

m + n m + n 

*45 Express the number 18 as the sum of two positive num¬ 
bers in such a way that the sum of the square of the 
first and the fourth power of the second is as small as 
possible. 

46 Find the positive number such that the sum of its cube 
and 48 times the reciprocal of its square is as small as 
possible. 


47 The sum of three positive numbers is 15. Twice the 
first plus three times the second plus four times the 
third is 45. Choose the numbers so that the product of 
all three is as large as possible. 

*48 (A generalization of Problem 6 in Section 4.3) Con¬ 
sider a rectangle with sides 2x and 2 y inscribed in a 
given fixed circle x 2 + y 2 = a 2 , and let n be a positive 
number. We wish to find the rectangle that maximizes 
the quantity z — x n + y n . If n = 2, it is clear that z has 
the constant value a 2 for all rectangles. If n < 2, show 
that the square maximizes z; and if n > 2, show that 
z is maximized by a degenerate rectangle in which x 
or y is zero. 

49 Show that of all triangles with given base and given 
perimeter, the one with the greatest area is isosceles. 
Hint: Use Heron’s formula for the area, 

A = Vs(s — a)(s — b)(s — c), 

where a, b, c are the sides and 5 is the semiperimeter 
(half the perimeter). 

50 Show that of all triangles with given base and given 
area, the one with the least perimeter is isosceles. Hint: 
If the base lies on the x-axis and is bisected by the ori¬ 
gin, and if the third vertex (x, h) has a fixed height 
above the x-axis, then the triangle is isosceles if x = 
0 . 

51 If a and b are positive constants, the region between 
the parabola a 2 y = a 2 b — 4 bx 2 and the x-axis is a par¬ 
abolic segment of base a and height b. Find the base 
and height of the largest rectangle with lower base on 
the x-axis and upper vertices on the parabola. 

52 A circle of radius a is divided into two segments by a 
line L at a distance b from the center. The rectangle 
of greatest possible area is inscribed in the smaller of 
these segments. How far from the center is the side of 
this rectangle that is opposite to the line L? 

*53 Two straight fences meet at a point, but not necessar¬ 
ily at right angles. A post stands in the angle between 
them. If a triangular corral is constructed by building 
a new straight fence containing this post, show that the 
fenced-off triangle has minimal area when the old post 
is in the center of the new fence. (Notice that this gen¬ 
eralizes the result of Problem 7 in Section 4.3.) 

*54 A line through a fixed point ( a , b) in the first quad¬ 
rant intersects the x-axis at A and the y-axis at B. Show 
that the minimum values of AB and OA + OB are 

( a 2/3 + £>2/3) 3/2 and (Va + Vb) 2 . 

*55 (A generalization of Example 4 and Problem 26 in 
Section 4.3) First, notice that areas of similar figures 
are proportional to the squares of corresponding 
lengths, as in Fig. 4.47, where 

A = c\p 2 = c 2 d 2 = c 3 x 2 = c 4 y 2 
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P 



y 


Figure 4.47 


for suitable constants c\, C 2 , c 3 , c 4 . Here p is the 
perimeter, d is the diameter—the length of the longest 
chord—and x and v are the indicated lengths. The con¬ 
stants ci, C 2 , c 3 , c 4 are evidently the areas when p = 
1, d = 1, x - 1, y = 1. Now, cut a wire of length L 
into two pieces and use these pieces as the perimeters 
p and P of figures of two specified shapes ( Fig. 4.48), 
so that p + P = L. Then the sum of the areas is 

A = A] + A 2 = ap 2 + bP 2 = ap 2 + b(L — p) 2 , 



Figure 4.48 



where 0 < /? < L (we allow either perimeter to be 
zero). Show that 

(a) the minimum combined area is abL 2 /(a + b), cor¬ 
responding to 


P = 


b 

a + b 


L 


and P= - , L; 

a + b 


(b) the maximum combined area is the larger of the 
one-figure areas aL 2 and bL 2 , corresponding to 
p = L and p = 0. 

Also, verify that these conclusions contain as special 
cases the results of Example 4 and Problem 26 in Sec¬ 
tion 4.3. 

56 A printed page must have A square inches of printed 
matter and is required to have side margins of width 
a inches and top and bottom margins of width b inches. 
Find the length of the printed lines if the page is de¬ 
signed to use the least paper. 

57 For a certain printed page, the widths of the four mar¬ 
gins (possibly all different) and the area of the printed 
matter are specified. Show that the least paper is re¬ 
quired if the full page is similar in shape to the rec¬ 
tangle of printed matter. 

58 A dormer window has the shape of a rectangle sur¬ 
mounted by an equilateral triangle. If the total perime¬ 
ter is fixed, find the proportions of the window (i.e., 
the ratio of the height of the window to its base) that 
will admit the most light. 


59 A long strip of sheet metal 8 in wide is to be made 
into a rain gutter by turning up two sides at right an¬ 
gles to the bottom. If the gutter is to have maximum 
capacity, how many inches should be turned up on the 
sides? 

60 A playing field is to be built in the shape of a rectan¬ 
gle with a semicircular part at each end, and the 
perimeter is to be a race track of specified length. Find 
the proportions of the field that will give the rectan¬ 
gular part as large an area as possible. 

61 A farmer wishes to use 5 acres of land along a straight 
river to construct 6 small pens by means of a fence 
parallel to the river and 7 fences perpendicular to it. 
Show that if the total amount of fencing is to be min¬ 
imized, the parallel fence should be as long as all the 
others combined. 

62 An automobile manufacturer estimates that he can sell 
5000 cars a month at $10,000 each and that he can sell 
500 more cars per month for each $200 decrease in 
price. 

(a) What price per car will bring the largest gross in¬ 
come? 

(b) If each car costs $4000 to make, what price will 
bring the largest total profit? 

63 A manufacturer of electric knives estimates that her 
weekly production costs are given by the formula C = 
9500 + 8x + 0.00025x 2 , where x is the number of 
knives manufactured in a week. 1 * The sales department 
estimates that if the selling price is set at y, then x = 
13,000 — 500v knives can be sold.* How many knives 
should be manufactured each week, and what should 
their selling price be, in order to achieve maximum 
profit? 

*64 The cost for fuel of running a large paddlewheel steam¬ 
boat at a speed of v miles per hour through still water 
is $?; 3 /24 per hour. Other costs—wages, insurance, 
etc.—are $108 per hour. What is the most economi¬ 
cal speed for a certain trip upstream against a current 
of 2 mi/h? 

65 A feedlot operator has a herd of 200 cows in his pens, 
each weighing 600 lb. The cost of keeping one cow 
for a day is 80 cents. The cows are gaining weight at 
the rate of 8 lb/day. The market price for cows is now 
$ 1.25/lb, but is dropping 1 cent a day. How many days 
should the operator wait in order to sell his cows for 
the largest profit? 


f The overhead is $9500 per week; the cost of labor and materials is 
$8 per knife; and the term 0.00025x 2 , which is small unless x is very 
large, says—in effect — that the factory has a fixed size and loses ef¬ 
ficiency if too much production is attempted. 

*This formula says that sales are expected to be 5000 at a selling price 
of $16, with a loss of 500 sales for each $1 increase in price. 
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66 An estimate of the numerical value of a certain quan¬ 
tity is to be determined from n measurements *i, * 2 , 

. . . , x n . The least squares estimate is the number x 
that minimizes the sum of the squares, 

S =(x - Xi) 2 + (x - x 2 ) 2 + ■ ■ ■ + {x - x„) 2 . 

Show that this least squares estimate is the arithmetic 
mean of the measurements, 

X\ + X 2 + • ‘ • + x n 
X = -. 

n 

67 As a woman starts jogging across a 300-ft bridge, a 
man in a canoe passes directly below the center of the 
bridge. The woman is moving at the rate of 9 ft/s and 
the man at the rate of 12 ft/s. 

(a) What is the shortest horizontal distance between 
the woman and the man? 

(b) If the bridge is 288 ft high, what is the shortest 
distance between the woman and the man? 

SECTION 4.4 

68 Find the height of the cylinder of maximum lateral 
area that can be inscribed in a sphere of radius R. Show 
that this maximum lateral area is half the surface area 
of the sphere. 

69 A cylinder is generated by revolving a rectangle of 
given perimeter about one of its sides. What is the 
shape (ratio of height to diameter of base) of the cylin¬ 
der of maximum volume? 

*70 The cone of smallest possible volume is circumscribed 
about a given hemisphere. What is the ratio of its 
height to the diameter of its base? 

71 If the volume of a cone is fixed, what shape (ratio of 
height to diameter of base) minimizes its total surface 
area? 

72 A pyramid has a square base and four equal sloping 
triangular faces. If the total area of the bottom and 
faces is given, show that the volume is greatest when 
the height is V2 times the edge of the base. 

73 A cylinder is generated by revolving a rectangle about 
the x-axis, where the base of the rectangle lies on the 
x-axis and its upper vertices lie on the curve y = 
x/(x 2 + 1). What is the largest volume such a cylinder 
can have? 

74 (A problem of Kepler) Consider a cylinder with a given 
fixed distance D from the center of a generator to the 
farthest point of the cylinder. If this cylinder has the 
largest possible volume, what is the ratio of its height 
to the diameter of its base? 

75 A solid is formed by cutting hemispherical cavities in 
the ends of a cylinder. If the total surface area of this 
solid is given, find the shape of the cylinder (ratio of 
height to diameter of base) that maximizes the volume 
of the solid. 


76 A given cone has height H and radius of base R. A 
second cone is inscribed in the first with its vertex at 
the center of the base of the given cone and its base 
parallel to the base of the given cone. Find the height 
of the second cone if its volume is as large as possi¬ 
ble. 

77 Closed cylindrical cans are to be made with a speci¬ 
fied volume. There is no waste involved in cutting the 
sheet metal that goes into the curved lateral part, but 
each end is to be cut from a square piece of metal and 
the scraps discarded. Find the ratio of the height to the 
diameter of the base that minimizes the cost of sheet 
metal. 

78 A certain tank consists of a cylinder with hemispher¬ 
ical ends. For a given surface area, describe the shape 
of the tank with maximum volume. 

79 A rectangle of tin whose sides are a and b is to be 
made into an open-top box by cutting a square from 
each corner and bending up the flaps to form the sides. 
How large a square should be cut from each comer to 
make the volume of the box as large as possible? 

80 An aquarium is to be 4 ft high and is to have a vol¬ 
ume of 88 ft 3 . The base, ends, and back are to be made 
of slate, and the front is to be made of special rein¬ 
forced glass that costs 1.75 times as much as the slate 
per square foot. What should the dimensions be to min¬ 
imize the cost of materials? 

81 A circular filter paper of radius a is to be formed into 
a conical filter by folding under a circular sector. Find 
the ratio of the radius to the depth for the filter of great¬ 
est capacity. 

82 A frame for a cylindrical lampshade is to be made from 
a piece of wire 20 ft long. The frame consists of two 
equal circles, four wires from the upper circle to the 
lower circle, and two diametrical wires in the upper 
circle. Find the height and radius that will maximize 
the volume of the cylinder. 

83 A box with a lid is to be made from a square sheet of 
cardboard 18 in on a side by cutting along the dotted 
lines as shown in Fig. 4.49. The cardboard is then 


-1 L_ 


Figure 4.49 
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folded up to form the ends and sides, and the flap is 
folded over to form the lid. What are the dimensions 
of the box of greatest volume? 

84 On a calm day the atmospheric pollution spreading out 
from a city is directly proportional to the population 
of the city and inversely proportional to the distance 
from the city. A retired forester wishes to start a tree 
nursery somewhere on a straight highway between two 
cities 60 km apart. If the first city is four times as large 
as the second, where should the forester locate his 
nursery to minimize the effect of pollution on his 
young trees? 

85 The x-axis is the southern shore of a lake containing 
a small island at the point (a, b ) in the first quadrant. 
A woman at the origin can run r meters per second 
along the shore and can swim s meters per second, 
where r > s. If she wants to reach the island as quickly 
as possible, how far should she run before she starts 
to swim? 

86 Two towers 30 m apart are 30 and 70 m high, respec¬ 
tively. A taut wire fastened to the top of each tower is 
anchored to the ground between the towers. How far 
from the shorter tower will the wire touch the ground 
if its total length is a minimum? (Can you solve this 
problem without calculus?) 

87 Find the equation of the circle with center at the ori¬ 
gin that is internally tangent to the parabola 8y = 48 - 
x 2 . 

88 Sketch the curve y = Vx 2 + 16 and find the point on 
it that is closest to the point (6, 0). 

*89 Find the point on the parabola y 2 = 3x that is closest 
to the point (4, 1). 

90 What points on the curve x 2 y = 16 are closest to the 
origin? 

91 For what points on the circle x 2 + y 2 = 25 is the sum 
of the distances from (2, 0) and ( — 2, 0) a minimum? 

92 Let P = (x, y) be a variable point on the line ax + 
by + c = 0 and let Pq = (xo, Jo) be a fixed point not 
on this line. 

(a) If s is the distance from Pq to P, use the methods 
of calculus to show that s 2 (and therefore s) is a 
minimum when PPy is perpendicular to the given 
line. 

(b) Show that the minimum distance is 

|ar 0 + byp + c| 

Va 2 + b 2 

93 A smooth graph not passing through the origin always 
has a point (xo, yo) that is closest to the origin. + If this 

+ For the purposes of this problem, interpret the phrase “smooth graph” 
to mean the graph of a function y =/(x) defined for all x or on a 
closed interval a < x < b, whose derivative/'(x) exists in the interior 
of its domain. 


point is not an endpoint, show that the line from the 
origin to (xo, yo) is perpendicular to the graph. 

94 If a, b, c are positive constants, show that ax + blx > 
c for all positive numbers x if and only if 4 ab ^ c 2 . 

95 If a, b, c are positive constants, show that ax 2 + blx > 
c for all positive numbers x if and only if 21 ab 2 > 4c 3 . 

96 Consider the general quadratic function/(x) = ax 2 + 
bx + c with a > 0. By calculating the minimum value 
of this function, show that/(x) ^ 0 for all x if and only 
if b 2 — 4 ac < 0. 

97 By applying the idea of Problem 96 to the function 
fix) = ia\X + b x ) 2 + {a 2 x + b 2 ) 2 

+ • • • + ( a n x + b,,) 2 , 

establish Schwarz’s inequality. 

\ a \b\ + • • • + a n b„\ 

< (a, 2 + ■ • ■ + a n 2 ) l/2 (b\ 2 + • • • + b 2 ) U2 . 

Also show that equality holds here if and only if 
there exists a number x such that b { = —a,x for every 
i = 1, 2, . . . , n. 

SECTION 4.5 

98 A cubic block of ice is melting at the rate of 6 in 3 /min. 
How fast is its surface area changing when its edge is 
12 in long? 

99 A light is on the ground 50 ft from a building. A man 
6 ft tall walks from the light toward the building at 4 
ft/s. Find how rapidly the length of his shadow on the 
building is decreasing (a) when he is 40 ft from the 
building; (b) when he is 30 ft from the building. 

100 Two airplanes are flying westward on parallel courses 
9 mi apart. One flies at 425 mi/h and the other at 500 
mi/h. How fast is the distance between the planes de¬ 
creasing when the slower plane is 12 mi farther west 
than the faster plane? 

101 A conical tank with its vertex down is 8 ft high and 4 
ft in diameter at the top. It is full of water, but the wa¬ 
ter is leaking out through a hole in the bottom at the 
rate of 1 ft 3 /min. Find the rate at which the water level 
is falling when the tank is | empty. 

102 Assume that water squirts out a hole in the bottom of 
a tank at a speed proportional to the square root of the 
depth y of the water. If the tank has the shape of a cone 
with its vertex down, show that the rate of change of 
the depth is 



where c is a positive constant. 

103 Water is being pumped into an open-top cylindrical 
tank of radius 5 ft at the rate of 6 ft 3 /min. At the same 
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time, water is squirting out a hole in the bottom of the 
tank at the rate of 2Vy ft 3 /min, where y is the depth 
of the water. How high must the tank be for the water 
level to stabilize before it overflows? 

104 A long rectangular tank has a sliding panel that di¬ 
vides it into two adjustable tanks of width 4 ft (see 
Fig. 4.50). Water is pumped into the left compartment 
at the rate of 12 ft 3 /min. At the same time the panel is 
moved steadily to the right at the rate of 1 ft/min. In 
each of the following situations determine whether the 
water level is rising or falling, and how fast: (a) the 
left compartment contains 144 ft 3 of water and is 9 ft 
long; (b) the left compartment contains 144 ft 3 of wa¬ 
ter and is 18 ft long. 



105 A large volume V of oil is spilled into a calm sea from 
a broken tanker. After the initial turbulence has died 
down, the oil spreads in a circular pattern of radius r 
and uniform thickness h, where r increases and h de¬ 
creases in a manner determined by the viscosity and 
buoyancy of the oil. Laboratory experiments suggest 
that the thickness is inversely proportional to the 
square root of the elapsed time, h = d\ft. Show that 
the rate dr/dt at which the oil spreads is inversely pro¬ 
portional to r 3/4 . 

106 String of radius ^ in is being wound onto a ball at the 
rate of 32 in/s. If the ball is assumed to remain spher¬ 
ical and to consist entirely of string with no empty 
space, find the rate at which its radius is increasing 
when the radius is 2 in. 

107 Thread is being unwound at the rate of a inches per 
second from a spool of radius r inches. The unwound 
part of the thread has length * inches and is stretched 
taut into a segment PT tangent to the spool at the point 
T. Find the rate of increase of the distance y from the 
axis of the spool to the point P at the end of the thread. 

108 Meteorologists are interested in the adiabatic expan¬ 
sion or compression of large masses of air, in which 
temperatures may change but no heat is added or sub¬ 
tracted. The adiabatic gas law for air is pV lA = c, 
where p is pressure, V is volume, and c is a constant. 
The volume of a certain insulated chamber of air is 


decreasing steadily at the rate of 1 ft 3 /min. Find how 
rapidly the pressure is increasing at an instant when 
the pressure is 65 lb/in 2 and the volume is 13 ft 3 . 

109 If a rocket weighs 1000 lb on the surface of the earth, 
then it weighs 


(1 + r/4000) 2 

pounds when it is r miles above the surface of the earth. 
If the rocket is rising at the rate of 1.25 mi/min, how 
fast is it losing weight when its altitude is 1000 mi? 

110 Wheat is being poured into a pile at the constant rate 
of 36 ft 3 /min. If the pile always has the shape of a cone 
whose height is half the radius of the base, at what 
rate is the height increasing when the diameter of the 
pile is 12 ft? 

111 Gravel is being poured onto a pile, forming a cone. If 
the radius of the base is increasing at the rate of 
3 m/min and the height is increasing at the rate of 
1 m/min, how rapidly is the volume increasing when 
the height is 4 m? 

112 A chord moves across a circle of radius 5 ft at the rate 
of 4 ft/min. How fast is the length of the chord de¬ 
creasing when it is f of the way across the circle? 

113 A point moves along the parabola x 2 = 4 py in such a 
way that its projection on the x-axis has constant ve¬ 
locity. Show that its projection on the y-axis has con¬ 
stant acceleration. 

114 Two points A and B are moving along the x-axis and 
y-axis, respectively, in such a way that the perpendic¬ 
ular distance k from the origin O to the segment AB 
remains constant. If A is moving away from O at the 
rate of Ak units per minute, find how fast OB is chang¬ 
ing, and whether it is increasing or decreasing, at the 
moment when OA = 3k. 

115 One side of a rectangle is increasing at the rate of 
7 in/min and the other side is decreasing at the rate of 
5 in/min. At a certain moment the lengths of these two 
sides are 10 and 7 in, respectively. Is the area of the 
rectangle increasing or decreasing at that moment? 
How fast? 

116 Two concentric circles are expanding. At a certain mo¬ 
ment, designated by t = 0, the inner radius is 2 ft and 
the outer radius is 10 ft; and for t > 0, these radii are 
increasing at the steady rates of 4 ft/min and 3 ft/min, 
respectively. If A is the area between the circles, when 
will A have its largest value? 

*117 Two concentric spheres are expanding. At time t = 0, 
the inner and outer radii r and R have the values ro 
and Ro feet, respectively. For t > 0, these radii are in¬ 
creasing at the steady rates of a and b feet per minute, 
where a > b > aro 2 /Ro 2 . If V is the volume between 
the spheres, when will V have its largest value? 
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-SECTION 4.6 

118 Show that each of the following equations has only 
one real root, and calculate it to six decimal places: 
r— (a) jc 3 + 5jc - 2 = 0; (b) jc 3 + 2jc - 4 = 0. 

"■■1 119 Calculate each of the following to six decimal places 
of accuracy: 

(a) VTT; (b) ^9\ (c) 

120 Let a be a given positive number and x\ a positive 
number that approximates Va. 

(a) Show that Newton’s method applied to the equa¬ 
tion x 2 — a = 0 gives x 2 = y(jq + alx\) as the next 
approximation. 1- 

(b) If X! 4 y/a, show that the approximation y(jci + 
a/x\) is greater than y/a, regardless of whether x\ 
is greater than y/a or less than y/a. _Hint: Show 
that the inequality y(jci + a/x\) > \/a is equiva¬ 
lent to (Va^ - Vo/jc i) 2 > 0. 

(c) If the approximation x\ is too large, i.e., if 
x\ > y/a, show that y(jcj + a/x\) is a better ap¬ 
proximation in the sense that 

\[ x , + fyv- a < x ,-v a . 

(d) Assumejhat the approximation x\ is too small, i.e., 
x\ < y/a, but is large enough so that x\ > yV*. 
Show that \(x\ + a/x\) is a better approximation 
in the sense that 


the cup if the new volume is four-fifths of the origi¬ 
nal volume? 


2 



124 The formula in Problem 7 of Section 4.6 was discov¬ 
ered by Archimedes. Use it to show that if a plane at 
distance x from the center of a sphere of radius 1 cuts 
off y the volume of the sphere, then jc is a solution of 
the equation 

3jc 3 — 9* + 2 = 0. 

Use Newton’s method to calculate jc correct to six dec¬ 
imal places. 


\[ Xx+ fyV-a<V-a- X , 

Hint: Show that this inequality is equivalent to x l + 
alx\ — 2 y/a < 2Va — 2x\, 3x\ — wfa + a/x\ < 
0, and 

(3*i ~ Va)(*l ~ Vfl) < Q 

X\ 

121 If a is a given positive number and ^/a is calculated 
by applying Newton’s method to the equation jc 3 - 
a = 0, show that 


122 Consider a spherical shell 1 ft thick whose volume is 
twice the volume of the hollow space inside it. Use 
Newton’s method to calculate the shell’s outer radius 
to six decimal places of accuracy. 

123 A conical paper cup is 4 in deep and 4 inches in di¬ 
ameter. Its vertex is pushed up inside, as shown in Fig. 
4.51. How far does its tip penetrate the space inside 

f See Additional Problem 4 at the end of Chapter I. 


SECTION 4.7 

125 An economist studying a certain appliance business 
finds that the overhead and wholesale cost involved in 
handling x electric mixers a week is 56 + 24 jc dollars, 
and that each week x = 30 — \p mixers are sold at a 
retail price of p dollars apiece. What retail price should 
she advise the owner to charge in order to earn the 
greatest profit? 

126 (a) Suppose a manufacturer can sell x bicycles per 

year at a price of p = 300 - O.OIjc dollars apiece, 
and that it costs him C(jc) = 60,000 + 15x dollars 
to produce the jc bicycles. For maximum profit, 
what should his production be and what price 
should he charge? 

(b) If the government imposes on the manufacturer a 
tax of $25 for each bicycle and the other features 
of the situation are unchanged, how much of the 
tax should he absorb himself and how much should 
he pass on to his customers if he wishes to con¬ 
tinue making the maximum profit? 

127 If the marginal revenue from producing jc units of a 
certain commodity is 40 — ^jc 2 dollars/unit and the 
marginal cost is 10 4- ^jc 2 dollars/unit, how many 
units should be produced to maximize the profit? 
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INDEFINITE 
INTEGRALS AND 
DIFFERENTIAL 
EQUATIONS 


Our work in the preceding chapters was concerned with the problem of tangents 
as described in Section 2.1 —given a curve, find the slope of its tangent; or equiv¬ 
alently, given a function, find its derivative. 

In addition to launching the full-scale study of derivatives, Newton and Leib¬ 
niz also discovered that many problems in geometry and physics depend on “back¬ 
wards differentiation,” or “antidifferentiation ” This is sometimes called the in¬ 
verse problem of tangents: Given the derivative of a function, find the function 
itself. 

In this chapter we work with the same derivative rules as in Chapter 3. Here, 
however, these rules are read backwards, and lead in particular to the “integra¬ 
tion” of polynomials. Even these relatively simple procedures have some re¬ 
markable applications, which we discuss in Section 5.5. 


5.1 


INTRODUCTION 


As we know, the definition of the derivative f’(x) of a function y = /(x) can be 
stated as follows: 


Ay 

/'(x) = lim -rU 
J A* 


(1) 


It is understood here that Ax is a nonzero change in the independent variable x, 
and that Ay =/(x + Ax) —/(x) is the corresponding change in y. In Section 2.3 
we introduced the equivalent notation 


5.2 

DIFFERENTIALS AND 
TANGENT LINE 
APPROXIMATIONS 


dy_ 

dx 


( 2 ) 


for this derivative, and we emphasized there that (2) is a single symbol and not 
a fraction. However, it is certainly true that (2) looks like a fraction, and in some 
circumstances it even acts like one. The most important example of this is the 
chain rule, 

dy du _ dy dd _ dy 
du dx ydix dx dx ’ 

where the correct formula for the derivative of a composite function is obtained 
by canceling as if the derivatives were fractions. 
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Our present purpose is to give individual meanings to the pieces of (2), namely, 
to dy and dx, in such a way that their quotient is indeed the derivative /'(jc). Our 
reasons for doing this are difficult to explain in advance. Suffice it to say that 
this notational device is a necessaiy prelude to the powerful computational meth¬ 
ods introduced in this chapter—integration by substitution, and the solution of 
certain differential equations by separating the variables. 

We begin by considering the special case in which y is a linear function of x y 

y = mx + b. (3) 



Let P = (jc, y) be a point on this line (Fig. 5.1). If jc is given an increment Ax 
and if the corresponding increment in y is Ay, then the slope of the line (3) is 

change in y Ay 

m = ———:-= —, 

change in jc Ax 


so 


Ay = m Ax. (4) 

When working in this way with increments along a straight line, we denote these 
increments by the symbols dx and dy , so that by definition 


dx = Ax and dy = Ay, 


and we call them differentials. With this notation, (4) becomes 

dy = m dx. (5) 


Now consider an arbitrary function 

y = /«, 


(6) 


and assume that this function has a derivative at jc. If P is the corresponding point 
on the graph (Fig. 5.2), then the tangent at P is the straight line PR with slope 
m = /'(jc). By the differentials dx and dy arising from (6), we mean the incre¬ 
ments in the variables jc and y that are associated with this tangent line. To state 
this more precisely, the differential dx of the independent variable jc is any in¬ 
crement Ajc in jc, as shown, 

dx= Ax\ (7) 


and the differential dy of the dependent variable y is the corresponding increment 
in y along the tangent line , namely, 

dy =f'(x) dx. (8) 



Thus, as Fig. 5.2 shows, if dx = Ax = PQ is any change in jc, then Ay = QS and 
dy = QR are the corresponding changes in y along the curve and along the tan¬ 
gent line, respectively. We observe that (8) reduces to (5) when /(jc) = mx + b. 

If dx 0, then we can divide (8) by it and obtain 

^ =/'(*). (9) 

dx 

Up to this point equation (9) has been trivially true because its two sides have 
been merely two different ways of writing the same thing, namely, the derivative 
of the function y = /(jc). The new feature of (9) in our present discussion is that 
now the Leibniz symbol on the left not only looks like a fraction but is a frac¬ 
tion, 
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dy _ differential of y 
dx differential of x' 


The Leibniz notation for derivatives makes it particularly easy to produce the dif¬ 
ferential formula (8) when the function y = f(x) is given, by computing the de¬ 
rivative and multiplying by dx. The calculation in the first column gives the gen¬ 
eral pattern, 

y=f(x) y = x 2 





dy = f'{x) dx dy = 2x dx, 


and the calculation in the second column shows how it works for the special case 
y = x 2 . A little experience with the use of this notation makes us realize that we 
can proceed directly from y = x 2 to the formula dy = 2x dx without bothering to 
write the intermediate step dy/dx = 2x. We emphasize that a differential on the 
left side of an equation requires that the right side must also contain a differen¬ 
tial. Thus, we never write dy = 2x, but instead dy = 2x dx. 

It is often convenient to write df(x) instead of dy. 


Example 1 As illustrations of this remark we have 

d(x 2 ) = 2x dx, d(5x 4 ) = 20x 3 dx, d j = dx = — 

d(x 4 + lx 2 + 6) = (4x 3 + 14 x) dx, 


and 


d(x sin x) = (x cos x + sin x) dx. 


Our familiar formulas for calculating derivatives can now be given useful equiv¬ 
alent formulations in the notation of differentials. Suppose y = f(u), so that dy = 
f\u) du. Then for various choices of the function f(u) we get the formulas 

d{u n ) = nu n ~ ] du, d (sin u) = cos u du, (10) 

and so on. When the differential notation is used in this way, it allows us to write 

derivative formulas without any need to mention the independent variable. In this 
spirit, if we multiply the product and quotient rules by dx, then they take the form 

d(uv) = u dv + v du and d (^j = V ^ U ^ . 

Further, if we have y = f(u ), and u in turn is a function of another variable x, 

say u = g(jt), then we can substitute du = g'M dx in the formula dy = f'(u) du 
and obtain 


dy =f\u)g\x) dx. 

This is the differential version of the chain rule 

dy _ dy du 
dx du dx 


( 11 ) 


( 12 ) 


that we mentioned earlier. In both versions (11) and (12), the chain rule appears 
to be the result of simple algebraic manipulations of differentials. It is this per- 
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feet key-in-the-lock fit with the chain rule that makes the differential notation 
such an indispensable tool in calculus, as we shall see in the next section and 
thereafter. 


Example 2 In the following specific applications of formulas (10) we see the 
differential chain rule in action: 


and 


d(x 2 + l) 4 = 4(x 2 + l) 3 d(x 2 + 1) 
= 4(x 2 + 1 ) 3 • 2 x dx 
= Sx(x 2 + l) 3 dx. 


d(s\n 4x 3 ) = cos 4* 3 d(4x 3 ) 

= cos 4x 3 • 12x 2 dx 


= \2x 2 cos 4x 3 dx. 



Most people who use calculus routinely as a tool in their work think of dif¬ 
ferentials as very small quantities, even though the definitions contain no such 
requirement. There are several good reasons for this. One such reason can be 
seen in Fig. 5.2, which shows that the tangent to a curve hugs the curve closely 
near the point of tangency. This means that when dx is small, the curve is virtu¬ 
ally indistinguishable from its tangent, and therefore the differential dy ; which is 
comparatively easy to calculate, provides a very good approximation to the ex¬ 
act increment Ay, which may be harder to calculate. We express this as a prac¬ 
tical procedure in the following way (Fig. 5.3): 

Whenf(x) and f\x) have been found for a particular value of x so that dy =f\x) dx 
is known, then we can use the formula 

f(x + dx) =f(x) + dy (13) 

to compute approximate values of the function at nearby values of x. 

Formula (13) is called a tangent line approximation, or sometimes a linear ap¬ 
proximation. 

Example 3 Use differentials to find an approximate value for 3^28. 


Solution The evaluation of ^/x is easy for jc = 27, so we take y = f(x) = 'Vx 
and dx = 1. Since dy = jx" 2/3 dx , we have 


dy = • 1 


1 = 1 
3 • 9 ” 27’ 


and therefore 

^28 = /(28) 3/(27) + dy 

= 3 + jy s 3.037. 

The actual value of ''^28 (by calculator) is 3.036588972.... Our approximation 
by differentials is therefore accurate to three decimal places even when dx = 1. 
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Of course, the calculation in this example has little practical value, because 
calculators can easily find cube roots to great accuracy. The real purpose of the 
example is to emphasize that differentials provide good linear approximations to 
the increments of more complicated functions. We will understand the great im¬ 
portance of this idea more clearly in a later chapter, when we work with differ¬ 
entials of functions of several variables. 

Perhaps a few more examples would not be amiss. 

Example 4 Calculate the actual and approximate volumes of a 4.01-ft cube. 

Solution If x is an edge, then the volume is V{x) = jc 3 . The actual volume is 
V(4.01) = (4.01) 3 = 64.481201 ft 3 . Now dV = 3x 2 dx, and by putting x = 4 and 
dx = 0.01 we have the approximate volume 

V(4.01) = V(4) + dV = 4 3 + 3-4 2 (0.01) = 64.48 ft 3 , 

which is not bad. 


Example 5 If the earth’s radius were increased by 1 ft, approximately how much 
would its surface area increase? 

Solution The surface area of a sphere of radius r is A = Airr 2 , and the earth’s 
radius is about 4000 mi.* * If we approximate the actual increment A A of the sur¬ 
face area by the differential dA evaluated at r — 4000 with dr = 1 ft, we get 

A A = dA = 8777 * dr = 87t( 4000) ■ ^ - mi 2 , 

5280 

since 1 ft = 1/5280 mi. By doing the arithmetic we find that A A = 19.04 mi 2 . 
This is close to the area of Manhattan Island, which is about 22 mi 2 . 


Example 6 If the earth’s radius were to shrink by 1 in, approximately how much 
would its volume decrease? 

Solution The volume of a sphere of radius r is V = f 7rr 3 , so dV = 4irr 2 dr and 

AV.^-WMOO^-JT^) 

= -3173.32 mi 3 . 

The minus sign appears here because r decreases, so the answer is that the vol¬ 
ume decreases about 3173.32 mi 3 . 


Remark I One of our standard notations for the second derivative of a function 
y = f(x) is d 2 y/dx 2 . In view of our work in this section it is desirable to point out 
that the numerator, d 2 y, and the denominator, dx 2 , have absolutely no meaning 


*In this example and the next we ignore minor irregularities like mountain ranges and deep ocean 
trenches, and the ellipsoidal shape of the earth, and assume that the earth is a perfect sphere of ra¬ 
dius 4000 mi. 
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by themselves and will never be given such a meaning. The expression d 2 y/dx 2 
is an inseparable symbol representing the second derivative and is written this 
way for reasons explained in the second paragraph of Section 3.6. 

Remark 2 The Leibnizian myths about curves and differentials. The modem 
concept of limit did not arise until the early nineteenth century, so no definition 
of the derivative resembling equation (1) was possible for Leibniz or his imme¬ 
diate successors. What were the early ideas about the nature of derivatives and 
differentials? 

Most of the fruitful mathematical thinking of the period was based on one form 
or another of the notion of the “infinitely small.” Leibniz’s attitude toward the 
equation 

# = ' im 4 1 

ax Ax—>o Ax 

would have been essentially as follows: As Ax approaches zero, both Ay and Ax 
become “infinitely small” or “infinitesimal” together. It is therefore reasonable 
to think of the limit dy/dx as the quotient of two infinitesimal quantities denoted 
by dy and dx and called “differentials.” In Leibniz’s imagination, an infinitesimal 
was a special kind of number that is not zero and yet is smaller than any other 
number. 

There was also a geometric version of these ideas, in which a curve was thought 
of as consisting of an infinite number of infinitely small straight line segments 
(Fig. 5.4). A tangent was a line containing one of these tiny segments. To find 
the slope of the tangent at a point (x, y), we move an infinitesimal distance along 
the curve to a point (x + dx, y + dy) and observe that the slope of the infinites¬ 
imal segment joining these two points is the quotient of two infinitesimals, dy/dx. 

We have suggested that Leibniz introduced his differentials dx and dy to de¬ 
note corresponding infinitesimal changes in the variables x and y. To get an idea 
of how these differentials were used, let us suppose that the variables x and y are 
related by the equation 

y = x 2 . (14) 

Leibniz would then replace x and y by x + dx and y + dy to obtain 
y + dy = (x + dx) 2 = x 2 + 2x dx + dx 2 , 
which in view of (14) yields 

dy = 2x dx + dx 2 . (15) 

At this stage Leibniz would simply discard the term dx 2 and arrive at our famil¬ 
iar formula 



dy = 2x dx, (16) 

which after division by dx takes its fractional form 

= 2x. (17) 

dx 

He would justify this step by claiming that any square of an infinitely small num¬ 
ber is “infinitely infinitely small,” or “an infinitesimal of higher order,” and there¬ 
fore entirely negligible. For Leibniz the derivative was a genuine quotient, a quo- 
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tient of infinitesimals as calculated in formula (17) and illustrated in Fig. 5.4, 
and his form of calculus came to be widely known as “infinitesimal calculus.” 

It may be instructive to compare this Leibnizian use of infinitesimals with the 
modem approach based on limits. Thus, with the function y = x 2 , if Ax is a given 
nonzero change in x and Ay is the corresponding change in y, then by essentially 
the same calculation as above we obtain 


Ay = 2x Ax + Ax 2 . 


Instead of discarding the term Ax 2 as Leibniz would have done, in the modem 
approach we divide through by Ax to obtain the quotient Ay/Ax and then define 
the derivative to be the limit of this quotient as Ax approaches zero, 


= lim -7^- = lim (2x + Ax) = 2x. 
dx Ax—>0 Ax Ax—>0 


This produces formula (17) in a way that replaces the use of infinitesimals by a 
limit calculation. 

The ideas of Leibniz worked with almost miraculous ease and effectiveness, 
and dominated the historical development of calculus and the physical sciences 
for almost 150 years. However, these ideas were flawed by the fact that infini¬ 
tesimals in the sense described above clearly do not exist, for there is no such 
thing as a positive number that is smaller than all other positive numbers. 
Throughout this period of more than a century the enormous success of calculus 
as a problem-solving tool was obvious to all, and yet no one was able to give a 
logically acceptable explanation of what calculus is. The fog that obscured its 
fundamental concepts was at last dispelled in the early nineteenth century by the 
classical theory of limits. Fortunately the early mathematicians of the modem pe¬ 
riod—Leibniz himself, the Bemoullis, Euler, Lagrange—had sound intuitive 
feelings for what was reasonable and correct in the problems they studied. Even 
though their arguments often lacked rigor from the modem point of view, these 
pioneers rarely went astray in their conclusions. 

If a myth is a veiled, condensed, symbolic expression of a more complicated 
and perhaps partially hidden truth, then mathematics has its myths just as his¬ 
tory and literature do. Leibniz’s differentials were banished from “official calcu¬ 
lus” by the theory of limits, but nevertheless they remain a living part of the 
mythology of the subject.* 


Tt should be added that a logically acceptable concept of infinitesimals was constructed in the 1960s 
by the American mathematician Abraham Robinson [see his book Non-Standard Analysis (North- 
Holland Publishing Co., 1966), especially Sections LI and 10.1]. While Robinson’s achievement is 
of great interest to logicians and mathematicians, his ideas depend on mathematical logic and ab¬ 
stract set theory and are not likely to have much influence on the teaching or learning of calculus. 


PROBLEMS 

Calculate each of the following. 

1 d(lx 9 - 3x 5 + 34). 2 d(V 1 - t 2 ). 

3 


5 d(V 4x~- x 2 ). 

7 d(3x 2/3 + 10x I/5 — 17x). 


6 

8 


d 

d 


~ (1 - 2x) 3 ' 
3 - 4x 
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9 

11 


12 


13 


14 


15 


d(x 2 \^3x + 2). 10 d(\/x + Vx + 1). 

Use differentials to find dy/dx, given that 

y = —\ -- and u = (jc 3 + 2) 5 . 


Use differentials to find dy/dx, given that 


y = 


u + 1 
u — 1 ’ 


i? + 6 v — 2 
Vv- 1 


v = X 4 + 5* 2 - 3. 


Consider a circle of radius r and area A = irr 2 . If the ra¬ 
dius is increased by a small amount Ar, find the incre¬ 
ment A A and the differential dA. Draw a sketch, and ob¬ 
serve that A A is the area of the thin circular ring between 
two concentric circles. Use the fact that the inner circle 
has circumference 2irr to understand geometrically why 
dA is a good approximation to A A. 

A sphere of radius r has volume V = jnr 3 and surface 
area A = 47rr 2 . If the radius is increased by a small 
amount A r, find AV and dV. In the spirit of Problem 13, 
understand geometrically why d Vis a good approxima¬ 
tion to AV. 

A coat of paint of thickness 0.02 in is applied to the faces 
of a cube whose edge is 10 in, thereby producing a 


slightly larger cube. Use differentials to find approximately 
the number of cubic inches of paint used. Also find the exact 
amount used by computing volumes before and after paint¬ 
ing. 

•ml In Problems 16-23, use differentials to find approximate val¬ 
ues for the given quantities. In each case use a calculator to 
find the value correct to six decimal places, and compare. 

16 N/83. 17 65 2/3 . 

18 VT02. i9 803/4. 

20 VTT9. 21 N/2l8. 

22 sin 59°. (Remember: First translate into radians.) 

23 cos 32°. 

Hill In Problems 24-26, find the approximate amount by which 
the radius of the earth would have to be increased to produce 
additional surface area the size of each of the given states. 

24 Rhode Island (= 1215 mi 2 ). 

25 Colorado (= 104,250 mi 2 ). 

26 Alaska (= 580,400 mi 2 ). 

27 Suppose a red ribbon is wrapped tightly around the earth 
at the equator. Approximately how much must the rib¬ 
bon be lengthened if it is to be strung on poles 20 ft above 
the ground all the way around the earth? 


5.3 

INDEFINITE INTEGRALS. 
INTEGRATION BY 
SUBSTITUTION 


If y = F(x) is a function whose derivative is known, say, for example, 

-f F(x) = 2x, (l) 

dx 

can we discover what the function F(x) is? It doesn’t take much imagination to 
write down one function with this property, namely, F(x) = x 2 . Moreover, adding 
a constant term doesn’t change the derivative, so each of the functions 

A 2 +1, x 2 V3, * 2 + 577, 


and more generally 

jc 2 + c 

where c is any constant, also has the property (1). Are there any others? The an¬ 
swer is no. 

The justification for this answer lies in the following principle: 

If F(x) and G(x) are two functions having the same derivative f{x) on a certain inter¬ 
val , then G(x) differs from F(x) by a constant , that is, there exists a constant c with the 
property that 


G(x) = F(x) + c 

for all x in the interval. 

To see why this statement is true, we notice that the derivative of the difference 
G(x) — F(x) is zero on the interval. 
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Y x [GW - FCX)] = / GW - / F W =/(*) - /« = 0. 

It now follows that this difference itself must have a constant value c, so 

G(x) - F(x) = c or G(x ) = FU) + c, 

which is what we wanted to establish.* 

This principle allows us to conclude that every solution of equation (1) must 
have the form x 2 + c for some constant c. 

The problem just discussed involved finding an unknown function whose de¬ 
rivative is known. If/(;c) is given, then a function F(x) such that 

-f F{x)=f{x) (2) 

dx 

is called an antiderivative of /(; c), and the process of finding F(x) from /(jc) is 
antidifferentiation . We have seen that /(; c) does not have a single, uniquely de¬ 
termined antiderivative, but if we can find one antiderivative F(jc), then all oth¬ 
ers have the form 


F(x) + c 

for various values of the constant c. For example, yjc 3 is one antiderivative of jc 2 , 
and the formula 

I* 3 + c 

comprises all possible antiderivatives of x 2 . 

For historical reasons, an antiderivative of /(jc) is usually called an integral of 
/(;c), and antidifferentiation is called integration. The standard notation for an in¬ 
tegral of /(jc) is 

J/(*)<&, (3) 

which is read “the integral of /(jc) dxF The equation 

J/M dx = F(x) 

is therefore completely equivalent to (2). The function/(jc) is called the integrand. 
The “elongated S” symbol in (3) is called the integral sign, and was introduced 
by Leibniz in the earliest days of calculus. Its origin will become clear in the 
next chapter. 

To illustrate a point of usage, we remark that the formulas 

Jx 2 dx = yjc 3 and jx 2 dx = y;c 3 + c (4) 

are both correct, but the first provides one integral while the second provides all 
possible integrals. For this reason the integral (3) is usually called the indefinite 


‘“The crucial step in this reasoning can be expressed in several different ways: for instance, if the rate 
of change of a function is always zero, then the function cannot change and therefore must be con¬ 
stant; or equivalently, if every tangent line to a graph is horizontal, then the graph can neither rise 
nor fall and therefore must be a horizontal straight line. The theoretical basis for this inference is 
called the Mean Value Theorem and is examined more closely in Section 2.6. 
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integral, in contrast to the definite integrals discussed in the next chapter. The 
constant c in the second formula of (4) is called the constant of integration and 
is often referred to as an “arbitrary” constant. Our previous discussion shows that 
to find all integrals of a given function /(;c), it suffices to find one integral by any 
method that works—calculation, intelligent guessing, or asking a knowledgeable 
friend—and then to add an arbitrary constant at the end. 

Every derivative that we have ever calculated can be reversed and rewritten as 
an integral. In particular, the power rule 

x n = nx n ~ ] becomes lnx n ~' dx = x n . 
ax J 

For our present purposes the formula 

d x n+l 

-= x n 

dx n + 1 


is a more convenient version of the power rule. This gives the form of the inte¬ 
gral that we shall memorize and use, 



jt ” +1 

n+ r 


n =£ — 1, 


(5) 


In words: To integrate a power, add 1 to the exponent and divide by the new ex¬ 
ponent. 


Example 1 


The following integrals are all special cases of (5): 



r 572 


dx = 


X 573 _ 1 

573 573 


v 573 



Jvx dx = j* l/2 dx = 


1 

4jc 4 ’ 


2 

3 


x 


3/2 


The reader will notice that when n = —1, the right side of (5) has zero de¬ 
nominator and is therefore meaningless. The treatment of this case, that is, the 
determination of the integral 

f 

J X ’ 

is one of the most important and fascinating parts of calculus, with a wide vari¬ 
ety of applications. We return to this problem in Chapter 8. 

The following additional integration rules are also slightly disguised versions 
of familiar facts about derivatives: 

jcf(x ) dx = c jf(x) dx (6) 


and 


j LfW + gWl dx = jf(x) dx + jg(x) dx. 


(7) 
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The first says that a constant factor can be moved from one side of the integral 
sign to the other. It is important to understand that this does not apply to vari¬ 
able factors, as can be seen from the fact that 

Jx 2 dx =£ x Jx dx, 

since the left and right sides are, respectively, \x 3 and x • jx 2 = yx 3 . Formula (7) 
says that the integral of a sum is the sum of the separate integrals. This applies 
to any finite number of terms. 

To verify (6) and (7), it is enough to notice that they are equivalent to the dif¬ 
ferentiation formulas 


-j- cF(x ) = c -j- F(x) 
dx dx 


and 


-j- [F(x) + G(x)] = -j- F(x) + -j- G(x), 
dx dx dx 

where (< d/dx)F(x ) = f(x) and (d/dx)G(x) = g(x). 

Example 2 When rules (5), (6), and (7) are combined, they enable us to inte¬ 
grate any polynomial. For instance, 

| (3x 4 + 6x 2 ) dx — 3 jx 4 dx + 6 Jx 2 dx 

= };c 5 + 2x 3 + c 

and 

J(5—2x 5 + 3X 11 ) dx = 5 J dx — 2 J x 5 dx + 3 J x l 1 dx 
= 5x - \x 6 + \x n + c. 

Observe that f dx = f 1 dx = x. In each of these calculations an arbitrary con¬ 
stant is added at the end so that all possible integrals are included. 


Example 3 We can also integrate many nonpolynomials that are expressible as 
linear combinations of powers: 

J ^/x 2 dx = J > 3 dx = jx 5/3 + c; 


2x 3 — x 2 — 2 


5jc" 3 - 2x~ {f3 


dx = J(2x — l — 2x 2 ) dx 
2 


Vx 


dx 


= X 2 - X + — + c: 


= J(5x 1/6 - 2x 5/6 ) dx 


= 6jc 5/6 — 1 2jc 1/6 + c. 
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The formula 


\u n du = 


n + r 


n =£ -1, 


(8) 


appears to be a trivial variation of (5) in which the letter jc is replaced by u. How¬ 
ever, let us think of u as some function /(jc) of jc and take du seriously as the dif¬ 
ferential of w, so that 



U 

=/to 



and 






du = 

■-fix) dx. 



Then (8) becomes 






jifwyfix) dx = 

[/(*)]" +l 

n + 1 ’ 

n * -1, 

(9) 


which is a far-reaching generalization of (5). 


Example 4 In practice, we usually exploit this idea by explicitly changing the 
variable in order to reduce a given complicated integral to an integral of the sim¬ 
ple form (8). For instance, in the case of 

J(3jc 2 — 1) 1/3 4jc dx, 

we notice that the differential of the expression in parentheses is 6jc d: c, which 
differs from 4jc dx only by a constant factor, so we write 

u = 3jc 2 — 1 , 
du = 6jc dx, 
x dx = {du. 

These equations constitute a small “dictionary” that enables us to translate the 
given integral from the jc-notation to the w-notation, as follows: 

j(3x 2 - 1) 1/3 4jc dx = Jm 1/3 -4 ' \ du = j ju l/3 du 

= y • |m 4/3 + c = yw 4/3 + c; 

and by returning to the jc-notation we obtain our result, 

J(3x 2 - 1) 1/3 4jc dx = \(3x 2 - 1) 4/3 + c. 


This method is called integration by substitution, because it depends on a sub¬ 
stitution or change of variable to simplify the problem. As formula (9) suggests, 
the success of the method depends on having an integral in which one part of 
the integrand is essentially the derivative of another part—where “essentially” 
means “except for a constant factor” 

Remark 1 The integral in Example 4 was deliberately constructed so that the 
method of substitution works. To emphasize this point, we observe that the sim¬ 
ilar integral 
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j(3x 2 — 1) 1/3 dx (10) 

seems to be “simpler” than the one in Example 4 but is actually much more dif¬ 
ficult because the integrand lacks the important factor x. If we try the substitu¬ 
tion that worked before, we get 

j(3x 2 - iy /3 dx = |« l/3 -|p 

and there is no practical way to get rid of the ;t in the denominator. In a later 
chapter we will study deeper methods that succeed in this type of problem, but 
just now there is nothing further we can do. 

Remark 2 Many students are tempted to try to integrate (10) by writing 

j(3x 2 - 1) 1/3 dx = (3x2 4 ~ 3 1)4Q = j (3x 2 - 1) 4/3 + c, (11) 

which is incorrect. To understand why this is incorrect, recall that in calculating 
integrals we can always check our work quite easily, for if we have a suspected 
integral of a function f(x), we can test it by computing its derivative to see if the 
result really equals f(x). It is clear that (11) fails this test, because the derivative 
of the right side is 

| • j(3x 2 - 1) 1/3 • 6x = ( 3>x 2 — 1) 1/3 6x, 
which is certainly not the integrand of (10). 

Finally, our derivative formulas for the sine and cosine yield the following im¬ 
portant integration formulas: 

Jcos u du = sin u + c (12) 

and 

Jsin u du = —cos u + c. (13) 

These are tools with innumerable applications, ranging from the theory of prob¬ 
ability to the propagation of sound waves. 

Example 5 (a) To integrate 

Jcos 3x dx , 

we look at (12) and see that we must put u = 3x so that du = 3 dx and dx = y du. 
We then write 


Jcos 2>x dx = Jcos u • } du = \ Jcos u 


du 


= t sin u + c = t sin 3>x + c. 


(b) To integrate 


x sin (1 — x 2 ) dx , 
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we put u = 


— x 2 so that du = - 
Jx sin (1 — x 2 ) dx = 


-2x dx and x dx = —y du , and then use (13): 
jsin u • ( —y du) = —y Jsin u du 
y cos u + c = y cos (1 — Jt 1 2 3 4 5 ) + c. 


Remark 3 It is clear from Examples 4 and 5 that the notation of differentials is 
extremely useful for calculating indefinite integrals by the method of substitu¬ 
tion. This method strikes many students as a kind of magic. To understand why 
it is legitimate (magic is not allowed in mathematics!), let us consider the form 
of integral to which the method applies: 

jf[g(x)]g'Wdx. (14) 

What we have done above is put u = g(;c) and then write du = g'(jc) dx. The in¬ 
tegral (14) now takes the new form 

//[gtolg'to dx = jf(u) du. 

If we can integrate this, so that 

Jf(u) du = F(u) + c 


or 

F\u) =/(«), 

then since u = g(;c) we want to be able to integrate (14) by writing 

//[gWlg'M dx = jf(u) du = F(u) + c = E[g(_*r)] + c. (15) 

All that is needed to justify this procedure is to observe that (15) is a correct re¬ 
sult, because 

^ F[g(x)] - E'lgMlg'Or) =f[g(x)]g\x) 

by the chain rule. It is therefore the chain rule that allows us to work with the 
symbols dx and du after the integral signs as if they were differentials. This 
smooth compatibility with the chain rule is the main reason for the extraordinary 
value of the differential notation in calculus. 

Finally, it may be helpful to students if we give a formal outline of the process 
of integration by substitution : 

1 Make a careful choice of u } say u = g(.t). 

2 Compute du = g'(x) dx. 

3 Substitute g(.x) = u and g'(x) dx = du in the given integral. At this point the 
integral must be wholly in terms of u, and no jc’s should be present. If this is 
not the case, try another choice of u. 

4 Calculate the integral obtained in Step 3. 

5 Replace u by g(;c) to express the final result wholly in terms of x. 
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PROBLEMS 


In Problems 1-36, compute the integrals. Be sure to include 
the constant of integration in each answer. 


1 /(jc + 1) dx. 


f(x 2 + x 3 + jc 4 ) dx. 
f dx 

j vT 

/ x 3 ' 4 dx. 
r dx 

i v/;* 


1Tx 


13 


i 


3 + 2x 


dx. 


■ dx. 


f(Vx - I4x 5/2 +A 


V~x 

15 /jc 2 (1 + jc 3 ) dx. 

17 j (7 + x) dx. 

19 /(4jc 5 + 6jc — 5) dx. 

» r 

23 

24 

26 

28 /10 dx. 

30 f(l2x 5 - 3x~ 2 ) dx. 

32 [^±^-dx. 

J Vi 

34 Jjc 4 (5 — 1 2jc 55 ) d.x. 
36 Vi dx. 


12 


14 


18 


22 


dx. 


f(3x - 2) dx. 
fx 7 dx. 


6 J (3jc 2 + 2x + \) dx. 

8 fx 2 (x 2 1) dx. 

10 


|(600jc — 6jc 5 ) dx. 
J(2x — 7) dx. 

Jx 2 Vx dx. 


16 Jjc 7/5 dx. 


a* 


20 /(I — 2x 2 — 3jc 3 ) dx. 

6 


id 


■ + 2x 3/2 - 5 \dx. 


f(jx l/2 - 5) dx. 

25 /(?* 



J lvi 


29 J(4jc 3 - 8x + 17) dx. 
31 Jx 1/3 (x + 2) 2 dx. 

33 jV^(2-3x 2 ) 2 dx. 

35 J100x 499 dx. 


In Problems 37-44, compute the integrals by using the given 
substitutions. 

37 fV 3 + 4jc dx, u = 3 + Ax. 

38 /V3^T7 jc c/x, m = 3jc 2 T 1 . 

39 /(ITTsF- 

40 /jc 2 (1 — 4jc 3 ) 1/5 m = 1 — 4jc 3 . 


41 

42 

43 


f x dx 
' V5 - 4X 2 ’ 
f x 2/3 (2 - x 5/3 

f ( 1 + Vr) 174 

J Vx 


u = 5 - 4* 2 . 

) -5 u = 2 — x 
■ dx, u = 1 + Vx 


5/3 


44 


(2 + 3jc) dx 
Vl + 4x + 3x 2 ’ 


m = 1 + 4x + 3xr 2 . 


In Problems 45-58, compute the integrals by using substitu¬ 
tions of your own devising. 


j Vx 2 + x: 4 dx. 

46 

f dx 

3 (x — 7) 7 

f dx 

48 

r 4 dx 

'(7-xy 

' V(x - l) 3 ' 

fxV 2 — x 2 dx. 

50 

J24x(4x 2 - l) 9 dx. 

f 3x 2 dx 

52 

f 40 dx 

Vx 3 - 5 

I (4x + 5) 6 ' 

J(l(k+ 10) 10 dV. 

54 

Jx 6 (x 7 + 8) 9 dx. 

JxV(3x 2 + 4) 3 dx. 

56 

fx 2 Vx 3 + 1 dx. 

f (18x 2 - 2) dx 

58 

f (4 + 12jc) dx 

' V3x 3 - x + 2 

<N 

1 

1 


59 Integrate /(jc 3 ) 4 • 3x 2 ci c as /w 4 du and also as / 3jc 14 dx, 
and compare your results. 

60 Integrate /(jc 3 + l) 2 • 3jc 2 dx as fu 2 du and also by mul¬ 
tiplying out, and compare your results. 

61 Find the integral of 3jc 2 that has the value 10 when 
jc = 2. Hint: Since every integral of 3jc 2 has the form 
y = jc 3 + c, find the value of c that makes y = 10 when 
jc = 2. 

62 Find the integral F(jc) of Vjc with the property that 


F(9) = 9. 

63 Find the following integrals: 

(a) /cos 2 jc dx, 

(b) /sin 5jc dx, 

(c) /(4 cos 2jc + 15 sin 5jc) dx, 

(d) /(sin 2jc + cos 5jc) dx. 

64 Construct an example to show that 


ff(x)g(x) dx = ^ff(x) dxj(fg(x) dxj 

is not a valid integration formula. 

65 (a) Show that 

J sin 2 x dx — j sin 3 jc 

is not true. 

(b) Calculate 

J sin 2 jc dx 

by using the half-angle formula sin 2 jc = y(l - 
cos 2 jc). 

(c) Use differentiation to verify the correctness of your 
answer in (b). 
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66 Find the following integrals: 

(a) Jsin(2 - jc) dx\ 

(b) fx cos x 2 dx\ 

(c) fx 3 sin x 4 dx. 

67 Find the following integrals: 

(a) /sin 4 jc cos jc dx\ 

(b) Jcos^ x sin x dx\ 

(c) /cos jc cos(sin jc) dx. 


68 Show that both of the following integrals are correct: 
dx 1 , f dx x 


\— 
J (\ - 


(1 - jc) 2 1 - jc 


and 


(1 -x) 2 


Explain. 

69 Calculate /sin jc cos jc dx 

(a) by using the substitution u = sin jc; 

(b) by using u = cos jc. 

Reconcile your answers in (a) and (b). 


5 


.4 

DIFFERENTIAL 
EQUATIONS. 
SEPARATION OF 
VARIABLES 


We have seen that the equation 


is equivalent to 


jf(x)dx=F(x) 


dx 


F(x) = f(x). 


( 1 ) 


( 2 ) 


This statement can be interpreted in two ways. 

(a) In accordance with the explanation in Section 5.3, we can think of the sym¬ 
bol 

J ■ ■ ■ dx 

as operating on the function fix) to produce its integral, or antiderivative, Fix). 
From this point of view the integral sign and the dx go together as parts of a sin¬ 
gle symbol; the integral sign specifies the operation of integration, and the only 
role of the dx is to tell us that jc is the “variable of integration.” 

(b) A second interpretation is suggested by our treatment of Examples 4 and 
5 in Section 5.3. Let us write (2) in the form 

dFix) =fix) dx , 

so that fix) dx is explicitly seen to be the differential of Fix). If we now take dx 
in (1) at its face value, as the differential of jc, then the integral sign in (1) acts 
on the differential of a function F(jc), namely, on /(jc) d: c, and produces the func¬ 
tion itself. Thus, the symbol / for integration (without considering the dx as part 
of the symbol) stands for the operation which is the inverse of the operation de¬ 
noted by the symbol d. 

We shall use both interpretations. However, the second is particularly conve¬ 
nient, not only for the actual procedures used in computing integrals, but also for 
solving certain simple differential equations. 

A differential equation is an equation involving an unknown function and one 
or more of its derivatives. The order of such an equation is the order of the high¬ 
est derivative that occurs in it. 

In the process of integration we have been solving first-order differential equa¬ 
tions of the form 


* =/w ’ 


where /(jc) is a given function. Thus, the equation 
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= 3x 2 is equivalent to dy = 3x 2 dx, 
dx 


(3) 


and we integrate to obtain the solution, 

fdy = J3x 2 dx or y = x 3 + c. (4) 

Notice that a constant of integration arises on both sides here, 

y + c i = x 3 + c 2 » 

but this can be written as y = x 3 + (c 2 - cj), and no generality is lost by re¬ 
placing C 2 ~ c\ by c. Accordingly, it suffices to add a constant of integration to 
one side only, as we have done in (4). 

We can also handle more complicated differential equations. Let us find y if 


If we set aside the obvious trivial solution y = 0, this can be written as 


(5) 


dy_ _ 


2x dx. 


Integration now yields 


— = x 2 + c 

y 


or 


y = 


x 2 + c* 


( 6 ) 


This is called the general solution of (5), and different choices of c give differ¬ 
ent particular solutions . 

We were able to solve equation (5) by the method of separation of variables , 
that is, by isolating the y’s from the jc’s and integrating. In general, if a first- 
order differential equation can be written in the form 

g(y) dy =f(x) dx, 

with its variables separated, and if we can carry out the integrations, then we 
have the solution 


fg(y) dy = jf(x) 


dx + c. 


(7) 


It should be noted that only in very special cases can the variables be separated 
in this way. For instance, the differential equation 


dy. 

dx 


x + y 
x-y 


( 8 ) 


cannot be solved by this method. 

Each of the solutions (4) and (6) of equations (3) and (5) consists of a family 
of curves corresponding to various values of the constant c. These families are 
shown in Figs. 5.5 and 5.6. The arbitrary constant that appears in the general so¬ 
lution of a first-order equation is given a specific numerical value by prescrib- 
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Figure 5.5 y = x 3 + c 
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ing, as an initial condition, the value of the unknown function y = y(x) at a sin¬ 
gle value of jc, say y = yo when jc = Jto. In geometric language, an initial condi¬ 
tion means that the solution curve is required to pass through a specific point in 
the plane. Thus, in Fig. 5.6 the upper and lower solid curves correspond to the 
initial conditions 

y = 1 when x = 0 and y = — 1 when x = 0, 

respectively. We shall see in the next section that this terminology is particularly 
suitable for mechanical problems, where time is the independent variable and the 
initial positions or initial velocities of moving bodies are specified. 

In the problems just discussed, equation (7) was easily solved for y to yield 
the solution of the given differential equation as a family of functions. It is of¬ 
ten convenient not to press this point, and to accept a family of equations as the 
general solution, without demanding explicitly displayed functions. 

We illustrate by finding the most general curve whose normal at each point 
passes through the origin O, and also the particular curve with this property 
through the point (2, 3). The normal OP has slope y/x (see Fig. 5.7), and the 
slope of the tangent is the negative reciprocal of this, so our differential equation 
is 

(9) 

ax y 

\ Separating variables gives y dy = —x dx , and by integrating we get 

\ 

\ l v 2 -_L r 2 i r 

_ 2 y ~ 2 x ^ c - 

If we put r 2 = 2c, our general solution of (9) takes the neater form 

x 2 + y 2 = r 2 . 

This is the family of all circles with center at the origin, as the reader has prob¬ 
ably foreseen. By setting jc = 2 and y = 3, we find that r 2 = 13, so 

x 2 y 2 = 13 

is the particular solution of (9) passing through the point (2, 3). It is clearly more 
reasonable to leave this solution as it is than to insist that it be solved for y. 

Remark 1 By rights, differential equations should perhaps be called derivative 
equations. However, as we saw in Section 5.2, in the early days of calculus dif¬ 
ferentials were the primary concepts and derivatives were secondary, so the term 
arose in a natural way. In any case, it has been in standard use for hundreds of 
years and no one dreams of changing it now. 

Remark 2 The mathematical description of a physical (or biological or chemi¬ 
cal) process is usually given in terms of functions that show how the quantities 
involved change as time goes on. When we know such a function, we can find 
its rate of change by calculating the derivative. Often, however, we are faced with 
the reverse problem of finding an unknown function from given information about 
its rate of change. This information is usually expressed in the form of an equa¬ 
tion involving derivatives of the unknown function. These differential equations 
arise so frequently in scientific problems that their study constitutes one of the 
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main branches of mathematics. We continue with some important applications of 
this subject in the next section, and return to it from time to time throughout the 
rest of our work. 


PROBLEMS 


Find the general solution of each of the following differential 
equations. 

dy 


2 -f- = (3 jc + l) 3 . 
dx 


1 ^- = 6x 2 + 4x - 5. 

dx 

3 = 24x 3 + 1 8 jc 2 - 8* + 3. 4 ^-= lV\. 

dx dx 

dy^ = x + Vx 


dX y — 

7 ^ L = ~^ + X - 


dy 
dx 

dy_ = Jy . 
dx V jc 


Find the particular solution of each of the following differen¬ 
tial equations that satisfies the given initial condition. 

8 -^ 7 - = IOjc + 5, y = 15 when jc = 0. 
dx 

9 “■ = 2 xy 2 , y = 1 when jc = 2. 
dx 

10 = —, y = 3 when x = 2. 
dx y 

11 y^ = Vy 4 + 2j 2 + 1), y = 1 when x = 4. 

.. dy 5 + 3jc 2 

12 ^ = YTV’ y = 2whenx = ~ 2 - 

13 ^ = Vjty, y = 64 when x = 9. 


In Problems 14-19, verify that the given function is a solu¬ 
tion of the given differential equation for all choices of the 
constants A and B. 


14 y = x + Ax 2 , jc= 2 y — x. 

dx 

15 y = Ax + x 3 , x^y- = y + 2x 3 . 

dx 


16 y = x T A\^x 2 + 1, (x 2 + 1) = xy 4- 1. 

dx 

17 y = Ax + Vx I TT, x^j- = y - -- 1 — ■. 

^ V^TT 

18 y=Ax + ^ x2 i? + x %~y = 0- 

19 y = Ax + Bx 2 , jc 2 _ 2jc + 2y = 0. 

dx 2 


20 In a certain barbarous land, two neighboring tribes have 
hated one another from time immemorial. Being bar¬ 
barous peoples, their powers of belief are strong, and a 
solemn curse pronounced by the medicine man of the 
first tribe deranges the members of the second tribe and 
drives them to murder and suicide. If the rate of change 
of the population P of the second tribe is -VP people 
per week, and if the population is 676 when the curse is 
uttered, when will they all be dead? 


Much of the original inspiration for the development of calculus came from the 
science of mechanics, and these two subjects have continued to be inseparably 
connected down to the present day. Mechanics rests on certain basic principles 
that were first laid down by Newton. The statement of these principles requires 
the concept of the derivative, and we shall see in this section that their applica¬ 
tions depend on integration and the solution of differential equations. 

Rectilinear motion is motion along a straight line. In contrast, motion along a 
curved path is sometimes called curvilinear motion. Our present purpose is to 
study the rectilinear motion of a single particle, that is, of a point at which a 
body of mass m is imagined to be concentrated. In discussing the motion of phys¬ 
ical objects, such as cars, bullets, falling rocks, etc., we often ignore the size and 
shape of the object and think of it as if it were a particle. 

The position of our particle is completely determined by its coordinate 5 with 
respect to a conveniently chosen coordinate system on the line (Fig. 5.8). Since 
the particle moves, 5 is a function of the time t, as measured from some conve¬ 
nient initial instant t = 0. We symbolize this by writing s = s(t). As we know 
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F = ma 
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Figure 5.8 
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from the discussion in Section 2.4, the velocity v of the particle is the rate of 
change of its position, 

ds 

V ~~dt ’ 

and the speed is the absolute value of the velocity.* In general, the velocity of a 
moving particle changes with time, and the acceleration a is the rate of change 
of velocity, 

_ dv _ d / ds \ _ d 2 s 
a ~ dt ~ dt\dtj~ dt 2 ' 

This is positive or negative according as v is increasing or decreasing. 

The basic assumption of Newtonian mechanics is that force is required in or¬ 
der to change velocity; that is, acceleration is caused by force. The concept of 
force originates in the subjective feeling of effort that we experience when we 
change the velocity of a physical object, for instance, when we push a stalled car 
or throw a rock. In the case of rectilinear motion, we assume that a force can be 
expressed by a number, which is positive or negative according as the force acts 
in the positive or negative direction. 

Newton's second law of motion states that the acceleration of a particle is di¬ 
rectly proportional to the force F acting on it and inversely proportional to its 
mass m. 


F 


a = —, 
m 


(1) 


or equivalently, 


F = ma. 


( 2 ) 


[The units of measurement for these quantities are always chosen so that the con¬ 
stant of proportionality in equation (1) has the value 1, as shown.) Thus, if the 
force is doubled, then by (1) the resulting acceleration is also doubled; and if the 
mass is doubled, the acceleration is cut in half. In this context, the mass of a 
body can be interpreted as its capacity to resist acceleration. 1 ' 

From one point of view, equation (2) can be considered as nothing more than 
a definition of force, because the right side is a quantity that can be calculated 
by measuring the mass and observing the motion, and this determines the force. 
On the other hand, the force F is often known in advance from fairly simple phys¬ 
ical considerations. The innocent-looking equation F = ma then becomes the 
second-order differential equation 



(3) 


This equation has profound consequences, for in principle we can find the par- 


*We have pointed out before that even though the words “velocity” and “speed” are more or less syn¬ 
onymous in ordinary usage, in physics (and mathematics) they have different meanings. The dis¬ 
tinction lies in the fact that the velocity v is sometimes positive and sometimes negative, depending 
on whether s is increasing or decreasing. On the other hand, the speed is \ v\, and hence is never neg¬ 
ative. 

f Newton s first law of motion asserts that if no force acts on a particle, then its velocity does not 
change, that is, its acceleration is zero. This is clearly a special case of (1). 


5.5 MOTION UNDER GRAVITY. ESCAPE VELOCITY AND BLACK HOLES 


183 


tide’s position 5 at any time t by solving (3) with appropriate initial conditions.* 

Example 1 Find the motion of a stone of mass m which is dropped from a point 
above the surface of the earth. 

Solution The most important example of a known force is the familiar “force 
of gravity.” From direct experimental evidence, we know that the force of grav¬ 
ity acting on the stone (this is the weight of the stone) is directed downward and 
has magnitude F = mg, where g is the constant acceleration due to gravity near 
the surface of the earth (g = 32 ft/s 2 or 9.80 m/s 2 , approximately). If 5 is the 
stone’s position as measured along a vertical axis, with the positive direction 
pointing downward and the origin at the initial position of the stone (Fig. 5.9), 
then equation (3) is 

d 2 s d 2 s 

m l? = m 8 0f = * 

Integrating this equation twice gives 


II 

5-IS* 

ii 

+ 

(4) 

S = Jgt 2 + C\t + c 2 , 

(5) 


where c\ and c 2 are constants of integration. Since the stone is “dropped” (that 
is, released with no initial velocity) at time t — 0 from the point chosen as the 
origin, the initial conditions are 

v = 0 and 5 = 0 when t = 0. 

The condition v = 0 when t — 0 gives c\ — 0, and 5 = 0 when t — 0 gives c 2 — 
0. We therefore have 

v = gt , (6) 

s = Igt 2 , (7) 

at least until the stone hits the ground. If we change the situation and require that 

the stone be thrown downward with an initial velocity Vq> from the initial posi¬ 

tion 5 = 5o at time t — 0, then the initial conditions are 

v - Vq and s = sq when t = 0, 

and (4) and (5) become 


V= gt + Vo , 
s = {gt 2 + VQt + SQ. 


It should be pointed out that in this discussion we have ignored the effect of 
air resistance, and have assumed that the only force acting on the falling stone 


*The intellectual impact of Newton’s F = ma on the seventeenth and eighteenth centuries was even 
greater than that of Einstein’s E = me 2 on the twentieth century. 


0 


s 



s -axis 

Figure 5.9 
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is the force of gravity. It is possible to take the air resistance into account, but in 
this case equation (3) is too complicated for us to cope with here. We return to 
this topic in Chapter 8 . 

We also remark that if distance is measured in feet and time in seconds, so 
that g has the numerical value 32, then ( 6 ) and (7) become 

v = 32r and s = 16r 2 . 

It is clear from the first of these equations that the velocity of the stone increases 
by 32 ft/s during each second of fall, and of course this is what is meant by the 
statement that the acceleration due to gravity is 32 feet per second per second 
(ft/s 2 ). 


Example 2 A stone is thrown upward with an initial velocity of 128 ft/s from 
the roof of a building 320 ft high. Express its height above the ground as a func¬ 
tion of time. Find the maximum height the stone attains. Assuming that the stone 
misses the building on its way down, how long does it take to hit the ground? 
What arc the velocity and speed uf die stone at the moment it Hits the ground? 


r\ 



Solution We place the s-axis with its origin on the ground and the positive di¬ 
rection pointing upward ( Fig. 5.10). Since the force of gravity is directed down¬ 
ward, and by equation ( 2 ) the force and acceleration have the same sign, the ac¬ 
celeration of the stone is given by 

d 2 s 

a=-L r = -32. (8) 


Integrating this equation yields 


ds 


v = — = — 32r 4- ci, 
dt 


and by using the initial condition ^=128 when t = 0 , we get 

v=^~= -32 t + 128. 
dt 


(9) 


A second integration gives 

s = — 16r 2 + 128r -l- C 2 , 
and since 5 = 320 when t = 0, we obtain 

5 = — 16r 2 + 128r + 320 (10) 

as the height of the stone above the ground at any time t. 

To find the maximum height attained by the stone, we write (9) in the form 

v= -32(r-4). 

This tells us that for t < 4, the velocity is positive, so the stone is moving up¬ 
ward. When t = 4, the velocity is zero and the stone is motionless for an instant. 
For t > 4, the velocity is negative and the stone is falling. We therefore find the 
maximum height by putting t = 4 into equation (10). This gives 5 = —16 • 16 + 
128 • 4 + 320 = —256 + 512 + 320 = 576 as the maximum height. 
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The stone hits the ground when s = 0. By using equation (10) we see that this 
leads us to the sequence of equivalent equations 

- 16r 2 + 128r + 320 = 0, 

-16(r 2 - 8 1 - 20) = 0, 

(t - 10)(r + 2) = 0. 

Thus s = 0 when t — 10 or t = —2. The second answer is meaningless in the cir¬ 
cumstances, and can be discarded. Therefore the stone hits the ground 10 s after 
being thrown. 

To find the velocity of the stone at the moment it hits the ground, we put 
t = 10 into equation (9): v = —32 • 10 + 128 = —320 + 128 = —192. The ve¬ 
locity at that moment is therefore —192 ft/s, and the minus sign tells us that the 
stone is moving downward. The speed at that moment is | — 192| = 192 ft/s. 


In these examples we have treated the acceleration due to gravity as if it were 
a constant. This is almost true for moving bodies that stay fairly close to the sur¬ 
face of the earth. However, to study the motion of a body that moves away from 
the earth into space, we must take account of the fact that the force of gravity 
varies inversely as the square of the distance from the center of the earth. 

Example 3 Suppose a rocket is fired vertically upward with initial velocity v$ 
and thereafter coasts with no further expenditure of energy. For larger values of 
Vq it rises higher before coming to rest and falling back to earth. What must v 0 
be in order for the rocket never to come to rest, and thereby to escape completely 
from the earth’s gravitational attraction? 


Solution According to Newton s law of gravitation, any two particles of mat¬ 
ter in the universe attract each other with a force that is jointly proportional to 
their masses and inversely proportional to the square of the distance between 
them. In the present situation (see Fig. 5.11), this means that the force F attracting 
the rocket back to earth is given by the inverse square law 


F = 



where G is a positive constant, M and m are the masses of the earth and the 
rocket, and s is the distance from the center of the earth to the rocket.* 

We begin our detailed analysis of the problem by observing that in this case 
Newton’s second law of motion F = ma becomes 


d 2 s Mm 

m —rrr = — G — , 
dt 2 s 2 



Figure 5.11 


SO 


d 2 s _ GM 
dt 2 s 2 


(ID 


*It can be proved—and will be proved in a later chapter—that the gravitational attraction exerted 
on the rocket by the earth as a whole is the same as that which would be exerted by a particle of 
mass M located at the center of the earth. In other words, the entire mass of the earth can be treated 
as if it were concentrated at its center. 
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This tells us at the outset that the motion of the rocket does not depend on the 
rocket’s own mass. We can put the constants here into a more convenient form 
by noticing that the acceleration d 2 s/dt 2 has the value —g when s = /?, where R 
is the radius of the earth. This gives 

GM rAA d 2 

-g= — or GM = gR l \ 


and since d 2 s/dt 2 = dvldt, we can write (11) as 


dv = gR 2 
dt s 2 


( 12 ) 


Our next step is to eliminate t from this equation by using the chain rule to write 

dv _ dv ds _ dv 
dt ds dt ds 


Equation (12) now becomes 


dv gR 2 

v ~r = —t 

ds s l 


By separating variables and integrating, we obtain 

jvdv=gR 2 j -p- 


or 


v = 


sEL 


(13) 


To evaluate the constant of integration c, we use the initial condition that 
v = Vo when s = R, so 

W = gR + C 


and 


C = W - gR. 


With this value of c, equation (13) becomes 

\v 2 = + (\vq 2 - gR). 


(14) 


Our final conclusion emerges from (14) as follows: For the rocket to escape from 
the earth, it must move in such a way that \ v 2 is always positive, for if \v 2 van¬ 
ishes, the rocket stops moving and then falls back to earth. But the first term on 
the right of (14) evidently approaches zero as s increases. Therefore, in order to 
guarantee that \v 2 is positive no matter how large s is , we must have {vq 2 — 
gR > 0. This is equivalent to Vq 2 > 2 gR or Vq > V2 gR. The quantity V2 gR is 
usually called the escape velocity for the earth. We can easily estimate its value 
by using the approximations g = 32 ft/s 2 and R = 4000 mi: 

V2 Jr = V2 • 32 ft/s 2 • 4000 mi 

— ^ 2 ■ 32 ■ 3^80 mi/s 2 * 4000 mi 


= 7 mi/s = 25,000 mi/h. 
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Remark 1 In just the same way as in this example, the quantity \Z2g'R' is the 
escape velocity for any planet, satellite, or star, where R' and g' are understood 
to be the radius and the acceleration due to gravity at the surface. If the radius 
of such a body is decreased while the mass is unchanged, the escape velocity at 
the surface increases. Why? 

Remark 2 Most normal stars are maintained in their gaseous, puffed-up state 
by radiation pressure from within, which is generated by the burning of nuclear 
fuel. When the nuclear fuel gives out, the star undergoes gravitational collapse 
into a very much smaller sphere of essentially the same mass. The crushed, de¬ 
generate matter of these collapsed stars can sustain two types of equilibrium, de¬ 
pending on the mass of the star. White dwarfs are those that result when the mass 
is less than about 1.3 solar masses, and neutron stars arise when the mass is be¬ 
tween 1.3 and 2 solar masses. For heavier stars no equilibrium is possible, and 
collapse continues until the escape velocity at the surface reaches the speed of 
light. Collapsed stars of this type are completely invisible, since no radiation can 


ever escape. These are the so-called black holes. 

PROBLEMS 

1 In Example 2, how long after the stone is thrown does it 
pass the roof of the building on its way down? What are 
the velocity and speed at that moment? 

2 In Example 2, if the stone were simply dropped from the 
roof, what would 5 be as a function of time? How long 
would the stone fall? 

3 In Example 2, the origin of the 5-axis is at ground level. 
If the origin is placed at the top of the building, what are 
the formulas for v and s that correspond to (9) and (10)? 

4 A ball is thrown upward from the top of a cliff 96 ft high 
with an initial velocity of 64 ft/s. Find the maximum 
height of the ball above the ground below. Assuming that 
the ball misses the cliff on its way down, how long does 
it take to hit the ground? 

5 A bag of ballast is accidentally dropped from a balloon 
which is stationary at an altitude of 4900 m. How long 
does it take for the bag to hit the ground? 

6 With what velocity must an arrow be shot upward in or¬ 
der to fall back to its starting point 10 seconds later? How 
high will it rise? 

7 A boy at the top of a cliff 299 ft high throws a rock 
straight down, and it hits the ground 3{ seconds later. 
With what velocity does the boy throw the rock? 

8 A woman standing on a bridge throws a stone straight 
up. Exactly 5 seconds later the stone passes the woman 
on the way down, and 1 second after that it hits the wa¬ 
ter below. Find the initial velocity of the stone and the 
height of the bridge above the water. 

9 A stone is dropped from the roof of a building 256 ft 
high. Two seconds later a second stone is thrown down¬ 
ward from the roof of the same building with an initial 


velocity of Vq feet per second. If both stones hit the 
ground at the same time, what is vf! 

10 How much time does a train traveling 144 km/h take to 
stop if it has a constant negative acceleration of 4 m/s 2 ? 
How far does the train travel in this time? 

11 A man standing on the ground throws a stone straight 
up. Neglecting the height of the man, find the maximum 
height of the stone in terms of initial velocity vq. What 
is the smallest value of vq that will make it possible for 
the stone to land on top of a 144-ft building? 

12 O n the surface o f the moon the acceleration due to grav¬ 
ity is approximately j that at the surface of the earth, and 
on the surface of the sun it is approximately 29 times as 
great as at the surface of the earth. If a person on earth 
can jump with enough initial velocity to rise 5 ft, how 
high will the same initial velocity carry that person (a) 

^ on the moon? (b) on the sun? 

•llll 13 Newton’s law of gravitation implies that the acceleration 
due to gravity at the surface of a planet (or the moon or 
the sun) is directly proportional to the mass of the planet 
and inversely proportional to the square of the radius. 

(a) If g m denotes the acceleration due to gravity at the 
surface of the moon, use the fact that the moon has 
approximately the radius and the mass of the 
earth to show that g m is approximately g/6. 

(b) Use part (a) to show that the escape velocity for the 
moon is approximately 1.5 mi/s. 

14 Show that the point between the earth and the moon 
where the two exert equal but opposite gravitational 
forces on a particle is of the way from the center of 
the earth to the center of the moon. 
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CHAPTER 5 REVIEW: CONCEPTS, METHODS 


Define, state, or think through the following. 

1 Differentials dx and dy. 

2 Tangent line approximation. 

3 Indefinite integral (or antiderivative). 

4 Integrand and constant of integration. 

5 Integration formulas. 

6 Integration by substitution. 

7 Differential equation. 

8 Order of a differential equation. 


9 General solution, particular solution. 

10 Separation of variables. 

11 Initial condition. 

12 Rectilinear motion. 

13 Velocity, speed, acceleration. 

14 Newton’s second law of motion. 

15 Newton’s law of gravitation. 

16 Escape velocity. 


ADDITIONAL PROBLEMS FOR CHAPTER 5 


SECTION 5.3 


Compute the following integrals. Be sure to include the con¬ 
stant of integration in each answer. 


1 j(ix A - lx 3 + 10 ) <±c. 2 

_ f x 3 - 3jc 2 + * - 2Vx 

3 - dx. 

J x 




5 

1 

9 

11 

13 

15 

17 

19 

21 

23 

25 

27 

29 

30 

31 


Jjc(jc + l) 2 dx. 

6 

f(x + 3)(x 2 - 1) dx. 

f(5lx 2 - 1 08jc 3 ) dx. 

8 

r; 2 - 

f( 2 - Vx)(3 + Vx) dx. 

10 

/Vjc(7jc 2 — 5jc + 3) dx. 

/V2 - 3x dx. 

12 

J(3 + 7jc 2 ) 9 5jc dx. 

f(5x + 2) 164 dx. 

14 

f(3 — 4jc) 3/4 dx. 

f 5 jc dx 

Wl 

16 

fV3x 2 -2 xdx. 

r x 2 dx 

18 

r dx 

' V2x 3 - 1 

3 i/(lx + 3) 2 

f (jc — 1 ) dx 

20 

f dx 

I 

3 xV3x' 

3 S/x 2 -2x + 3 

f x dx 

22 

f xdx 

3 V(2 - x 2 ) 2 

3 V(x 2 - 4) 3 

f( , + IV* 

24 

t x 2 dx 

A 1 -J **• 

' (2 + 3x 3 ) 3 ' 

/(jc 2 + 2 jc + 1) 2/3 dx. 

26 

fx^l 4- jc 2 dx. 

\xS/\ + xdx. 

28 

fV2x 6 + x 4 , 

dx. 

J X 


f(x 3 + x + 32) 9/2 (3x 2 + 1) dx. 
f(x 2 + 1)V dx. 
f(x 3 — l) I/3 x 5 dx. 


SECTION 5.4 

32 Find the general solution of each of the following dif¬ 
ferential equations: 

(a) = 2y 2 (4jc 3 + 4 jc -3 ); 
dx 

(b) ^ = V(jc 2 - x- 2 ) 2 + 4. 


33 


34 


35 

36 

37 


Find the indicated particular solution of each of the fol¬ 
lowing differential equations: 
dy _ jc( 1 + y 2 ) 2 


^ dx y(l + jc 2 ) 2 
dy 


(b) = Vxy - 4jc - y + 4 


y = 1 when jc = 2; 

y = 8 when x = 5. 


The equation jc 2 = 4 py represents the family of all 
parabolas with vertex at the origin and axis the y-axis. 
Find the family of curves that intersect the curves of this 
given family at right angles. Hint: Show first that the 
slope of the tangent at every point (jc, y) (y i= 0) on each 
curve of the given family is 2y/x. 

Solve Problem 34 if the given family is jc y = c. 

Find y as a function of jc if dy/dx + y/x = 0. 

Equation (8) in Section 5.4 can be written as 


dy _ 1 + y/x 
dx 1 — y/x ’ 

and this suggests the substitution z = y/x. Use this idea 
to replace y by z as the dependent variable, and show 
that the variables can be separated in the resulting dif¬ 
ferential equation. Notice that the necessary integra¬ 
tions are beyond our capacity at the present stage, so 
in spite of our progress we have reached a temporary 
dead end. 


SECTION 5.5 

38 A ball is thrown vertically upward with an initial veloc¬ 
ity of 78 ft/s from the roof of a building 400 ft high. Find 





























ADDITIONAL PROBLEMS FOR CHAPTER 5 


189 


the distance 5 from the ground up to the ball t seconds 
later. If the ball misses the building on the way down, 
how long does it take to hit the ground? 

39 (a) A bullet is fired downward with a velocity of 400 ft/s 

from an airplane 20,000 ft above the ocean. Ne¬ 
glecting air resistance, how long does it take the bul¬ 
let to reach the water, and what is its velocity at the 
moment of impact? 

(b) If the bullet is merely dropped from the airplane, how 
long does it take to fall, and what is its velocity on 
impact? 

40 Show that a rock thrown straight up from the ground 
takes just as long to rise to its highest point as it does to 
fall back to its initial position. How is the velocity with 
which it hits the ground related to its initial velocity? An¬ 
swer the same question for its speed. 

41 A ball is dropped out of a window 19.6 m above the 
ground. At the same time another ball is thrown straight 
down from a window 79.6 m above the ground. If both 
balls reach the ground at the same moment, find the ini¬ 
tial velocity of the second ball. 

42 An automobile is traveling in a straight line at a veloc¬ 
ity of vo feet per second. The driver suddenly applies the 
brakes, and the car stops in T seconds after traveling S 
feet. If the brakes produce a constant negative accelera¬ 
tion of -ao ft/s 2 , find formulas for T and S in terms of 
Vo and ao . 

43 An astronaut stands on the edge of a cliff and drops a 
stone. She observes that it takes 4 seconds for the stone 
to fall to the ground at the bottom. On earth, this would 
mean that the cliff is 256 ft high. How high is the cliff 
(a) if the astronaut is on the moon, where the accelera¬ 
tion due to gravity is approximately 5.5 ft/s 2 ? (b) if she 
is on Jupiter, where the acceleration due to gravity is ap¬ 
proximately 85 ft/s 2 ? 

44 The results of Problem 13 in Section 5.5 are given in the 
second column of the following table: 



Earth 

Moon 

Jupiter 

Saturn 

Sun 

Mass (earth = 1) 

1 

1 

81 

317 

95 

332,000 

Radius (mi) 

4000 

1100 

43,000 

36,000 

432,000 

Acceleration of gravity 

8 

g/6 

2.6 g 

1-2 8 

29 g 

Escape velocity (mi/s) 

1 

1.5 

38 

23 

400 


Verify the rough approximations given in the third and 
fourth rows for Jupiter, Saturn, and the sun. 

45 If the sun could be crushed into a smaller sphere with 
the same mass, estimate what its new radius would have 
to be in order to increase the escape velocity at its sur¬ 
face to the speed of light (approximately 186,000 mi/s 
or 300,000 km/s). What would the new radius have to be 
in the case of the earth? 


46 Estimate the escape velocity from the surface of a white 
dwarf, a type of star in which a mass about equal to that 
of the sun is compressed into a volume about equal to 
that of the earth. 

47 Estimate the escape velocity from the surface of a neu¬ 
tron star, a type of star in which a mass about equal to 
two solar masses is compressed into a sphere of radius 
about 4 mi. 

48 According to currently accepted ideas among as¬ 
tronomers, the universe came into existence about 15 bil¬ 
lion years ago in an explosion called the Big Bang. Ever 
since that time the universe has been expanding in such 
a way that the velocity v of a galaxy at distance R from 
our galaxy (the Milky Way) is given by Hubble's law 
v = HR, where H is Hubble’s constant, about 16 km/s 
per million light-years (a light-year is about 9.47 X 
10 12 km). No one knows whether this expansion of the 
universe will continue indefinitely. If the universe con¬ 
tains enough matter, then the gravitational forces exerted 
by this matter on itself will ultimately slow down and 
stop the expansion. Then there will be a period of con¬ 
traction ending in a complete gravitational collapse 
called the Big Crunch, in which the universe as we know 
it—space, time, matter, energy—will cease to exist. 

(a) Show that the universe will continue to expand for¬ 
ever if the present density of matter 8 (mass per unit 
volume) is less than the critical density' 8 C = 
3H 2 /8ttG.* Hint: What is the escape velocity at a dis¬ 
tance R from our galaxy, due to the matter inside a 

— sphere of radius R centered on us? 

•Alii (b) Estimate the value of the critical density <5 C , given 
that the gravitational constant G is approximately 
6.67 X 1CT 20 km 3 /(kg*s 2 ). 

49 Newton’s second law of motion F = ma = m dv/dt can 
be written in the form F = d/dt (mv) in terms of the mo¬ 
mentum mv of a particle of mass m and velocity v, and 
remains valid in this form even if m is not constant, as 
assumed so far. Suppose a spherical raindrop falls 
through air saturated with water vapor, and assume that 
by condensation the mass of the raindrop increases at a 
rate proportional to its surface area, with c the constant 
of proportionality. If the initial radius and velocity of the 
raindrop are both zero, show that the drag exerted by the 
condensation of the water vapor has the effect of mak¬ 
ing the raindrop fall with acceleration jg. Hint: Show 
that d/dr (r 3 v) = ( 8/c ) r 3 g, where r is the radius of the 
raindrop and 8 is its density. 


*It would be interesting to know the ultimate fate of the universe, but 
unfortunately no one knows the present density of matter. 










6 


DEFINITE 

INTEGRALS 


6.1 

INTRODUCTION 


At the beginning of Chapter 2 we described calculus as the study of methods for 
calculating two important quantities associated with curves, namely, 

1 Slopes of tangent lines to curves, and 

2 Areas of regions bounded by curves. 


Of course, this description gives an oversimplified view of the subject, for it 
emphasizes calculus as a tool in the service of geometry but says nothing about 
its indispensable role in the study of science. Nevertheless, it explains the tradi¬ 
tional division of calculus into two distinct parts: differential calculus, which 
deals with slopes of tangent lines, and integral calculus, which is concerned 
with areas. 

The problem of areas was of great interest to the ancient Greeks. They knew 
a good deal about the areas of triangles, circles, and related configurations, but 
any other figure presented a new and usually insoluble problem. Archimedes was 
able to apply a technique called the method of exhaustion to calculate the area 
of a segment of a parabola, and also to calculate a few other particular geomet¬ 
ric quantities. But for almost 2000 years this handful of calculations by 
Archimedes stood as the isolated achievement of a great genius, unmatchable by 
others. However, by the middle of the seventeenth century several European 
thinkers—most notably Fermat and Pascal—began to push the method of ex¬ 
haustion beyond the point where Archimedes had left it. The decisive break¬ 
through was achieved a little later by Newton and Leibniz, who showed that if a 
quantity can be computed by exhaustion, then it can also be computed much more 
easily by using antiderivatives. This crucial discovery is called the Fundamental 
Theorem of Calculus. It binds together the two parts of the subject, and is un¬ 
doubtedly (as we have said before) the most important single fact in the whole 
of mathematics. 

This is the path we follow in the present chapter. Since calculations will seem 
to play a prominent part in our work, it is even more necessary than usual for 
students to keep firmly in mind that the underlying ideas are more important than 
the calculations. 
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Every rectangle and every triangle has a number associated with it called its area. 
The area of a rectangle is defined to be the product of its height and its base, and 
the area of a triangle is one-half the product of the height and the base (Fig. 6.1). 
Since a polygon can always be decomposed into triangles (Fig. 6.2), its area is 
the sum of the areas of these triangles. 

The circle is a more difficult figure. The Greeks solved the problem of find¬ 
ing its area in a very natural way. First, they approximated this area by inscrib¬ 
ing a square (Fig. 6.3). Then they improved the approximation step by step by 
doubling and redoubling the number of sides, that is, by inscribing a regular oc¬ 
tagon, then a regular 16-gon, and so on. The areas of these inscribed polygons 
evidently approach the exact area of the circle more and more closely. This idea 
yields the familiar formula 

A = 77 r 2 (1) 


6.2 

THE PROBLEM 
OF AREAS 


for the area A of a circle in terms of its radius r. The details of the reasoning are 
as follows. Suppose that the circle has inscribed in it a regular polygon with a 
large number of sides (Fig. 6.4). Each of the small isosceles triangles shown in 
the figure has area \hb , and the sum of these areas equals the area of the poly¬ 
gon, which closely approximates the area of the circle. If p denotes the perime¬ 
ter of the polygon, then we see that 

Apoiygon 2bb T ^hb T * * * T- -^hb 

= \h{b + b + • • • + b) = \hp . 

Now let c be the circumference of the circle, so that c = 2 irr by the definition 
of 77 .* Then, as the number of sides of the polygon increases, h approaches r (in 
symbols, h —» r\ p —» c, and therefore 



"That is, 7ris defined to be the ratio of the circumference to the diameter, so 7r= c/2r and therefore 
c = lirr. 


Figure 6.4 
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^polygon — \hp —* \rc = \ri2irr) = irr 2 , 

which establishes (1). The phrase “method of exhaustion” is clearly a good de¬ 
scription of this process, because the area of the circle is “exhausted” by the ar¬ 
eas of the inscribed polygons. 

We next examine the procedure by which Archimedes calculated the area of a 
parabolic segment, that is, the area of the part of the parabola in Fig. 6.5 bounded 
by the arbitrary chord AB and the arc ADCEB. There is no convenient way to in¬ 
scribe regular polygons in this figure, so Archimedes used triangles instead. His 
first approximation was the triangle ABC , where the vertex C is chosen as that 
point where the tangent to the parabola is parallel to AB. His second approxi¬ 
mation was obtained by adding to the triangle ABC the two triangles ACD and 
BCE , where the vertex D is the point where the tangent is parallel to AC and the 
vertex E is the point where the tangent is parallel to BC. To obtain his third ap¬ 
proximation, he inscribed triangles in the same way in each of the four regions 
still not included (one such region is that between the arc CE and the chord CE ), 
so his third approximation was the sum of the areas of the triangles ABC , ACD , 
BCE , and the four new triangles. By continuing to exhaust the parabolic segment 
in this way, he was able to show that its area is exactly four-thirds the area of 
the first triangle ABC. The details of his argument are a bit complicated; and 
since our interest here is mainly in the idea of the method of exhaustion, we omit 
these details. 

The general problem before us is that of finding the area of a region with a 
curved boundary. However, most of our work will be concentrated on a special 
case of this general area problem—namely, finding the area under the graph of 
a function y = f(x) between two vertical lines x = a and x = b, as shown in Fig. 
6.6. Such a region has a boundary that is curved only along its upper edge, and 
is therefore much easier to work with. A knowledge of this special case is often 
enough to enable us to cope with more complicated regions. To understand how 
this is possible, notice in Fig. 6.7 that the area of a region whose entire bound¬ 
ary is curved can often be obtained by subtracting the area under its lower edge 
from the area under its upper edge, where each of the latter areas is of the spe¬ 
cial type shown in Fig. 6.6. 

In Section 6.4 and thereafter, we will denote an area of the type shown in Fig. 
6.6 by the standard symbol 


| f(x) dx, (2) 

Ja 

which is read “the definite integral from a to b of f(x) dxP The reason for this 
notation will become clear in Section 6.4. For the present, however, we warn stu¬ 
dents in advance not to confuse the definite integral (2) with the indefinite inte¬ 
gral (or antiderivative) 


J fix) dx (3) 

introduced in Chapter 5. In spite of the fact that these two integrals have the same 
family name and look very much alike, they are totally different entities: The 
definite integral (2) is a number, and the indefinite integral (3) is a function (or 
a collection of functions). 
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At first sight it might appear that the problem of calculating areas is a matter 
of geometry and nothing more—interesting to mathematicians, perhaps, but with 
no practical uses in the real world outside of mathematics. This is not the case 
at all. It will become clear in the next chapter that many important concepts and 
problems in physics and engineering depend on exactly the same kinds of ideas 
as those used in calculating areas. As examples we mention the concepts of work 
and energy in physics, and also the engineering problem of finding the total force 
acting against the face of a dam due to the pressure of the water in a reservoir. 
Finding areas is therefore much more than merely a game mathematicians play 
for their own diversion. Nevertheless, for the sake of clarity we confine our at¬ 
tention in this chapter to the area problem itself, and in Chapter 7 we begin to 
sample the immense range of applications of the underlying idea. 


Remark 1 As a matter of historical interest, it appears that the first person to 
find the exact area of a figure bounded by curves was Hippocrates of Chios, the 
most famous Greek mathematician of the fifth century B.C. To understand what 
he did, consider the circle shown in Fig. 6.8, with the points A, B, C, D at the 
ends of the horizontal and vertical diameters. Using C as a center, describe the 
circular arc AEB connecting A and B. The crescent-shaped figure bounded by the 
arcs ADB and AEB is called a lune of Hippocrates (luna is Latin for “moon”), 
after the man who made the remarkable discovery that its area is exactly equal 
to the area of the shaded square whose side is the radius of the circle. Thus Hip¬ 
pocrates “squared the lune,” even though he was unable to square the circle it¬ 
self.* 

Remark 2 Most of us remember from school that the numerical value of i t is 
approximately 3.14, and some of us even remember a more accurate approxi¬ 
mation, 7 t= 3.14159. Also, in one of his treatises Archimedes derived his fa¬ 
mous inequality 

o 10 ^ 22 

37T < •”■ < 3t = T- 

which is the basis for the rough but widely used approximation 7r = t- Where 
do these values come from? 

The number tt was defined above as the ratio of the circumference of a circle 
to its diameter. As we saw, this yields the formula A = rrr 2 , which tells us that 
7 t is also the area of the unit circle (circle with unit radius) x 2 + y 2 = 1. The 
problem of computing tt therefore amounts to the problem of finding the area of 
the unit circle. 

To accomplish this, let p n and P„ be /t-sided regular polygons, with p n inscribed 
in the unit circle and P n circumscribed around it, as shown in Fig. 6.9. To find 
the areas of these polygons, it suffices to find the areas of the isosceles triangles 
making up p n and P n and then to multiply by n. If 6 is half the vertex angle, then 
6 is clearly the same for both isosceles triangles; and using degree measure, we 
have 



c 

Figure 6.8 Squaring the lune. 


J_ 360° _ 180° 
2 n n 


"Hippocrates’ exceedingly beautiful (but easy to understand) proof is given in the Appendix at the 
end of this chapter. 
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Figure 6.9 



n 

Mp„) 

MP„) 

6 

2.598076 

3.464102 

12 

3.000000 

3.215390 

24 

3.105829 

3.159660 

48 

3.132629 

3.146086 

96 

3.139350 

3.142715 

200 

3.141076 

3.141851 

400 

3.141463 

3.141657 

800 

3.141560 

3.141609 

1600 

3.141585 

3.141597 

3200 

3.141591 

3.141594 

6400 

3.141592 

3.141593 


By examining the figure we see that the area of p n is 

A(Pn) = n • 2 • j sin 6 cos 6 = sin 26 = ~ sin —^ — 

2 2 n 


and the area of P„ is 


A(P n ) = n • 2 • y tan 0 = n tan 


180° 


(4) 


(5) 


By substituting convenient values of rc in formulas (4) and (5), and using a cal¬ 
culator, we easily fill in the values shown in the adjoining table. Because 

A{p n ) < 77 < A(P n ) 

for all n, it is clear that i t = 3.14159, correct to five decimal places. In Section 
14.4 we describe other methods that have made it possible to compute 77 to more 
than 500,000 decimal places. 


6.3 


THE SIGMA NOTATION 
AND CERTAIN SPECIAL 
SUMS 


In order to clarify our discussion of definite integrals in the next section, we in¬ 
troduce here a standard mathematical notation used for abbreviating long sums. 
This is called the sigma notation, because it uses the Greek letter 2 (sigma). In 
the Greek alphabet the letter 2 corresponds to our letter S, which is the first let¬ 
ter of the word “sum.” This helps us to remember the purpose of the sigma no¬ 
tation, which is to suggest the idea of summation or addition. 

Thus, if a\, ai, . . . , a n are any given numbers, their sum is denoted by 


I 


a k . 


(1) 


This symbol is read “the sum from k = 1 to n of a k .” The idea compressed in (1) 
is that we are to write down each of the numbers a k as the subscript k varies from 
1 to n (namely, a\, 02 , . . . , a n ) and then add all these numbers together: 

n 

X a k = at + a 2 + • • • + a tv 

k= 1 

We write k = 1 below the 2 in (1), and n above it, to say that the sum starts with 
the term a k with k replaced by 1, and stops with the term a k with k replaced by 
n. The letter k used as the subscript here is called the index of summation. Any 
other letter (/ or j, for instance) would do just as well. Thus, 
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X * 3 ’ X * 3 ’ and X j 3 

Ar=l f=l y=l 

all represent the same sum, namely, l 3 + 2 3 + 3 3 + 4 3 + 5 3 = 225. 

We give a few additional specific examples of the sigma notation: 

y —— = —!— + — - — + —2— 

*=i * 2 + i i 2 + i 2 2 + i 3 2 + r 

y ( _!u+i J_ = J-L + J-L. 

1 ’ k 2 l 2 2 2 3 2 4 2 ’ 

n 

^ /c = 1 + 2 + • • ■ + rc; 

k=\ 

n 

X 2/t = 2 + 4 + • • • + 2w; 

A=l 

n 

y (2& — 1) = 1 + 3 + ''* + (2« —1). 

£=1 

Notice particularly in the second sum the use of the factor (—1)* +1 to produce the 
alternating signs +, —, + , — as the index of summation k takes the values 1, 2, 3, 
4. The last three of these sums are evidently the sum of the first n positive inte¬ 
gers, the sum of the first n even numbers, and the sum of the first n odd numbers. 

The following are some formulas from elementary algebra that will be needed 
in the next section: 


X* 


*=i 


+ 2 + • • 


• + n = 


n(n + 1) 
2 ; 


X k 2 = l 2 + 2 2 + • ■ • + n 2 

k— 1 


n(n + 1)(2 n + 1) 
6 


^ k 3 = l 3 + 2 3 + • • • + n? 

*=i 


n(n + 1) 12 
2 


( 2 ) 

(3) 

(4) 


These formulas can be proved by the method of mathematical induction. How¬ 
ever, an easier way to establish (2) is to write the sum once in the natural order, 
as shown, and then again in reverse order, 


5=1+ 2 +■■■+«, 


5 = n + (n — 1 )+■•• + 1. 


By adding these equations and noticing that each column on the right adds up to 
n + 1 and there are n columns, we get 25 = n(n + 1), from which (2) follows 
at once. 

There is yet another way of proving (2) that is worth knowing about because it 
can easily be adapted to yield (3) and (4) as well, and further formulas of the same 
type. It depends on the simple fact that (k + l) 2 = k 2 + 2k + 1, or equivalently 

(k + l) 2 - k 2 = 2k + 1. (5) 


If we let k = 1, 2, 3, . . . , n in (5), and write the resulting equations one below 
the other, we obtain 

2 2 - l 2 = 2 • 1 + 1, 

3 2 — 2 2 = 2 • 2 + 1, 
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4 2 — 3 2 = 2 • 3 + 1, 


(n + l) 2 - n 2 = 2 • n + 1. 


On the left here we have 2 2 and —2 2 , 3 2 and — 3 2 , . . . , n 2 and — n 2 . Therefore, 
when these equations are added with due attention to the cancellations on the 
left, the result is 



(n + D 2 



+ /I; 


and solving for the sum in parentheses yields (2): 


X k = \[{n + l) 2 - l 2 - n] = \[n 2 + n] 

k = 1 

= n(n + 1) 

2 ' 


PROBLEMS 


1 Find the numerical value of 


5 5 53 

(a) X «' 2 ; (b) x ^ (0 X 

( = 1 7 = 1 k =50 

8 500 300 

(d)X (-•)*; (e)X(-D'; (0 x 5; 

k= 1 / = 1 7=1 


(g) X cos 27 t/:. 

k=0 

Hint: Write the sums out, and examine them carefully. 

2 Use the sigma notation to write the following sums com¬ 
pactly: 

(a) 3 + 9 + 27 + 81; 

(b) 3 - 5 + 7 - 9 + 11 - 13; 

(c) i + io + is + • • • + ij; 

(d) 1 + 2 + 2 2 + ■ ■ ■ + 2 2( «>; 

(e) a 5 + a 6 + a 1 + ■ • • + a 10 ; 

vU 4 6 ' 8 10 ^ 12 14’ 

(g) 1 + 2 2 + 3 3 + 256. 

3 Prove formula (3) by using the expansion (k + l) 3 = 
k 3 + 3k 2 + 3k + 1 and the method suggested in the text. 

4 Prove formula (4) similarly, by using the expansion 
(k + l) 4 = k 4 + 4 k 3 + 6 k 2 + Ak + 1. 

5 Use (2), (3), and (4) to find closed formulas for the sum 
of the first n - 1 (instead of the first n) integers, squares, 
and cubes*: 

(a) 1 + 2 + ■ ■ • + (n - 1) = ? 

(b) l 2 + 2 2 + • • • + in - l) 2 = ? 

(c) l 3 + 2 3 + • • • +{n - l) 3 = ? 

*The indicated sums in (2), (3), and (4) are called open because the 

three-dot notation is used to suggest many terms that are present but 
not written. In contrast to this, the formulas on the right sides of these 
equations are called closed. 


6 Use the method suggested in the text to discover and 
prove closed formulas for (a) l 4 + 2 4 + • ■ ■ + n A \ (b) 
l 5 + 2 5 + • • ■ + n 5 . 

1 By doing a little arithmetic we see that 
p + 2 3 = (1 + 2) 2 , 
l 3 + 2 3 + 3 3 = (1 + 2 + 3) 2 , 
and 

l 3 + 2 3 + 3 3 + 4 3 = (1 + 2 + 3 + 4) 2 . 

Show that 

l 3 + 2 3 + ■■• + / 2 3 = (1 + 2+ ••• 3- n) 2 
for every positive integer n. 

8 There is a wonderful geometric proof of formula (4) that 
was known to the Arab mathematicians more than a thou¬ 
sand years ago. It depends on the square shown in Fig. 
6.10, which is constructed as follows. Beginning at the 
point O, lay off successive segments of lengths 1, 2, 3, 
etc., and finally one of length n extending up to the point 
A. Do the same on a line OB perpendicular to OA, so that 

OA = OB = \+ 2 + -- - + n 

= n(n + 1) f 
2 ’’ 

f This formula for the sum of the first n positive integers is proved 
and used in the writings of Archimedes, and was presumably dis¬ 
covered by him. This formula was therefore known to the Arab math¬ 
ematicians of the Middle Ages, who translated, honored, and pre¬ 
served the works of Archimedes during those dark centuries when 
most Europeans could not read or write and knew nothing of math¬ 
ematics, and those few who could read and write lived in monaster¬ 
ies and were submerged in piety. 
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Figure 6.10 

The area S of the square is therefore given by the formula 
_ \ n(n+ \) ]2 

* l 2 _ ’ 

However, the square is the sum of n L-shaped regions, 
as indicated: 

S ~ L\ + Lj + • • • + L n . 

Use the fact that L n can be split into the two rectangles 
in the figure to show that 

Ln = « 3 , 

so that 

S = l 3 + 2 3 + ♦ • * + A 3 , 


and thereby complete this proof of formula (4). 


The results of the next two problems will be needed in Sec¬ 
tion 6.5. 

9 (a) Use the product formula 

sin md cos nO = y[sin (m + n)0 + sin (m - n)0] 
(Additional Problem 63 in Chapter 1) to show that 
2 sin \x cos kx = sin (k + y);c — sin (k — y)jt. 


(b) 


By adding the identities in part (a) for k = 1,2,..., 
n and exploiting cancellations as in the text, estab¬ 
lish the formula 


cos kx = 

k= 1 


sin (n + {)jc — sin \x 
2 sin jx 


(C) 


where jc is not an integer multiple of 27 t. 

Use the product formula in (a) again to write the sum 
in (b) in the form 

n sin \nx cos \{n + I )x 


^ cos kx = 

k=\ 


sin yjc 


10 


Use the method of Problem 9 to establish the corre¬ 
sponding formula 

n sin \nx sin y (n + I )x 


sin kx 

k= 1 


sin 2 * 


where x is not an integer multiple of 27r. 


We begin by restating the problem we are trying to solve. Let y = f(x) be a given 
nonnegative function defined on a closed interval a < ^ < b, as shown in Fig. 
6.11. How do we calculate the area of the shaded region in the figure, that is, the 
area of the region under the graph, above the ;t-axis, and between the vertical 
lines x — a and x = bl 

Closed intervals like the one mentioned here will occur quite often in our dis¬ 
cussion, so we use the briefer notation [a, b\ Also, most of the functions we 
study will be continuous. The reader will recall that this means the following: 
From the intuitive point of view, the graph consists of a single piece, with no 
gaps or holes; and more precisely, for each point c in [a, b] we must have 

lim f{x) =/(c). 

X —)C 

Such a function has several basic properties that we wish to recognize explicitly: 
It is bounded, in the sense that there exists a constant K such that \f(x)\ < K for 
all x in [a, b\\ and it assumes maximum and minimum values, in the sense that 
the graph has a highest point and a lowest point.* 

We return to Fig. 6.11, with the specific assumption that the function y =f(x) 
is continuous on [a, b]. How do we find the area of the shaded region? If we take 


’"See the discussion of the Extreme Value Theorem in Section 2.6. 
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the nature of this region into account—that is, the fact that only the upper edge 
is curved—then the method of exhaustion suggests the following approximation 
procedure using thin rectangles. 

Let n be a positive integer and divide the interval [a, b ] into n equal subinter¬ 
vals. Using each subinterval as a base, construct the tallest rectangle that lies en¬ 
tirely under the graph. Write down the sum s n of the areas of all these thin rec¬ 
tangles. This sum approximates the area under the graph, and the approximation 
is improved by taking larger values of n, or equivalently, by dividing [a, b ] into 
a larger number of smaller subintervals. Finally, calculate the exact area under 
the graph by finding the limiting value approached by the approximating sums 
s n as n approaches infinity: 

area of region = lim s n . (1) 

n — 

The effect of this procedure is suggested in Fig. 6.12, showing a larger and larger 
number of thinner and thinner rectangles. 

We now describe this idea with greater precision by introducing some suitable 
notation. 

Again, let n be a positive integer and divide the interval [a, b] into n equal 
subintervals by inserting n — 1 equally spaced points of division jq, * 2 , . . . , x n -\ 
between a and b. If we denote a by x 0 and b by x n , then the endpoints of these 
subintervals are 


a - x o < < * 2 < • ‘ < x n -\ < x n = b, (2) 

and the subintervals themselves are 

[* 0 , *i], [Jfi, * 2 ], • ■ ■ . [Xn-U X n ], (3) 

as shown in Fig. 6.13. We denote the length of the kth subinterval by Ax*, so 

Ax k = x k - x k - h (4) 

Since the subintervals are equal in length, it is clear that Ax* = {b — a)/n. Let 


m k denote the minimum value of f(x) on the kth subinterval [x k -\, x k \ Then this 
minimum value is assumed at some point x k in the subinterval: 

/(**) = m b x k -i <x k <x k . 

For the particular curve shown in Fig. 6.13, x k is easily seen to be the left end¬ 
point of the subinterval when the curve is rising and the right endpoint when it is 
falling. Since the area of each inscribed rectangle is the product of its height and 
its base, the approximating sum s n of the areas of all these rectangles is clearly 
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Figure 6.13 


■In =/Ul) Ajt, + f(x 2 ) A *2 + ' ' ' + /(**) Ax* + • • • +/(*„) tXx n . 

If we use the sigma notation to abbreviate this sum, we get 

n 

f(x k ) Ax*, (5) 

k— l 

and (1) becomes 

n 

area of region = lim Y /(3c*) Ax k . (6) 

k= j 

This formula is all right as far as it goes, but from several points of view it is in¬ 
convenient and unduly restrictive. We broaden its scope and deepen its meaning 
in a series of remarks. 

Remark 1 It is not necessary that the subintervals (3) must be equal in length. 
In fact, the underlying theory is greatly simplified if this restriction is removed. 
We therefore allow the subintervals (3) to be equal or unequal in length, so that 
the increments (4) may be different from one another. In formula (6), it is now 
no longer enough to require that n approaches infinity; we must also require that 
the length of the longest subinterval approaches zero. Since the latter condition 
includes the former, we replace (6) by 

n 

area of region = lim Y /(3c*) Ax*, (7) 

max 

where max A x k denotes the length of the longest subinterval. 

Remark 2 The sum (5) is called a lower sum because it uses inscribed rectan¬ 
gles and approximates the area of the region from below. We can also approxi¬ 
mate the area from above, as follows. Roughly speaking, we now use each subin¬ 
terval as a base, as before, but this time we construct the shortest rectangle whose 
top lies entirely above the curve. 

To express this in symbols, let M k denote the maximum value of f(x) on the 
kth subinterval [x*_i, x*]. As before, this maximum value is assumed at some 
point Xk in the subinterval: 

ffak) Xk— l — x k — Xk . 


Using lower sums. 
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Figure 6.14 Using upper sums. 



The sum of the areas of the circumscribed rectangles is therefore 

S„ = X /(**) ^ (8) 

k= 1 

This is called an upper sum because it approximates the area of the region from 
above, as shown in Fig. 6.14. Geometric intuition tells us that the area of our re¬ 
gion can just as well be obtained as the limit of upper sums, so we have 

n 

area of region = lim V f(x*) Ax*. (9) 

max Ax*.—>0 ,—' 
k k= 1 

However, entirely apart from intuition—which is sometimes misleading—it can 
be rigorously proved as a theorem of pure mathematics that the limits in (7) and 
(9) both exist and have the same value for any continuous function. 1 ’ 

Further, if x% is taken to be any point in the kth subinterval [x*_!, x *], then we 
clearly have 

n 

Sn ^ X /(**) Ax* - S n- 
k= 1 

It therefore follows from the theorem just stated that both (7) and (9) can be re¬ 
placed by the formula 

n 

area of region = lim V /(x*) Ax*, (10) 

max Ax*—>0 ^ 

where the only restriction placed on x\ is that x*_ i < x\ < x*. 

Remark 3 The limit in (10)—or in (7) or (9) — is symbolized by the standard 
Leibniz notation 



( 11 ) 


f The details of this proof are not appropriate for an introductory course in calculus, where most stu¬ 
dents already have quite enough to think about. However, for the sake of the few exceptionally skep¬ 
tical and tenacious students who might be interested in pursuing the matter, these details are given 
in Appendix A.5. 
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which is read (as we said in Section 6.2) “the definite integral from a to b of 
/(x) dx?' If we write down the definition of (11), 

J f^ 

f(x)dx= lim 7/(4) Ax*, (12) 

« max Aj:*— >0 f— 

k= 1 

then every part of the symbol on the left side is intended to remind us of the cor¬ 
responding part of the approximating sum on the right side. The integral sign f 
is an elongated letter S, as in “sum,” chosen because of the similarity between a 
definite integral and a sum of small quantities; the passage to the limit in (12) is 
suggested by replacing the letter 2 by the symbol /. Also, the usual symbol A 
for an increment is replaced by the letter d to remind us of this limit operation, 
just as in the Leibniz notation dy/dx for the derivative. Thus, with the passage to 
the limit in (12), 

becomes 

becomes /(x), 
becomes dx . 

The numbers a and b attached to the integral sign are called the lower and up¬ 
per limits of integration . t Limits of integration are always present in a definite 
integral, and help distinguish it from the similar-appearing but very different in¬ 
definite integral 

J/M dx. 

The function/(x) in (11) is called the integrand —the thing being integrated— 
and the variable x is the variable of integration. The role of the dx as an impor¬ 
tant intuitive component of definite integrals will become much clearer in the 
next chapter. 


I 

k= 1 

/WE) 

Ax* 


Remark 4 In our discussion so far, we have adopted the naive but reasonable 
attitude that the area of the region under the graph clearly exists, and that all we 
have to do is devise a method for computing it. However, the following exam¬ 
ple shows that the situation is more complicated than this. 

Consider the function /(x) defined on [0, 1] by 

JO if x is rational, #### 

J [l if x is irrational. 

The graph is suggested in Fig. 6.15, and the very discontinuous nature of this 

function is shown by the fact that at least one irrational number lies between 

every pair of rationals and at least one rational number lies between every pair 

of irrationals. What is the area of the region under this graph? It is quite easy to ) tM< 

see that every lower sum is 0 and every upper sum is 1, so the area calculated 

by (7) is 0 and the area calculated by (9) is 1. Also, the limit on the right of (12) Figure 6.15 


f Here the word “limit” has nothing to do with the limit concepts that are the basis of calculus. It is 
used in its loose, everyday sense, meaning “border” or “boundary.” The limits of integration tell us 
where the integration begins and where it ends; they specify the left and right endpoints of the in¬ 
terval over which the integration is carried out, and remind us of k = 1 and k = n in the approxi¬ 
mating sum. 
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NOTE ON RIEMANN 

No great mind of the past has exerted a 
deeper influence on the mathematics of the modem era than 
Bernhard Riemann (1826-1 866), the son of a poor country 
minister in northern Germany. This influence was especially 
profound in geometry, number theory, and the theory of 
functions. 

From 1846 to 1851 he studied at the universities of Berlin 
and Gottingen. In 1851 he obtained his doctorate at Gottin¬ 
gen under Carl Friedrich Gauss—at that time universally 
considered to be the greatest mathematician in the world— 
with his celebrated dissertation, “Foundations for a General 
Theory of Functions of a Complex Variable.” In this path¬ 
breaking work he set forth an approach that is now familiar 
to all students of this subject, based on general principles 
and geometric ideas rather than formulas and calculations. 

In number theory his only published work was a brief but 
exceedingly prof ound paper of less than 10 pages devoted 
to the distribution of the prime numbers. This mighty effort 
started tidal waves in several branches of pure mathematics, 
and its influence will probably still be felt a thousand years 
from now. Every advanced course on number theory given 
at any university in the world today is saturated with Rie- 
mann’s ideas. 

In 1854 he was appointed Privatdozent (unpaid lecturer) 
at Gottingen, which at that time was the necessary first step 
on the academic ladder. Before he could be appointed, how¬ 
ever, he was required to present a trial lecture to the faculty. 
It was the custom for the candidate to offer three titles, and 
the head of his department usually accepted the first. Rie¬ 
mann rashly listed as his third topic the foundations of geom¬ 
etry, a subject on which he was unprepared but which Gauss 
had been turning over in his mind for 60 years. Naturally, 


Gauss was curious to see how this particular candidate would 
cope with such a challenge, and to Riemann’s dismay he 
designated this as the subject of the lecture. Riemann quickly 
tore himself away from his other interests at the time—“my 
investigations of the connection between electricity, mag¬ 
netism, light, and gravitation”—and wrote his lecture in the 
next 2 months. The result was one of the great classical mas¬ 
terpieces of mathematics, and probably the most important 
scientific lecture ever given. In it he greatly extended Gauss’s 
own investigations of 30 years earlier and created a vast gen¬ 
eralization of all known geometries which is now known as 
Riemannian Geometry . Some 50 years later these ideas 
turned out to be indispensable tools for Albert Einstein in 
his creation of the General Theory of Relativity. It is 
recorded that even Gauss—who had seen everything, math¬ 
ematically speaking, had thought of most of it himself, and 
was almost impossible to impress—was surprised and en¬ 
thusiastic. 

We have merely scratched the surface of Riemann’s great 
influence on the history of mathematics. In view of this 
it is quite surprising that his collected works fill only one 
average-sized volume. However, his writings were brief and 
powerful and pregnant with meaning for future generations, 
not routine publications of routine research that is dead the 
moment the printer’s ink dries. 

His short life was plagued by ill health aggravated by the 
abominable climate of northern Germany. He died of tuber¬ 
culosis in Italy at the age of only 39, trying to escape the 
cold and rain and ice of Gottingen. His gravestone was in¬ 
corporated into the wall of a village cemetery in Italy, and 
in 1906 his remains could no longer be found. However, he 
lives almost everywhere in modem mathematics. 



does not exist. Does the concept of area have any meaning in a situation like 
this? 

This bizarre example suggests the following indirect but more logical approach 
to the problem of area. If we are given a bounded nonnegative function f(x) de¬ 
fined but not necessarily continuous on [a, b], we begin by examining the limit 
on the right of (12). If this limit exists, then we define its value to be the area 
of the region under the graph, and we say that the function f(x) is integrable on 
[a, b\ And if this limit does not exist, then it is meaningless to speak of the area 
of the region. Almost all the functions we encounter in practice are continuous, 
and the theorem stated in Remark 2 guarantees that every continuous function is 
integrable, so these logical fine points will have little practical significance for 
most of our work. Nevertheless, these issues are interesting and important from 
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the point of view of the theory of calculus, and students should be aware of them 
even though we choose not to emphasize them. 

The definite integral which is defined here is often called the Riemann inte¬ 
gral , in honor of the nineteenth-century German mathematician who was the first 
to give a careful discussion of integrals of discontinuous functions. Also, the sums 
on the right of (12) are often called Riemann sums. 


The concepts discussed in Section 6.4 suggest an actual procedure for calculat¬ 
ing areas. We now examine how this procedure works in a few specific cases. 

Example 1 Consider the function y = f(x) = x on the interval [0, b\ The region 
under this graph (Fig. 6.16) is a triangle with height b and base b , so its area is 
obviously b 2 /2. However, it is of some interest to verify that our limit process 
gives the same result, but more important, to understand how the limit process 
gives this result. 

Let n be a large positive integer and divide the interval [0, b] into n equal 
subintervals by means of n — 1 equally spaced points 


6.5 

THE COMPUTATION 
OF AREAS AS LIMITS 




(« ~ 1)6 


( 1 ) 


The bases of the rectangles are A** = bln , and if we use upper sums as shown 
in Fig. 6.16, then the heights of the rectangles are 


f(M) = f(x 2 ) = ^ 


nb 

f(x n ) = —, 
n 


and we have 


=4o +2 + 

n l 






+ n). 


By using formula (2) in Section 6.3, we can write this as 



n 


= b 


1 


Figure 6.16 
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Z> 2 n(n + 1) b 2 n n + 1 b 2 ( 1 \ 

Sn n 2 * 2 2 ' n ' n 2 [ n[ 

We therefore conclude that 

b 2 ( 1 \ b 2 

area of region = lim S n = lim — 1 H— = — 
n-^oo 2 \ n J 2 

which we knew at the beginning. In the notation of definite integrals, this result 
is 

Cb b 2 

L xdx = T' (2) 

In this example we chose to use equal subintervals and upper sums. There was 
no compulsion to make these choices; our motive was only to make the calcula¬ 
tions as easy as possible. 


Example 2 Now consider the function y = f(x) = x 2 on the interval [0, b], as 
shown in Fig. 6.17. Let n be a large positive integer and again divide the inter¬ 
val [0, b] into n equal subintervals of length Ax* = bln by using the points of di¬ 
vision (1). We again use upper sums S n , so the heights of the successive rectan¬ 
gles are easily seen to be 




f(x n ) = 


nbV 


and we have 




nb\ 2 ( b 


= 4(l 2 + 2 2 + • • • + n 2 ). 
n 5 


This time we use formula (3) in Section 6.3 to write 



Figure 6.17 


X n - 1 x n b 
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5 


n ~ 


b 3 


n(n + 1)(2 n 4- 1) 
6 


6 3 n n + 1 2 n + 1 

6 n n n 




As ft 


oo this clearly yields 

area of region = lim S n 

n—>°° 


3 ’ 


or equivalently, 



(3) 


This calculation produces a result which we did not know at the beginning. 


In Problem 1 we ask students to show in the same way that 

r 

Jo 


fb 3 , b 4 
Jr 3 dx = — 
4 


It is natural to conjecture from (2), (3), and (4) that the formula 

b n+1 


f b , b 
x n dx = — 
Jo n 


+ 1 


(4) 


(5) 


is probably true for all positive integers n = 1, 2, 3, ... . The validity of (5) was 
established for the cases n = 3, 4, . . . , 9 by the Italian mathematician Cavalieri 
in 1635 and 1647, but his laborious geometric methods bogged down at n = 10. 
A few years later Fermat discovered a beautiful argument that proves (5) at one 
stroke for all positive integers. This argument is somewhat aside from our main 
purpose here; it can be found in Section B.5 of the book Calculus Gems men¬ 
tioned earlier. 


Example 3 Next, we find the area under the cosine curve y = cos x from x = 0 
to x = b, where 0 < b < 7t/2 (Fig. 6.18). Again we let ft be a large positive in¬ 
teger and divide the interval [0, b] into n equal subintervals of length Ax* = bln 
by using the points of division (1). This time we use lower sums s m and since 
the function is decreasing, the points are the right endpoints of the subinter¬ 
vals. The heights of the successive rectangles are therefore 

b 2b nb 

COS —, cos —, . . ., cos —, 

ft ft ft 

and we have 



Figure 6.18 





cos fB r 



kb 


cos —. 

ft 


To calculate the limit of this as n 
Section 6.3, 


^ cos kx = 

k=\ 


—> oo, we use the formula of Problem 9 in 

sin \nx cos j(n + l)x 
sin jx ’ 


with x = bin. We therefore have 










206 


DEFINITE INTEGRALS 


area of region = lim s n 


= lim — ■ 

n— n 


1 1 

{n + 1 )b 

sin 2 b cos 

2 n 


sin (b/2n) 

To calculate this limit we begin by observing that 


(6) 


(n + 1 )b 
In 


= cos| 1 + i)f 


b 

cos- 


since the cosine is continuous. Next, if we put 6 = b/2n, then 6 —> 0 as n —> «>, 
and by using the limit (3) in Section 2.5 we see that 


1 


■ = 2 ■ 


b/2n 


n sin {b/2n) sin (b/2n) 

These facts enable us to write (6) as 


= 2 ■ 


e 


sin 6 


-> 2 


or equivalently, 


area of region = lim s n = 2 sin cos = sin 6, 

n— >oo 2 2 


cos x dx = sin 6. 


PROBLEMS 


Use upper sums to show that the area under the graph of 


, = v3 


over the interval [0, b] is b 4 / 4. 


Find the area under the graph of y = x over the interval 
[0, b] by using lower sums instead of the upper sums of 
Example 1. 

Find the area under the graph of y = x 2 over the interval 
[0, b] by using lower sums instead of the upper sums of 
Example 2. 

Solve Problem 1 by using lower sums instead of upper 
sums. 

As we know, every parabola with vertex at the origin 
which opens upward has an equation of the form y = ax 2 . 


It is easy to see from Example 2 that 

L 


2 j 1,3 

. ax z ax = a —. 
to 3 


Use this to prove the theorem of Archimedes stated in Sec¬ 
tion 6.2 for the special case in which the chord AB is per¬ 
pendicular to the axis of the parabola. 

Find the area under the curve y = sin x from jc = 0 to jc = 
b, where 0 < b < tt. Hint: Use equal subintervals, take the 
points x\ to be the right endpoints of the subintervals, and 
apply Problem 10 in Section 6.3. 


6.6 


THE FUNDAMENTAL 
THEOREM OF 
CALCULUS 


As our main achievement so far in this chapter, we have formulated a rather com¬ 
plicated definition of the definite integral of a continuous function as the limit 
of approximating sums, 

r /W ^ = ma^o2/« ) ^- (1) 

We have also considered several examples of the use of this definition in calcu¬ 
lating the values of certain simple integrals, such as 

b 2 


\ b xdx = ^~, \ 

Jo 2 Jo 


2 . b 3 . p 3 , b 4 

_ , . x l dx — —, and x* dx = —. 

2 Jo 3 Jo 4 


(2) 
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Figure 6.19 


These calculations have had two purposes: to emphasize the essential nature of 
the integral by giving students some direct experience with approximating sums, 
and also to suggest the severe limitations of this method as a practical tool for 
evaluating integrals. Thus, for example, how can we possibly use limits of sums 
to find the numerical values of such complicated integrals as 


fi x 4 dx 

" V7 +X 5 



(3) 


This is clearly out of the question, so where do we go from here? What is evi¬ 
dently needed is a much more efficient and powerful method of computing inte¬ 
grals, and we find this method in the ideas of Newton and Leibniz. 

The Newton-Leibniz approach to the problem of calculating the integral (1) 
depends on an idea that seems paradoxical at first sight. In order to solve this 
problem, we replace it by an apparently harder problem. Instead of asking for 
the fixed area on the left in Fig. 6.19, we ask for the variable area produced when 
the edge on the right side of the figure is considered to be moveable, so that the 
area is a function of x, as suggested on the right in Fig. 6.19. If this area func¬ 
tion is denoted by A(x), then clearly A(a) = 0 and A(b) is the fixed area on the 
left in the figure. Our aim is to find an explicit formula for A(x), and then to de¬ 
termine the desired fixed area by setting x = b. There are several steps in this 
process, which we consider separately for the sake of clarity. 


step 1 We begin by establishing the crucial fact that 


f=/(*)■ (4) 

This says that the rate of change of the area A with respect to x is equal to the 
length of the right edge of the region. To prove this statement, we must appeal 
to the definition of the derivative, 

dA A(x + Ax) - A(x) 

—r~ — lirn - t - . 

dx Ax->0 Ax 

Now A(x) is the area under the graph between a and x, and A(x + Ax) is the area 
between a and x + Ax. Hence the numerator A(x + Ax) — A(x) is the area be¬ 
tween x and x + Ax (see the shaded region in Fig. 6.20). It is easy to see that 
this area is exactly equal to the area of a rectangle with the same base whose 
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height is fix), where x is a suitably chosen point between jc and t + Ax* This 
enables us to complete the proof of (4) as follows: 

dA . A (x + Ax) - A (x) f(x) Ax 

— = lim -:-= lim 1 —:- 

dx At—>0 Ax Ax—>0 Ax 


= lim fix) = fix), 
At-»0 


since fix) is continuous. To explain the last step here in a bit more detail, we 
point out that Ax —> 0 is equivalent to x + Ax —> t; since x is caught between jc 
and x + Ax, we also have x —» x, and the continuity of the function now yields 
the conclusion that f(x) —> fix). 


step 2 Equation (4) makes it possible for us to achieve our goal of finding a 
formula for the area function A(x). The reasoning goes this way. By (4), A(x) is 
one of the antiderivatives of fix). But if Fix) is any antiderivative of fix), then 
we know from Chapter 5 that 

AOx) = F{x) + c (5) 

for some value of the constant c. To determine c, we put jc = a in (5) and obtain 
A(a) = F(a) + c; but since A (a) = 0, this yields c = —F(a). Therefore 

A(x) = F{x) - Fia) (6) 

is the desired formula. 


step 3 All that remains is to observe that 

j b fix) dx = Aib) = Fib) - Fia), 

Ja 

by (6) and the meaning of A(x). 

We summarize our conclusions by formally stating the Fundamental Theorem 
of Calculus: 


If fix) is continuous on a closed interval [a, b], and if Fix) is any antiderivative of fix), 
so that id/dx) Fix) = fix) or equivalently 

I f(x) dx = Fix), (7) 

then 

[fix) dx= Fib) - Fia). (8) 

This theorem transforms the difficult problem of evaluating definite integrals by 
calculating limits of sums into the much easier problem of finding antideriva¬ 
tives. To find the value of fix) dx, we therefore no longer have to think about 
sums at all; we merely find an antiderivative F{x) in any way we can—by in- 


*When this statement is expressed in formal language, it is called the First Mean Value Theorem of 
Integral Calculus. Loosely speaking, if the top of the rectangle is at just the right level, then the part 
of the area protruding above it exactly balances the deficiency below it. 
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spection, routine calculation, ingenious calculation, or looking it up in a book— 
and then compute the number F(b ) - F(a). 

For instance, in Section 6.5 we used a good deal of algebraic ingenuity to ob¬ 
tain the formulas (2). Now, with the aid of the Fundamental Theorem, we see 
these formulas as obvious consequences of the following simple facts: 



More generally, for any exponent n>0we clearly have 


rb b n+l 

x n dx = 


n n+ 1 


n + 1 n + 1 ’ 


because 



x n+1 

n + r 


Remark 1 In the process of working problems, it is often convenient to use the 
bracket symbol , 


F(x) 


= F{b) - F(a), 


(9) 


which is read “F(x) bracket a , 6 .” This symbol means exactly what (9) says it 
does: To find its value, we write the value of F(x) when jc has the upper value b, 
and subtract the value of F(x) when t has the lower value a. For example, 
x 2 ] 3 = 4 2 - 3 2 = 16 - 9 = 7. By using this notation, ( 8 ) can be written in the 
form 


f(x) dx = F(x) 


Remark 2 It should be clear from this discussion that any antiderivative of f(x) 
will do in ( 8 ). In case students are in doubt about this, they should recall that if 
F(x) is one antiderivative, then any other can be obtained by adding a suitable 
constant c to form F(x) + c; and since 


F(x) + c 


b 

a 


= [ F(b) + c] ~ [F(m) + c] = F{b) - F(a), 


the constant c has no effect on the result. We may therefore ignore constants of 
integration when finding antiderivatives for the purpose of computing definite in¬ 
tegrals. (Nevertheless, these constants of integration remain indispensable when 
we are working with differential equations, as we saw in Section 5.4.) 


Example 1 Evaluate each of the following definite integrals: 

f2 „ f 16 dx f27 3/ - f 14 

(a) J_ j x 4 dx\ (b) J 1 —; (c) J g Vx dx\ (d) (x - 13) 10 dx. 


Solution In each case an antiderivative is easy to find by inspection: 


<»I 


dx = -x i 


1 33 

= 5 I 32 -(-!)]=-; 


16 4 = 2V^ 


16 


= 2(4 - 1) = 6 ; 
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The Fundamental Theorem establishes a strong connection between definite 
integrals and antiderivatives. This connection has made it customary to use the 
integral sign to denote an antiderivative, as in (7), and to replace the word “an¬ 
tiderivative” by the term “indefinite integral.” The reader is familiar with these 
usages from Chapter 5. From this point on we will often drop the adjective (in¬ 
definite, definite) and use the word “integral” alone to refer to either the func¬ 
tion (7) or the number ( 8 ), relying on the context and the reader’s understanding 
of what is going on in order to avoid confusion. As an infallible aid in keeping 
track of which is which, we emphasize that a definite integral always has limits 
of integration attached to it, and that an indefinite integral never has such limits. 

We apologize again for any confusion that may be caused by using such sim¬ 
ilar notations, //(*) dx and f(x) dx, for such very different concepts. How¬ 
ever, these notations have been with us for 300 years, and trying to change them 
now would be as futile as asking the wind not to blow. [Some years ago, one au¬ 
thor tried to introduce the notation A [/(*)] for the antiderivative to replace ff(x) 
dx. His book disappeared from view faster than yesterday’s newspaper.] Rather, 
it is the responsibility of students to read the symbols //(*) dx and J£ /(;c) dx 
carefully. We all read words carefully, and distinguish between such similar-ap¬ 
pearing words as “peak” and “peek,” “venal” and “venial,” “manor” and “man¬ 
ner.” Mathematics must be read with even more care. 

From our experience in Chapter 5, we know—or can calculate—many in¬ 
definite integrals, and therefore many definite integrals are now within our reach. 
In particular, the definite integrals (3) are not at all difficult to compute, as we 
now show. 

Example 2 Evaluate 


A i 



Solution For the sake of clarity, we consider separately the problem of finding 
the indefinite integral. The substitution 


u = 7 + x 5 , du = 5x 4 dx 


yields 




By the Fundamental Theorem we therefore have 
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n x 4 dx 3 

£ WT7'mJ < 7 + j, ) 


' = — (4 — 7 2/3 ) = — (4 - ^49). 

o 1(P ’ 1(P ; 


Example 3 Evaluate 



+ 


J_ V dx_ 

X] X 1 ' 


Solution Here we have 


so 



du = 


dx 

jc 2 ’ 



The Fundamental Theorem now yields 


fK)‘ 




i + i v 

JC 


1 / 243 
” 5 \ 32 


32 


781 

160' 


Example 4 Find the area under the curve y = cos x from x = 0 to x = b, where 
0 < * < 7 t/2. 

Solution This area (see Fig. 6.18) is given by the definite integral 

f b 

cos jc dx. 

Jo 

But the indefinite integral of cos x is familiar to us, 

cos jc dx = sin jc, ' 


so we immediately have 

b 

= sin b — sin 0 = sin b. 
o 


cos x dx — sin x 


A comparison with Example 3 in Section 6.5 demonstrates the power of the Fun¬ 
damental Theorem with particular clarity. The calculation in Example 3 was dif¬ 
ficult and depended on an obscure trigonometric identity, whereas the calcula¬ 
tion here is very easy indeed—but only because the Fundamental Theorem is 
available to us and we know a little about indefinite integrals. 


Remark 3 Newton and Leibniz are commonly credited with discovering calcu¬ 
lus at about the same time but independently of each other. Yet the concepts of 
the derivative as the slope of the tangent, and the definite integral as the area un- 
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der a curve, were familiar to many thinkers who preceded them. Under these cir¬ 
cumstances, why are Newton and Leibniz given the lion’s share of the credit for 
creating this new branch of mathematics, which played such a central role in the 
rise of science as the dominant feature of Western civilization? Mostly because 
they were the principal discoverers of the Fundamental Theorem of Calculus. 
They, and they alone, understood its importance and began to construct the nec¬ 
essary supporting machinery, and also applied it with spectacular success to prob¬ 
lems in science and geometry. 

Nevertheless, historians of science have traced the roots of the Fundamental 
Theorem back to hints in the earlier geometric work of Barrow and Pascal, whose 
writings are known to have influenced Newton and Leibniz. As Newton said in 
one of his rare moments of self-deprecation—he was not a modest man—“If I 
have seen farther, it is by standing on the shoulders of giants.” One of these gi¬ 
ants was Fermat, who was the first to prove the area formula stated in Fig. 6.21. 
This suggests—as we look back with 20-20 hindsight—that he must therefore 
have known the Fundamental Theorem itself, which seems such a short step away. 
But unfortunately he failed to notice it. 

The Fundamental Theorem of Calculus is unquestionably one of the greatest 
achievements of the human mind. It is also one of the most influential, when we 
consider how much of the subsequent development of mathematics and the phys¬ 
ical sciences depends upon it. Before it was discovered, from the time of 
Archimedes in the third century B.C. to the time of Fermat in the middle of the 
seventeenth century, problems of finding areas, volumes, and lengths of curves 
were so difficult that only people of genius could hope to solve them, and there 
are very few of these in any generation. But now, equipped with the great arse¬ 
nal of systematic methods that Newton and Leibniz and their followers built on 
the foundation of the Fundamental Theorem, we will see in the following chap¬ 
ters that these problems are open to all of us. 


PROBLEMS 

A sketch is a necessary part of the solution of almost any prob¬ 
lem involving a geometric quantity, and students should form 
the habit of drawing one as a matter of routine. If drawn with 
reasonable (but not excessive) care, such a sketch can help us 
avoid errors by reminding us of what we are doing, and of¬ 
ten acts as a valuable source of ideas. 

1 Use integration to find the area of the triangle bounded 
by the line y = 2x, the x-axis, and the line x = 3. Check 
your answer by elementary geometry. 

2 Use integration to find the area of the triangle bounded 
by the axes and the line 3x + 2y = 6. Check your an¬ 
swer by elementary geometry. 

In Problems 3-5, find the area between each parabola and the 
x-axis. 

3 x 2 + y = 4. 

4 4x 2 + 9 y = 36. 

5 4x 2 + 12y = 24x. 


In Problems 6-10, each curve has one arch above the x-axis. 
Find the area of the region under the arch. 

6 y = -x 3 + 4x. 

7 y = x 3 — 9x. 

8 y = 2x 2 — x 3 . 

9 y = x 4 - 6x 2 + 8. 

10 y = x 3 - 5x 2 + 2x + 8. 

In Problems 11-21, find the area bounded by the given curve, 
the x-axis, and the given vertical lines. 

11 y = x 2 , x = — 2 and x = 3. 

12 y = x 3 , x = 0 and x = 2. 

13 y = 3x 2 + x + 2, x = 1 and x = 2. 

14 y = x 2 - 3x, x = -3 and x = - 1. 

15 y = 2x + x = 1 and x = 3. 

x^ 

16 y = — 1 x = 1 and x = 6. 

Vx + 3 
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17 y = 3x 2 + 2, x = 0 and x = 3. 

18 y = 2x + 3, x = 0 and x = 3. 
y = V2x + 3, x = - \ and x = 3. 

1 

y = ■ 


19 

20 


21 


T = 


V2x + 3 

1 


(2x + 3) : 

22 If n is positive, then 


x = — 1 and x = 3. 
2 ‘, x = — 1 and x = 3. 


f x n dx = — 
J-\ n 


71+ 1 


+ 1 


Why is this calculation incorrect if n is a negative num¬ 
ber ± -1? 

In Problems 23-42, find the value of each definite integral. 

2/3 dx 


23 


V3jc + 2 ' 

r o 

25 J ^ 7x 6 dx. 


24 


26 


[' (2x + 3) dx. 
J Vx dx. 


27 P V4^+ 1 dx. 
Jo 

x dv 


29 


r 

J2a 


28 


30 


r, (*+o 2 

f2fc x dx 


dx. 


] 2a (X 2 - 2 ) 2 ’ Jo Vx 2 + b 2 ’ 

31 (x — x 2 ) dx. 32 (1 + x)(2 - x) dx. 

33 j° ( a 2 x — x 3 ) dx. 34 Jf^ (x + 1 ) 9 dx. 

35 jo (Vb - Vx) 2 dx. 36 ^ x 2 (l — x 2 ) dx. 


37 f o x 2 (l -x) 2 dx. 38 + 

« f tt/4- r 

39 sin 4x dx. 40 

Jo Jt 

Ctt/2 

41 (2 sin 0 + cos 0) <70. 


dx. 


. 7T/4 

40 sin r dr. 

tt/6 


42 r 3 ^d0. 

Jo COS 2 0 


ALGEBRAIC AND GEOMETRIC AREAS 

In the previous sections we considered the area of the region under the curve y = 
f(x) between x = a and x = b, and two assumptions were more or less explicit: 
(1) /(x) > 0 throughout the interval, and (2) a < b. However, the formula defin¬ 
ing the definite integral as the limit of approximating sums, namely, 


6.7 

PROPERTIES OF 
DEFINITE INTEGRALS 



lim T 

max Ajc,+>0 

K k= 1 


f{x* k ) Ax*, 


( 1 ) 


is independent of these assumptions. 

For example, suppose that the curve lies below the x-axis, as shown on the left 
in Fig. 6.22. In this case we would hesitate to speak of the region “under the 
curve,” but we can certainly describe it as the region “bounded by the curve and 
the x-axis, between x = a and x = b .” Each term of the sum (1) is clearly nega¬ 
tive because f(x%) < 0. Accordingly,/(x*) Ax* is the negative of the area of the 
shaded rectangle, the integral is the negative of the area of the region, and con¬ 
sequently 


area of the region = - f f(x) dx. 

Ja 



Figure 6.22 
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Similarly, if the curve lies partly above the x-axis and partly below it, as shown 
on the right in Fig. 6.22, then the integral (1) can be thought of as a sum of pos¬ 
itive and negative terms, corresponding to parts of the region lying above and be¬ 
low the x-axis: 


f f{x) dx = A\ - A 2 + A 3 - A 4 , (2) 

Ja 

where the areas A\, A 2 , A 3 , A 4 are understood to be positive. The integral (2) is 
often called the algebraic area of the region bounded by the curve and the 
x-axis, because it counts areas of regions above the x-axis with a positive sign 
and areas of regions below the x-axis with a negative sign.* The actual area of 
the region bounded by the curve and the x-axis, with each part counted as a pos¬ 
itive number, is called the geometric area: 

A, +A 2 +A 3 + A 4 = £ - l + l - f*. (3) 

To find the geometric area, we must sketch the graph, locate the crossing points, 
and calculate each integral on the right of (3) separately so that they can be com¬ 
bined with the correct signs. 

MISCELLANEOUS PROPERTIES 

If we drop the condition a < b and instead assume that a > b, we can still re¬ 
tain the purely numerical definition (1) for the definite integral. The only change 
is that as we traverse the interval from a to b the increments Ax* are negative. 
This yields the equation 

f/W dx = -j“ f(x) dx, (4) 

which is valid for all numbers a and b (a =£ b). Also, since (4) says that inter¬ 
changing the limits of integration changes the sign of the integral, it is natural to 
take the equation 

fix) dx = 0 (5) 

Ja 

as the definition of the integral on the left. 

If a < b, and if c is any number between a and b, it is easy to see from (1) 
that 


P/(x) dx = p/(x) dx + I"fix) dx. (6) 

Ja Ja Jc 

Properties (4) and (5) allow us to conclude that (6) is true for any three numbers 
a , b, c, regardless of their relation to one another. 

We list several further properties of definite integrals that follow in a routine 
way from the definition (1): 


The discussion in Section 6.6 leading to the Fundamental Theorem of Calculus extends without es¬ 
sential change to integrals of this type. An alternative proof of the Fundamental Theorem, based on 
entirely different ideas, is given in Appendix A.6. 
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[ cf(x) dx = c f fix) dx\ (7) 

Ja Ja 

[ U{x) + g(x)] dx = j a fix) dx + j a g(x) dx\ (8) 

lifix) < g(x) on [a, b], then ( fix) dx < [ g{x) dx. (9) 

Ja Ja 

In words, property (7) says that a constant factor can be moved across the inte¬ 
gral sign, and (8) says that the integral of a sum is the sum of the separate inte¬ 
grals. 

VARIABLE LIMITS OF INTEGRATION 

We have used x as the “variable of integration” in writing the definite integral 

\ b f{x)dx. (10) 

Ja 

However, (10) is a fixed number whose value does not depend on which letter is 
used for this variable. Instead of (10), we could equally well write 

\ f{t) dt, [ f(u) du, 

Ja Ja 

or any similar expression, and the meaning would be the same. Letters used in 
this way are often called dummy variables. 

In most situations it doesn’t matter what letters are used, as long as the ideas 
are clearly understood. However, sometimes we wish to construct a new func¬ 
tion F(x) by integrating a given function/(x) from a fixed lower limit to a vari¬ 
able upper limit, as in 

F(x)= f X f(x)dx. (11) 

Ja 

It is evident that this usage can be confusing, because the letter x is used with 
two different meanings on the right: as the upper limit of integration above the 
integral sign, and as a dummy variable behind the integral sign. For this reason, 
it is customary to write (11) in the form 

F(x)= l X f(t)dt , (12) 

Ja 

with t used as the dummy variable in place of x. 

The function F(x) defined by (12) has two properties that make it important. 
First, it exists whenever the integrand is continuous on the interval between a 
and x. And second, we proved in Section 6.6 that the derivative of this function 
is simply the value of the integrand at the upper limit: 

" 3 > 

This provides a satisfactory theoretical solution of the problem of finding an in¬ 
definite integral for any given continuous function f(x). As a practical matter, it 
may be very difficult—or even impossible—to calculate 
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[ f(x) dx = F(x) 

in any recognizable form involving familiar functions. But even if we can’t find 
a formula for F(x), it is at least some consolation to know that in principle an in¬ 
definite integral of a continuous function always exists, namely, the function de¬ 
fined by (12). 


Example 1 The problem of finding an explicit formula for the indefinite inte¬ 
gral 

dx 


is beyond our reach now, and will always be beyond our reach. However, if we 
don’t require an explicit formula, but only a well-defined function, then 

cx dt 

F(x) = Jo 

will do. 


Example 2 Let us try to calculate 



At this stage of our work we have no way of carrying out the integration to find 
a formula for the function in parentheses so that this function can be differenti¬ 
ated. But this doesn’t matter. By (13) we immediately have 



so no preliminary integration is necessary before the differentiation can be car¬ 
ried out. 


PROBLEMS 

1 In each of the following cases, compute the geometric 
area of the region bounded by the jc-axis and the given 
curves: 

(a) y = 3x — x 1 2 , x = 1, x = 4; 

(b) y = x 2 - 2jc, x = 1, jc = 4; 

(c) y = 4 + 4jc 3 4 5 , x = -2, jc = 1 ; 

g 

(d) y = x - 7 , x = 1, jc = 4. 

x 

2 Find the a rea bo unded by the axes and the given curve: 

(a) y = V4 - jc; (b) V* + Vy = Vfl. 

3 Find the area bounded by y 2 = x 3 and x = 4. 

4 Find the area enclosed by the loop of y 2 = jc(jc - 4) 2 . 

5 If a < c < b and /(jc) > 0 on [a, b], draw a suitable pic¬ 


ture and explain why equation ( 6 ) is an obvious relation 
among areas. 

6 If /(jc) >0 on [a, b] and c > 0, draw a suitable picture 
and explain why equation (7) is an obvious statement 
about areas. Do the same for equations ( 8 ) and (9) if both 
/(jc) and g(;c) are nonnegative on [a, b]. 

1 If /(jc) is an even function, that is, if /(-jc) = /(jc), show 
geometrically or otherwise that 

j_ a fix) dx = 2 J fix) dx. 

8 Verify the equation in Problem 7 by calculating the fol¬ 
lowing integrals of even functions: 
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x 2 dx 


and 


f 19 

(\+x 24 )dx. 

J- 19 



- x 2 dx. 


9 If f(x) is an odd function, that is, if/(— x) = ~f(x), show 
geometrically or otherwise that 

f" f(x)dx = 0. 

J—a 

10 Verify the equation in Problem 9 by computing the fol¬ 
lowing integrals of odd functions: 

[2 c p x dx 

x 5 dx and r— -. 

J-2 i -7 VjC 2 + 1 1 

11 The graph of y = x 2 , x > 0, can be considered to be the 
graph of x = Vy, y > 0. Show by geometry that this im¬ 
plies the validity of the equation 

J Q x 2 dx + J Vy dy = a 3 , a > 0. 

Check this by calculating the integrals. 

12 Generalize Problem 11 by finding and checking a simi¬ 
lar equation fory = x n , where n is any positive number. 

13 Use the known area of a circle to find the value of the 
integral 


14 The graph of the equation 


£i + ii=, 

a 2 b 2 


a > b > 0, 


is called an ellipse. Sketch it, and use the result of Prob¬ 
lem 13 to find the enclosed area. 

15 Show that 
d 


(a) 


dx 


P7(0 dt = -/(*); 

Jx 


d f“60 du 

(b) ^l /«*-/<**»*■ 


16 In each of the following, compute the indicated deriva¬ 
tive: 

C5 


, x d p+2 dt 

T 

*> if; 

<«> if 


dt 

i + r 


<d) if 


r 3 

dt 

1 + r 4 ’ 


dt 


vTTvTTT 


CHAPTER 6 REVIEW: CONCEPTS, METHODS 


Define, state , or think through the following. 

1 Sigma notation. 

2 Special sums. 

3 Area under a curve; lower sums and upper sums. 

4 Definite integral as a limit of sums. 

5 Limits of integration, integrand, variable of integration. 


6 Integrable function. 

7 Every continuous function is integrable. 

8 Fundamental Theorem of Calculus. 

9 Indefinite integral. 

10 Algebraic and geometric areas. 

11 Variable limit of integration. 


ADDITIONAL PROBLEMS FOR CHAPTER 6 


SECTION 6.5 
1 Show that 


Hint: It will be necessary to use a variation of the idea 
behind the formula 


j o Vr dx = f b 312 


l-2 + 2- 3 + 3- 4 + "' + n(n + l) 


by taking x k = k 2 b/n 2 and x k = x k in formula (12) of Sec¬ 
tion 6.4. Notice that this problem illustrates the calcula¬ 
tion of an integral as a limit by using subintervals of dif¬ 
ferent lengths. 

*2 Show that 


by using equal subintervals and taking x\ = Vx*- \X k in 
formula (12) of Section 6.4. (Why is x k - X < x\ <x k l) 


*3 Show that 



+1 - —~ 
\n n -1-1 


= 1 - 


I 


n + r 


[* -L dx = 20 Vb - 

'* \Tx 


1 ) 


by using equal subintervals and taking 
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. _ **-i + Xk + V2x t -|X t _ ( Vjc/t-1 + Vxu \ 2 

JC * 4 \ 2 J 

in formula (12) of Section 6.4. Remember to check that 

Xk -1 ^ -4 ^ **■ 


SECTION 6.6 

4 Find the area between each parabola and the x-axis: 

(a) x 2 + 3y = 9; 

(b) 3x 2 + 4 y = 48; 

(c) x 2 + 4x + 2 y = 0. 

5 The part of the curve b 2 y = 4 h{bx — x 2 ) that lies above 
the x-axis forms a parabolic arch with height h and base 
b. Sketch the graph and check these statements. Use in¬ 
tegration to show that the area under this arch is two- 
thirds the area of the rectangle with the same height and 
base. 

6 Each curve has one arch above the x-axis. Find the area 
under this arch. 

(a) y = 10 — x — 2 x 2 . 

(b) y = -x 3 - 4x 2 ~ 4x. 

(c) y = x 3 + 2x 2 - 8x. 

(d) y = x 4 — 6x 2 + 9. 

(e) y = xV 1 ~ x. 

7 Find the area bounded by the given curve, the x-axis, and 
the given vertical lines: 

(a) y = x 2 + 2 x + 1 , x = - 1 and x = 1 ; 

(b) y = Vx + 2 , x = 2 and x = 7; 

(c) y = X'3 - x, x = -5 and x = 3; 

(d) y = xV5 - x 2 , x = 0 and x = V5; 

(e) y = ■ 2 X .o '» x = 0 and x = 3. 

(x z + 1 y 

8 Find the value of each definite integral: 

(a) £ x(x 2 + 2) 3 dx, (b) £ 3x 2 (3 + x 3 ) 2 dx; 

(c) £ x\/a 2 — x 2 dx\ (d) £ x'Va 2 + x 2 dx\ 


(e) f ^ (8 — 4x + x 2 ) dx; 

(f) £ 7 {2x~ m + 8^: l/3 ) dx; 

(g) £ V9 - 8x dx; 
p dx 

(h) h (3x — 5 ) 5/2 ’ 

fV3 x dx 

0 ) J» V^ ; 

(j) Jo V1 + x 3 x 2 dx ; 

(k) ([b 2/3 - x 2/3 ) 3 dx. 


SECTION 6.7 

9 In each of the following cases, compute the geometric 
area of the region bounded by the x-axis and the given 
curves: 

(a) y = 6 — 3x 2 , x = 0, x = 2; 

(b) y = x 2 + 2x, x = -3, x = 0; 

(c) y = x 2 - x - 2, x = 1, x = 3; 

(d) y = x 3 - 3x, x = —2, x = 3. 

10 In each of the following cases, compute both the alge¬ 
braic and geometric areas of the region bounded by the 
x-axis and the given curves: 

(a) y = 3x 5 — x 3 , x = — 1, x = 1; 

(b) y = (x 2 - 4)(9 — x 2 ). 

11 Compute 



dt 

i + r 

dt 

V3 1 + 7’ 




t dt 


VTT" 


12 Verify the results obtained in parts (c) and (d) of Prob¬ 
lem 11 by actually carrying out the integration and then 
differentiating. 


APPENDIX: 
THE LUNES OF 
HIPPOCRATES 


According to one tradition, Hippocrates of Chios (ca. 430 B.C.) — not to be con¬ 
fused with his better-known contemporary, the physician Hippocrates of Cos— 
was originally a merchant whose goods were stolen by pirates.* He then went to 
Athens, where he lived for many years, studied mathematics, and compiled a 
book on the elements of geometry that strongly influenced Euclid more than a 
century later. 


"Aristotle, who rarely missed a chance to express his scorn for mathematicians, gives a more de¬ 
meaning account of Hippocrates’ misfortune. “It is well known,” he wrote with relish, “that people 
brilliant in one particular field may be quite foolish in most other things. Thus Hippocrates, though 
skilled in geometry, was so stupid and spineless that he let a tax collector of Byzantium cheat him 
out of a fortune.” This, from the man who asserted that heavier bodies fall to the ground more rapidly 
and that men have more teeth than women. 
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We recall Hippocrates’ discovery as stated in Section 6.2: The lune (crescent¬ 
shaped region) in Fig. 6.23 bounded by the circular arcs ADB and AEB (the lat¬ 
ter having C as its center) has an area exactly equal to the area of the shaded 
square whose side is the radius of the circle. (Hippocrates also found the areas 
of two other kinds of lunes, but we do not discuss these here.) 

This astonishing theorem seems to be the earliest precise determination of the 
area of a region bounded by curves. Its proof is simple but ingenious and de¬ 
pends on the last of the following three geometric facts, each of which implies 
the next: (a) The areas of two circles are to each other as the squares of the radii 
(Fig. 6.24); (b) sectors of two circles with equal central angles are to each other 
as the squares of the radii (Fig. 6.25); (c) segments of two circles with equal 
central angles are to each other as the squares of the radii (Fig. 6.26). We shall 
need (c) in the special case of right angles at the center. 

The proof of Hippocrates’ theorem now proceeds as follows. Redraw the lune 
as shown in Fig. 6.27. The chords joining D with A and B are tangent to the arc 
AEB and divide the lune into three regions with areas a\, < 22 , a. 3 . If the radius of 
the smaller circle is denoted by r, then the Pythagorean theorem tells us that the 
radius of the larger circle is V2r. It is easy to see that a\ and <22 are equal seg¬ 
ments of the smaller circle and that <24 is a segment of the larger circle, all with 
right angles at the center. We now use statement (c) to infer that 

£1 _ = r 2 _ J_ 
a * (V2 r) 2 2 ‘ 

This yields 

ci\ = and #2 = 


so 


It now follows that 


CL\ + 0,2 ~ CI 4 . 


area of lune = a\ + ai + 

= a 4 + a 3 

= area of triangle ABD 
= r 2 = area of square OBFC, 

and the argument is complete. 

Hippocrates was a contemporary of Pericles, the great political and cultural 
leader of Athens during its Golden Age. But nothing Pericles achieved has the 
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Figure 6.23 
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Figure 6.27 


D 



enduring quality of this beautiful geometric discovery; even the Parthenon, whose 
design and construction he supervised, is crumbling away. The reasoning of Hip¬ 
pocrates is a paragon of mathematical proof, untouched by time: In a few ele¬ 
gant steps it converts something easy to understand but difficult to believe into 
something impossible to doubt. 
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APPLICATIONS 
OF INTEGRATION 


In Chapter 6 we accomplished two major purposes. First, we approximated the 
area under a given curve by certain sums and found the exact area by forming 
the limit of these sums. And second, we learned how to calculate the numerical 
value of this limit by using the much more powerful method provided by the 
Fundamental Theorem of Calculus. Almost the whole content of Chapter 6 can 
be compressed into the following statement: If f(x) is continuous on [a, b\ then 

lirn /(4) kx k = f f(x) dx 

max —>0 Ja 

* k= 1 

= F(x) J = F(b) - F{a), (1) 

where F(x) is any indefinite integral of f(x). 

There are many other quantities in geometry and physics that can be treated 
in essentially the same way. Among these are volumes, arc lengths, surface ar¬ 
eas, and such basic physical quantities as the work done by a variable force act¬ 
ing over a given distance. In each case the process is the same: An interval of 
the independent variable is divided into small subintervals, the quantity in ques¬ 
tion is approximated by certain corresponding sums, and the limit of these sums 
yields the exact value of the quantity in the form of a definite integral—which 
is then evaluated by means of the Fundamental Theorem. 

Once we have seen the details of this limit-of-sums process being carried out 
for the area under a curve, as was done in Chapter 6, it is unnecessary and bor¬ 
ing to think through these details over and over again for each new quantity that 
we meet. The notation needed for this is complicated and repetitive, and actually 
impedes the intuitive understanding that we wish to cultivate. 

In this spirit, we turn briefly to Fig. 7.1 and consider the easy, intuitive way 
of constructing the definite integral in (1). We think of the area under the curve 
as composed of a great many thin vertical rectangular strips. The typical strip 
shown in the figure has height y and width dx, and therefore area 

dA = y dx = f(x) dx, (2) 

since y = f(x). This area is called the differential element of area, or simply the 
element of area', it is located at an arbitrary position within the region, and this 
position is specified by a value of x between a and b. We now think of the total 


7.1 

INTRODUCTION. THE 
INTUITIVE MEANING 
OF INTEGRATION 



i i 
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a dx b 


Figure 7.1 
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area A of the region as the result of adding up these elements of area dA as our 
typical strip sweeps across the region. This act of addition or summation can be 
symbolized by writing 

A= jdA. (3) 

Since the element of area sweeps across the region as jc increases from a to b y 
we can express the idea in (3) with greater precision by writing 

A = JdA = fydx = f a fix) dx. (4) 

We reach a true definite integral only in the last step in (4), where the variable 
of integration and the limits of integration become visibly present. In this way 
we glide smoothly over the messy details and set up the definite integral for the 
area directly, without having to think about limits of sums at all. 

From this point of view, integration is the act of calculating the whole of a 
quantity by cutting it up into a great many convenient small pieces and then 
adding up these pieces. It is this intuitive Leibnizian approach to the process of 
integration that we intend to illustrate and reinforce in the following sections. 


7.2 

THE AREA BETWEEN 
TWO CURVES 



Suppose we are given two curves y = fix) and y = g(x), as shown in Fig. 7.2, 
with points of intersection at x = a and x = b and with the first curve lying above 
the second on the interval [a, b ]. In setting up an integral for the area between 
these curves, it is natural to use thin vertical strips as indicated. The height of 
such a strip is the distance /(;c) — g(x) from the lower curve to the upper at the 
position x, and its base is dx. The element of area is therefore 

dA = [f(x) - g(x)] dx , 

and the total area is 

A = f dA = £ [f(x) - gix)] dx. (1) 

We integrate from the smaller limit of integration a to the larger b so that the in¬ 
crement (or differential) dx will be positive. It should also be pointed out that a 
and b are the values of x for which the two functions yield the same y’s; that is, 
they are the solutions of the equation fix) = gix), and to find them we solve this 
equation. 

We urge students not to be satisfied with merely memorizing formula (1) and 
applying it mechanically to area problems. Our aim is the mastery of a method, 
and this aim is better served by thinking geometrically and constructing the 
needed formula from scratch for each individual problem. The method applies 
equally well to finding areas by using thin horizontal strips, which are often more 
convenient. In this case the width of a typical strip will be dy> and the total area 
will be found by integrating with respect to y. 

As an aid to students we give an outline of the steps that should be followed 
in finding an area by integration. 


step 1 Sketch the region whose area is to be found. Write down on the sketch 
the equations of the bounding curves and find their points of intersection. 
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step 2 Decide whether to use thin vertical strips that have width dx or thin hor¬ 
izontal strips that have width dy, and draw a typical strip on the sketch. 

step 3 By looking at the sketch and using the equations of the bounding curves, 
write down the area dA of the typical strip as the product of its length and its 
width. Express dA entirely in terms of the variable (Tory) appearing in the width. 

step 4 Integrate dA between appropriate t or y limits, these limits being found 
by examining the sketch. 


Example 1 The region bounded by the curves y = x 2 and y = 4 is shown in Fig. 
7.3. If we use vertical strips, then the length of our typical strip is 4 - x 2 and its 
area is dA = (4 - x 2 ) dx. The total area of the region is therefore 

C2 12 

(4 - x 2 ) dx = 4x - \x 3 
J~ 2 J-2 

= (8-f) - (-8+ f) = f. 

We urge students to use symmetry whenever possible in order to simplify the 
calculations. In this case the left-right symmetry of the figure suggests that we 
integrate only from t = 0 to t = 2 to find the right half of the area, and then 
double the result to obtain the total area: 


2 So (4 ~ x2 '> dx = 2 ( 4 * - T* 3 ) 


= 2(8 - f) = f. 



As this calculation shows, it is often an advantage (only a slight advantage in 
this case) to have 0 as one of the limits of integration. 

If we decide to use horizontal strips, then the length of the strip is the value 
of x (in terms of y) at the right end minus the value of t at the left end. This is 
Vy - (- Vy), so dA = [Vy — (—Vy)] dy = 2 Vy dy and the total area is 

fo 2Vydy = iy 3/2 



The answer is the same as before, which is not surprising but is nevertheless re¬ 
assuring. 

We emphasize once again how important a good sketch is for understanding 
and carrying out these procedures. 


Example 2 The region bounded by the curves y = 3 - x 2 and y = x + 1 is 
shown in Fig. 7.4. We find where the curves intersect by solving the equations 
simultaneously. We do this by equating the y’s, which gives 

3-x 2 = x + 
x 2 + x - 2 = 0, 

(x + 2)(x - 1) = 0, 

t= -2, 1. 

The points of intersection are thus (-2, —1) and (1, 2). The length of the indi¬ 
cated vertical strip is (3 — x 2 ) — (x + 1 ) = 2 — x 2 - x so the area of the region 
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is found by integrating the element of area dA = (2 - x 2 - x) dx as x goes from 
-2 to 1 , 

[ (2 — x 2 — x) dx = {2x — yx 3 - \x 2 ) 

J~ 2 " J—2 

= (2-T-i)-(-4 + f-2) = 4 

It is inconvenient to use horizontal strips in this problem, because a horizontal 
strip clearly reaches from the left half of the parabola to the line if y < 2 and 
from the left half of the parabola to the right half if y> 2 , and this means that 
different formulas for dA must be used according as y < 2 or y > 2. 


Example 3 Find the area of the region bounded by the curves y = cos x and y = 
sin 2x on the interval 0 < x < ttH. 



Solution The curves are shown in Fig. 7.5, and the region—consisting of two 
parts—is shaded. The main feature of this example is that the curves cross each 
other, so that first one curve, and then the other, is the “upper” curve. To deal 
with this, we must begin by finding exactly where the curves cross, which means 
we must solve the equation cos x = sin 2x. We do this by writing 

_ . i 77 

cos x = 2 sin x cos x , sin x = y, x = —. 

6 


Accordingly, 


dA = 


(cos x - sin 2x) dx for 0 < x < — 

6 


77 77 

(sin 2x — cos x) dx for — < x < — 
6 2 


The desired area is therefore 



f tt/2 

2x) dx + 

Jtt/6 


(sin 2x — cos x) dx 


= (sin x + y cos 2x) 


TT16 


+ (—y cos 2x — sin x) 


tt/2 

tt/6 


= (i + j ~ o -1) + (1 - 


+ - + -') = - 
' 4 ^ 2 / 2 - 


PROBLEMS 


In Problems 1-19, sketch the curves and find the areas of the 

*9 

y 

9 

= 2x H- t, y = — 2x 

regions they bound. 

10 


X 2 

1 

y = x 2 , y = 2x. 

X 

= y 2 , y = x+ 3,y = 

2 

y = x 2 , x = y 2 . 

11 

y 

= x 2 — 4x, y = 2x. 

3 

y = x 2 + 2, y = 4 - x 2 . 

12 

y 

= x 3 , y = 2x ~~ x 2 . 

4 

y = 4x 3 + 3x 2 + 2, y = 2. 

13 

X 

= y 2 , x = 2y + 3. 

5 

y = x 2 — 2x, y = 3. 

14 

X 

II 

1 „ K 
* 

II 

K> 

V 

6 

y = x 3 - 3x, y = x (x > 0). 

15 

y 

= x 2 + l, y = 3 - x 2 , 

7 

y = x 4 - 4x 2 , y = -4. 

16 

y 

= x 4 -4 ,y = 3x 2 . 

8 

y = x 3 - 4x, y = 5x (x > 0). 

17 

X 

II 

00 

1 

V 

II 

V 

K 

1 

oo 
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18 y = x 3 , x = y 3 . 

19 y = x 3 , y = 32V*. 

20 Find the area in Example 2 by integrating with respect 
to y, first with one integrand from y = — 1 to y = 2 , and 
then with another integrand from y = 2 to y = 3. 

21 Find in two ways the area under y = x 2 from * = 0 to 
* = 4. 

22 Find in two ways the area under y = x 3 from * = 0 to 
x = 2. 

23 Find the area bounded by 

(a) the x-axis and y = x 2 — x 3 ; 

(b) the y-axis and x = 2 y — y 2 . 

24 The area between x = y 2 and * = 4 is divided into two 
equal parts by the line x = a. Find a. 

*25 Find the area between y = x 3 and its tangent at * = 1. 

26 Find the area above the x-axis bounded by y = 1/x 2 , 
x = 1 , and x = b, where b is some number greater than 
1. The result will depend on b. What happens to this 
area as b —» °o? 

27 Solve Problem 26 with y = 1/x 2 replaced by y = 1 /Vx. 

28 Solve Problem 26 with y = 1/x 2 replaced by y = 1 lx p , 
where p is a fixed positive number greater than 1. What 
happens if p is a fixed positive number less than 1 ? 

29 In each case find the area of the region bounded b y the 
given curves over the stated interval: 

(a) y = sin x and y = cos x, 0 < x < 7 t/ 2 ; 

(b) y = sin x and y — cos 2x, 0 < x < 7 r/ 4 ; 

(c) y = x and y = sin x, — 77/4 < x < tt/2. 

30 Sketch the graph of y = sin 4 x cos x on the interval 
0 < x < 7 t/2 , and find the area of the region between 
the curve and the x-axis. 


31 A rectangle with sides parallel to the axes has one ver¬ 
tex at the origin and the opposite vertex on the curve 
y = ax n at the point where x = b {a > 0, n > 0, and 
b > 0). Show that the fraction of the area of the rec¬ 
tangle that lies below the curve depends on n but is in¬ 
dependent of a and b. 

32 By calculating the areas involved by integration, verify 
the theorem of Archimedes stated in Section 6.2 for the 
parabolic segment cut off from the parabola y = x 2 by 
the line y = —x 4-2. Hint: Begin by finding the points 
A, B, and C in Fig. 6.5. 

*33 The shaded region inside the square of side a shown in 
Fig. 7.6 consists of all points that are closer to the cen¬ 
ter of the square than to its boundary. Find the area of 
this region. 



Figure 7.6 


If the region under a curve y = /(x) between x = a and x = b is revolved about 
the x-axis, it generates a three-dimensional figure called a solid of revolution. 
The symmetrical shape of this solid makes its volume easy to compute. 

The situation is illustrated in Fig. 7.7. On the left we show the region itself, 
together with a typical thin vertical strip of thickness dx whose base lies on the 
x-axis. When the region is revolved about the x-axis, this strip generates a thin 
circular disk shaped like a coin, as shown on the right, with radius y =/(x) and 
thickness dx. The volume of this disk is our element of volume dV. Since the disk 
is a cylinder, its volume is clearly the area of the circular face times the thick¬ 
ness, 


7.3 

VOLUMES: THE 
DISK METHOD 


dV = Try 2 dx = 7r/(x) 2 dx. 


0 ) 


We now imagine that the solid of revolution is filled with a very large number 
of very thin disks like this, so that the total volume is the sum of all the elements 
of volume as our typical disk sweeps through the solid from left to right, that is, 
as x increases from a to b\ 


V = j dV = j Try 2 dx = j i r/(x) 2 dx. 


( 2 ) 
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Figure 7.7 



This is another fundamental formula that students should not memorize. Instead, 
it is much better to understand it so clearly that memorization is unnecessary. 

Some students may feel that formula (2) cannot give the exact volume of the 
solid, because it doesn’t take into account the volume of the small “peeling” 
around the outside of the disk in Fig. 7.7. However, just as in the calculation of 
areas, this slight apparent error visible in the figure—due to using disks instead 
of actual slices—disappears as a consequence of the limit process that is part of 
the meaning of the integral sign. We can therefore calculate volumes using for¬ 
mula (2) and have full confidence that our results will be exactly correct, not 
merely approximations. 



Example 1 A sphere can be thought of as the solid of revolution generated by 
revolving a semicircle about its diameter (Fig. 7.8). If the equation of the semi¬ 
circle is x 2 + y 2 = a 2 , y > 0, then y = V a 2 — x 2 and the element of volume is 

dV = iry 2 dx = 7 r(a 2 - x 2 ) dx. 


By using the left-right symmetry of the sphere, we can find its total volume by 
integrating dV from x = 0 to x = a and multiplying by 2: 


V = 2 j* o 7r(a 2 - x 2 ) dx = 2 7T(a 2 x - }jt 3 ) 


4 3 

= ymr. 


(3) 


This result confirms the well-known (but little-understood) formula from ele¬ 
mentary geometry. If we integrate dV only from x = a — h to x = a, we obtain 
the formula for the volume of a segment of a sphere of thickness h, 


V = 



- x 2 ) dx= ir{a 2 x - jjc 3 ) 


a 

a-h 


= TTlja 3 - [a 2 (a - h) - {(a - /z) 3 ]} 
= 7 rh 2 (a ~ }/z), 


after some algebraic simplification. It should be noticed that this formula reduces 
to (3) when h = 2a. 
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Example 2 Another important formula from elementary geometry states that a 
cone of height h and radius of base r has volume V = \irr 2 h\ or equivalently, the 
volume is one-third the volume of the circumscribed cylinder. To obtain this for¬ 
mula by integration, and thereby to understand the origin of the factor we think 
of the cone as the solid of revolution generated by revolving the right triangle 
shown in the first quadrant of Fig. 7.9 about its base on the x-axis. The hypotenuse 
of this triangle is clearly part of the straight line y = ( r/h)x , so the element of 
volume is 

dV = iry 2 dx = x 2 dx. 


We now obtain the total volume by integrating dV from x = 0 to x — h. 


[h 7rr 2 ~ , 7 Tr^ 

= Jo !f x dx= -* 


7T1 2 1 


= t7 rr 2 h. 


(4) 



Figure 7.9 


For obvious reasons, the method of these examples is usually called the disk 
method. The same idea can be applied to solids of other types, in which the el¬ 
ement of volume is not necessarily a circular disk. Suppose that each cross sec¬ 
tion of a solid made by a plane perpendicular to a certain line is a triangle or 
square or some other geometric figure whose area is easy to find. Then our ele¬ 
ment of volume dV is the product of this area and the thickness of a thin slice, 
and we can calculate the total volume of the solid by the method of moving slices 
as suggested in Fig. 7.10: 

dV = A(x) dx, V = j dV= j b A(x) dx. 


Example 3 A wedge is cut from the base of a cylinder of radius a by a plane 
passing through a diameter of the base and inclined at an angle of 45° to the 
base. To find the volume of this wedge, we first draw a careful sketch (Fig. 7.11). 
A slice perpendicular to the edge of the wedge, as shown, has a triangular face. 
By using the notation established in the figure, we see that the volume of this 
slice is 

dV = - y 2 ’ Va 2 — y 2 dy 

= j(a 2 - y 2 ) dy, 

so the volume of the wedge is 

v = 2 f" j(a 2 - y 2 ) dy = a 2 y - jy 3 

JO Jo 

= fa 3 - 

A vertical slice parallel to the edge of the wedge evidently has a rectangular face 
(students should draw their own sketches). If x is the distance from the edge to 
this slice, then with careful thought we can see that this time the element of vol¬ 
ume is 

dV = IxVa 2 - x 2 dx, 



Figure 7.10 



and therefore 


Figure 7. 


Sri 
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' = f a 2xV, 

JO 


a 2 - x 2 dx 


= -j(a 2 - x 2 ) 3 ' 2 


= ln* 


as before. 


Remark 1 The following minor variation of the disk method is often useful and 
is necessary for many of the problems at the end of this section. Suppose the 
strip being revolved about an axis is separated from this axis by a certain dis¬ 
tance, as suggested on the left in Fig. 7.12. In this case the element of volume 
generated by the strip is a disk with a hole in it—what might be described as a 
washer (this washer is moved out to the right in the figure for the sake of clar¬ 
ity). The volume of this washer is the volume of the disk minus the volume of 
the hole, 


dV = 7r(yi 2 — y 2 2 ) dx. 


The total volume of the solid of revolution is therefore 


V = 




- yi 2 ) dx , 


where y\ and y 2 , the outer and inner radii of the washer, are determined as func¬ 
tions of jc from the given conditions of the problem. This procedure for calcu¬ 
lating volumes is called—naturally enough—the washer method. It applies to 
solids of revolution that have hollow spaces inside them. 


Remark 2 Both of the formulas obtained in Examples 1 and 2 can be vividly 
expressed and easily remembered by stating the volumes of the cone and sphere 
as simple fractional parts of the volumes of the circumscribed cylinders (Fig. 
7.13). 


Remark 3 Formula (3) for the volume of a sphere was discovered by Archimedes 
in the third century B.C. Since he used a very beautiful and ingenious early form 
of integration, we give his argument in the Appendix at the end of this chapter 
for those who wish to examine it. 



Figure 7.13 
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PROBLEMS 


1 Find the volume of the solid of revolution generated 
when the area bounded by the given curves is revolved 
about the x-axis: 

(a) y = Vx, y = 0, x = 4; 

(b) y = 2x - x 2 , y = 0; 

(c) y 3 = x, y = 0, x = 1; 

(d) y = x, y = 1, x = 0; 

(e) x = 2y -y 2 , x = 0; 

(f) x 2/3 + y 2/3 = a 213 , first quadrant. 

2 Problem 14 in Section 6.7 is concerned with the ellipse 



If the area inside this ellipse is revolved about the 
x-axis, the resulting solid (which resembles a football) 
is called a prolate spheroid. Find its volume. [If a < b, 
the solid is called an oblate spheroid. Observe that the 
volume formula is the same regardless of how a and b 
are related to each other, and also that it reduces to for¬ 
mula (3) when b = a.] 

3 The horizontal cross section of a certain pyramid at a 
distance x down from the top is a square of side ( b/h)x , 
where h is the height of the pyramid and b is the side 
of the base. Show that the volume of the pyramid is 
one-third the area of the base times the height. 

4 A horn-shaped solid is generated by a moving circle 
perpendicular to the y-axis whose diameter lies in the 
xy-plane and extends from y = 27x 3 to y = x 3 . Find the 
volume of this solid between y = 0 and y = 8. 

5 The square bounded by the axes and the lines x = 2, 
y = 2 is cut into two parts by the curve y 2 = 2x. Show 
that these parts generate equal volumes when revolved 
about the x-axis. 

6 The two areas described in Problem 5 are revolved about 
the line x = 2. Find the volumes generated. 

7 A tent consists of canvas stretched from a circular base 
of radius a to a vertical semicircular rod fastened to the 
base at the ends of a diameter. Find the volume of this 
tent. 

8 The base of a solid is a quadrant of a circle of radius 
a. Each cross section perpendicular to one edge of the 
base is a semicircle whose diameter lies in the base. 
Find the volume. 

9 The base of a certain solid is the circle x 2 + y 2 = a 2 . 
Each plane perpendicular to the x-axis intersects the 
solid in a square cross section with one side in the base 
of the solid. Find its volume. 

10 If the area bounded by the parabola y = H — (H/ : )x 2 
and the x-axis is revolved about the y-axis, the result¬ 
ing bullet-shaped solid is a segment of a paraboloid of 
revolution with height H and radius of base R. Show 


that its volume is half the volume of the circumscrib¬ 
ing cylinder. 

11 If the circle (x — b) 2 + y 2 = a 2 (0 < a < b) is revolved 
about the y-axis, it generates a doughnut-shaped solid 
called a torus. Find the volume of this torus by the 
washer method. Hint: If necessary, use the result of 
Problem 13 in Section 6.7. (Notice the remarkable fact 
that the volume of the torus is the product of the area 
of the circle and the distance traveled by its center as it 
revolves about the y-axis.) 

12 Find the volume of the solid formed by revolving the 
area inside the curve x 2 + y 4 = 1 about (a) the x-axis; 
(b) the y-axis. 

13 The base of a certain solid is an equilateral triangle of 
side a, with one vertex at the origin and an altitude along 
the x-axis. Each plane perpendicular to the x-axis in¬ 
tersects the solid in a square cross section with one side 
in the base of the solid. Find the volume. 

14 Each plane perpendicular to the x-axis intersects a cer¬ 
tain solid in a circular cross section whose diameter lies 
in the xy-plane and extends from x 2 = 4y to y 2 = 4x. 
The solid lies between the points of intersection of these 
curves. Find its volume. 

15 The base o f a certain solid i s the region bounded b y the 
parabola x 2 = 4y and the line y = 9, and each cross sec¬ 
tion perpendicular to the y-axis is a square with one side 
in the base. If a plane perpendicular to the y-axis cuts 
this solid in half, how far from the origin is this plane? 

*16 Two great circles lying in planes that are perpendicular 
to each other are drawn on a wooden sphere of radius 
a. Part of the sphere is then shaved off in such a way 
that each cross section of the remaining solid that is per¬ 
pendicular to the common diameter of the two great cir¬ 
cles is a square whose vertices lie on these circles. Find 
the volume of this solid. 

*17 The axes of two cylinders, each of radius a, intersect at 
right angles. Find the common volume. Hint: Drawing 
this figure is nine-tenths of the problem. To do this, draw 
the usual x-axis and y-axis, and also a z-axis “coming 
out of the paper,” so that thexz-plane is horizontal. Let 
the axis of one cylinder be the x-axis and the axis of 
the other the z-axis, but draw only the quarter of the 
first cylinder that lies in front of the xy-plane and above 
the xz-plane, and the quarter of the second that lies to 
the right of the yz-plane and above the xz-plane. Their 
intersection is one-eighth the total volume and is not 
difficult to sketch. Now consider horizontal cross sec¬ 
tions. 

18 Consider the area in the first quadrant under the curve 
x 2 y 3 = 1 and to the right of x = 1. By integrating from 
x = 1 to x = b and then letting b —> °o, show that this 
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area is infinite, but that on revolving it about the jc-axis we 
obtain a finite volume. 

19 A birdbath 4 in deep has the shape of a segment of a 
sphere of diameter 16 in. It contains a decorative lead 
cannonball of diameter 6 in. If the birdbath is filled with 
water to a depth of x inches, how much water does it 
contain? 

20 In Example 3, findthe volume of the wedge if the plane 
through a diameter of the base is inclined at an angle 
of 30° to the base. 

21 Cavalieri’s principle states that if two solids have the 
same height H, and if sections made by planes parallel 
to the bases and at the same distance jc from these bases 
always have equal areas A(x) and B(x), then the two 
solids have equal volumes. 1 " 

(a) Prove this by integration. 

(b) Use this principle and the formula for the volume 
of a cone (Example 2) to derive the formula for the 
volume of a sphere (Example 1) by examining Fig. 
7.14. (The figure on the right is a cylinder with two 
conical hollows in it whose common vertex is at the 
center.) 

22 Water evaporates from an open bowl of unspecified 
shape at a rate proportional to the area of the water sur¬ 
face; that is, 



where V is the volume of w ater, A(h) is the area of the 
water surface when the depth is h, and c is a positive 
constant. 

f BonaventuraCavalieri (1598-1647), the Italian mathematician men¬ 
tioned in Section 6.5, was a disciple of Galileo and clearly had many 
excellent ideas of his own. 



area = 7ry 2 

= 7 T(a 2 - x 2 ) 


(a) Show that 

dh _ 
dt C ’ 

so that the water level drops at a constant rate 
regardless of the shape of the bowl. Hint: 
V = f?jA(x) dx. 

(b) If h = ho when t = 0, when will the bowl be empty? 

23 A cylindrical drinking glass of height h and radius of 
base a is full of water. The glass is tilted, and the 
water spilled, until the remaining water just covers the 
bottom of the glass. Use rectangular cross sections to 
calculate ho w muc h water remains in the glass. 
Hint: J- a \Za 2 — x 2 dx = \7ra 2 , because the integral 
represents the area of a semicircle of radius a. 

24 Solve the preceding problem without calculation, by us¬ 
ing only common sense and geometry. 

25 The clepsydra, or ancient water clock, was a bowl from 
which water was allowed to escape through a small hole 
in the bottom. It was often used in Greek and Roman 
courts to time the speeches of lawyers in order to keep 
them from talking too much. Let y = f(x) be a curve 
that rises from the origin into the first quadrant of the 
jcy-plane, and assume the clepsydra has the shape of the 
surface obtained by revolving this curve about the 
y-axis. According to Torricelli’s law, water flows out 
through the hole in the bottom of the clepsydra at the 
speed it would acquire in falling freely from the water 
level to the level of the hole. Find what the function y = 
f(x) must be in order to guarantee that the water level 
falls at a constant rate. Hint: By equations (6) and (7) 
in Section 5.5, Torricelli’s law implies that the exit speed 
of the water through the hole is proportional to the 
square root of the depth of the water. 



= 7r(a 2 - x 2 ) 


Figure 7.14 
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There is another method of finding volumes that is often more convenient than 
those described in Section 7.3. 

To understand this method, consider the region shown on the left in Fig. 7.15, 
that is, the region in the first quadrant bounded by the axes and the indicated 
curve y = /(;c). If this region is revolved about the x-axis, then the thin vertical 
strip in the figure generates a disk, and we can calculate the total volume of the 
solid by adding up (or integrating) the volumes of these disks from x = 0 to x = 
b. This, of course, is the disk method described in Section 7.3. However, if the 
region is revolved about the y-axis, as shown in the center of the figure, then we 
get an entirely different solid of revolution and the vertical strip generates a thin- 
walled cylindrical shell. This shell can be thought of as resembling a soup can 
whose top and bottom have been removed, or perhaps a thin-walled cardboard 
mailing tube. Its volume dV is essentially the area of the inner cylindrical sur¬ 
face (277xy)* times the thickness of the wall ( dx ), so 


7.4 

VOLUMES: THE 
METHOD OF 
CYLINDRICAL SHELLS 


dV = 2nxy dx. 


(1) 


As the radius x of this shell increases from x = 0 to x = b, we can see from Fig. 
7.15 that the resulting series of cylindrical shells fills the solid of revolution from 
the axis outward, in much the same way as the cylindrical growth layers in the 
trunk of a tree fill the trunk from the axis outward. The total volume of this solid 
is therefore the sum—or integral—of the elements of volume dV, 

V = j dV = | 2nxy dx = 2iTxf{x) dx , (2) 


since y —f(x). In principle, this volume V can also be found by using horizontal 
disks generated by thin horizontal strips; however, this could turn out to be dif¬ 
ficult, since the given equation y =f(x) would have to be solved for x in terms 
of y. 

Just as in our other applications of integration, formulas (1) and (2) give brief 
expression to a complex process involving limits of sums; and as usual, we omit 
the details of this process in the interests of clarity. 

Also as usual, we suggest that students would be wise not to memorize for¬ 
mula (2). This formula is somewhat similar to the corresponding formula for the 
disk method, and students who try to memorize them and use them without think- 


*Remember: The curved surface area of a cylinder of radius * and height y is obtained by “unrolling” 
the cylinder so that it becomes a rectangle of base 2ttx and height y; see the right side of Fig. 7.15. 




Figure 7.15 
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ing about their meaning will almost certainly confuse them and come to grief. It 
is better to sketch a figure and construct (1) directly from the visible evidence of 
this figure, and then form (2) by integration. Also, this approach has the further 
advantage that we are not tied to any particular notation, and can easily adapt the 
basic idea to solids of revolution about various axes. 



Example 1 In Example 1 of Section 7.3 we calculated the volume of a sphere 
by the disk method. We now solve the same problem by the shell method (see 
Fig. 7.16). The volume of the shell shown in the figure is 

dV = 27rx(2y) dx 

= 4ttj tV < 2 2 — x 2 dx. 


The volume of the sphere is therefore 

V = 47tJ q jcV a 2 — x 2 dx = 47t(-j )(a 2 - x 2 ) 312 


477 


(a 2 - x 2 } 


2 \ 3/2 


_ 4 3 

= y 77(3. 


In this connection we can profitably consider a related problem: If a vertical hole 
of diameter a is bored through the center of the sphere, find the remaining vol¬ 
ume. For this, it clearly suffices to integrate dV as the radius x of the shell in¬ 
creases from x = a/2 to x = a, so 


V = 



477 


-* ldx= — ( fl2 “ * 2 ) 3 /2 


a 

all 


477 / 3 ,\ 3/2 

" 3 U fl ) 


477 

T 




This problem could be solved by the washer method, but the shell method is 
much more convenient. 



Example 2 The region in the first quadrant above y — x 2 and below y = 2 - x 2 
is revolved about the y-axis (Fig. 7.17). To find the volume by the shell method, 
we observe that the height of our typical shell is y = (2 — x 2 ) — x 2 = 2 — 2jc 2 , 
so 

dV = 2rrxy dx — 2ttx{2 — 2x 2 ) dx 
= 4tt{x — x 3 ) dx; 

and since the curves intersect at t = ± 1, we have 


’ = 477 J o (x - x 3 ) dx 


= 4tt(\x 2 - \x A ) 


Students often wish to set up this integral incorrectly by integrating from jc = 
— 1 to jc = 1. The reason why this is incorrect can be seen by understanding from 
the geometry that our typical shell sweeps through the solid from the axis out¬ 
ward: x is the radius of the shell, and this radius increases from 0 to 1, not from 
-1 to 1. 
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Notice that if we attempt to solve this problem by the disk method, then it is 
necessary to calculate two separate integrals—one referring to the volume be¬ 
low the points of intersection of the two curves, and the other to the volume 
above. 


Example 3 Blood flow. The great artery of the human body—the aorta—is a 
tube about as large as the base of an average human thumb. The heart pumps 
blood through it so powerfully that blood particles near the center move at speeds 
of about 50 cm/s (20 in/s). On the other hand, blood is a viscous liquid, and near 
the artery wall the blood tends to stick to the wall, and its speed there is essen¬ 
tially zero. The problem of calculating the total flow under these circumstances 
requires integration by the method of cylindrical shells. 

We begin with the very simple idea that if a liquid flows through a cylindri¬ 
cal tube with constant speed $o, then the volume of liquid passing a fixed point 
per unit time (the flow F ) is soA, where A is the area of a cross section of the 
tube (Fig. 7.18). 

However, we know that the flow of blood in an artery in the human body is 
much more complicated than this. Let us assume that the artery is a cylindrical 
tube with radius R and length L (Fig. 7.19). Because of the viscosity mentioned 
above, the flow of blood takes place in thin cylindrical layers, with the blood in 
each layer moving at approximately constant speed and the blood in different 
layers moving at different speeds. In this so-called laminar flow the blood moves 
slowly near the artery wall and faster near the center, as suggested in Fig. 7.19, 
so that the inner layers slip past the outer ones (Fig. 7.20). 

The precise relation between the speed 5 and the distance r from the center is 
given by the formula 



Figure 7.18 


s = 



r 2 ). 


(3) 


where P is the pressure difference between the ends of the artery and tj (eta) is 
the viscosity of blood.* We notice that this formula gives zero speed if r = R and 


^Formula (3) can be derived from general principles in the theory of viscous fluid dynamics. The de¬ 
tails can be found on pp. 39-41 of R. L. Whitmore, Rheology of the Circulation (Pergamon Press, 
1968). 



Figure 7.19 Figure 7.20 
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maximum speed PR 2 /4r]L if r = 0. It is customary to measure R, r, and L in cen¬ 
timeters (cm), P in dynes/cm 2 , and 77 in dyne-s/cm 2 , so that s is measured in 
cm/s. A typical value for R in the human body is R = 0.2 cm, and a realistic 
value for the constant P!Ar\L is 500. With these values formula (3) becomes 

5 = 500(0.2 2 - r 2 ) 



= 20 — 500r 2 cm/s. (4) 

The graph of this function is part of a parabola (Fig. 7.21), and this graph shows 
how quickly the speed of a blood particle approaches zero as its position ap¬ 
proaches the wall of the artery. Thus, at the center the speed 5 is 20 cm/s, but 
when r = 0.15 we see that 5 is only 20 - 500(0.15) 2 = 8.75 cm/s. 

Now, to calculate the flow F (the total volume of blood passing a fixed point 
per unit time) we write down the element of flow dF in a thin cylindrical shell 
of radius r and thickness dr. 

dF — s • 2nr dr 

= — (R 2 — r 2 ) ■ lirr dr 
4 7)L 

= ( R 2 r — r 3 ) dr. 

2 y]L 


All that remains is to add together these elements of flow over all the shells, that 
is, to integrate from 0 to R: 

F= l dF = lo^ (R2r -' 3)dr 


This formula, 


7 tP 
2r)L 




dr 


= — ( l R 2 r 2 -~r*) 
2yL ( - r * r ’ 


ttP 
8 vL 


R\ 


F = 


ttP 

JrfL 


R\ 


(5) 


is called Poiseuille’s law in the field of cardiovascular physiology. It shows that 
the flow is proportional to the fourth power of the radius of the artery, so dou¬ 
bling this radius increases the flow by a factor of 16.* 


*Jean Poiseuille (1799-1869) was a French physician-physiologist whose experimental studies of the 
flow of liquids through thin glass tubes have rarely been equaled since his classical paper of 1846. 
By varying one parameter and holding all the others fixed, he learned from his experiments that P 
is directly proportional to the length L , to the viscosity rj, to the flow F, and inversely proportional 
to the fourth power R 4 of the radius R , so that 


Of course, the fact that Poiseuille’s constant of proportionality k equals 8/77 can only be learned from 
the mathematics. See p. 134 of H. S. Badeer, Cardiovascular Physiology (Karger Publishing Co., 
1984). For the somewhat surprising role of Isaac Newton in this subject, see pp. 50-51 of A. C. Bur¬ 
ton, Physiology and Biophysics of the Circulation (Year Book Medical Publishers, 1965). 
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PROBLEMS 

1 Solve the problem of the sphere with the hole bored 
through it (Example 1) by the washer method. 

2 Solve the problem in Example 2 by the disk method. 

In Problems 3-8, sketch the region bounded by the given 

curves and use the shell method to find the volume of the solid 

generated by revolving this region about the given axis. 

3 y = Vr, x = 4, y = 0; the y-axis. 

4 x 2 = 4y, y = 4; the x-axis. 

5 y = x 3 , x = 3, y = 0; the y-axis. 

6 x = y 2 , x 2 = 8y; the x-axis. 

7 y = \ x = a, x = b (0 < a < b\ y = 0; the y-axis. 

8 y = x 2 , y = j(3x 2 + 1); the y-axis. 

9 The region bounded by y = x/Vx 3 4- 8, the x-axis, and 

the line x = 2 is revolved about the y-axis. Find the vol¬ 
ume of the solid generated in this way. (Observe that the 
washer method is not practical in this problem.) 

10 A hole of radius V3 is bored through the center of a 
sphere of radius 2. Find the volume removed. 

11 Consider the region i n the first quadrant bounded by y = 
4 — x 2 and the axes. 

(a) Use both the disk method and the shell method to 
find the volume of the solid generated when this re¬ 
gion is revolved about the y-axis. 

*(b) Use both methods to find the volume of the solid 
generated when this region is revolved about the 
x-axis. 

12 Let r and h be positive numbers. The region bounded by 
the line x/r + y/h = 1 and the axes is revolved about the 
y-axis. Use the shell method to obtain the standard for¬ 
mula for the volume of a cone. 

13 A spherical ring is the solid that remains after drilling a 
hole through the center of a solid sphere. If the sphere 
has radius a and the ring has height h, prove the re¬ 
markable fact that the volume of the ring depends on h 
but not on a. 

14 The parabola a 2 y = bx 2 , 0 ^ y ^ b, is revolved about the 
y-axis. Use the shell method to show that the volume of 
the resulting paraboloid is one-half the volume of the 
cylinder with the same height and base. 

15 The region in the first quadrant above y = 3x 2 and be¬ 
low y = 4 — 6x 2 is revolved about the y-axis. Find the 
volume generated in this way. 

16 For the artery of radius 0.2 cm (about -jy in) described 
by equation (4), calculate the flow F and also the total 
volume of blood passing a fixed point in 1 hour. 

17 High blood pressure is one of the consequences of nar¬ 
rowing of the arteries by unwelcome fatty deposits, be¬ 
cause to maintain the same flow to the tissues the heart 


must pump harder. If Po and Ro are normal values of the 
pressure and radius for a particular artery, and the ab¬ 
normal values due to narrowing are P and R, show that 
for the flow to remain constant we must have 

P_ = (RoY 

p 0 U r 

If the radius of the artery is reduced to seven-eighths of 
its normal value, how much is the pressure increased? 

18 A cylindrical can partly filled with water is rotated about 
its axis with constant angular velocity co. As the rotation 
proceeds, the water level rises along the wall and sinks 
in the center to form the concave surface shown in Fig. 
7.22. Show that this surface has the shape of the surface 
of revolution formed by revolving the parabola 

co 2 1 

y = — x 2 + h 
2 g 

about its axis, where g (as usual) is the acceleration due 
to gravity. Hint: The centripetal force acting on a parti¬ 
cle of water of mass m at the free surface is mxco 2 , where 
x is the distance from the axis, and this is the resultant 
according to the parallelogram law (as shown in the fig¬ 
ure) of the downward gravitational force mg and the re¬ 
action force R normal to the surface which is due to other 
nearby particles of water. 



(a) Let V 0 be the given volume of water, and by calcu¬ 
lating Vo by the method of cylindrical shells, express 
the depth h as a function of co. 
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(b) If the can rotates faster and faster, then either the bot¬ 
tom will be exposed, as in Fig. 1.23a, or water will 


begin to spill out the top, as in Fig. 1.23b. If the can is orig¬ 
inally half full, which happens first? 


Figure 7.23 




{b) 


7.5 

ARC LENGTH 


An arc is the part of a curve that lies between two specific points A and B, as 
shown on the left in Fig. 7.24. Physically, the length of an arc is a very simple 
concept. Mathematically, it is somewhat more complicated. From the physical 
point of view, we merely bend a piece of string to fit the curve from A to B , mark 
the points corresponding to A and B, straighten out the string, and measure its 
length with a ruler. 

This process can be carried out by means of an approximation procedure that 
lends itself to mathematical treatment, as follows. Divide the arc AB into n parts 
by using points Pq = A, P\, P 2 , . . . , P n = B\ place pins at these points; and let 
the string stretch in short straight-line paths from each pin to the next. We illus¬ 
trate this idea on the right in Fig. 7.24 with n = 3. The part of this string be¬ 
tween A and B is evidently shorter than the arc, since a straight line is the short¬ 
est distance between two points. However, if we take larger and larger values of 
n , and at the same time require that the pins be placed closer and closer together, 
then the length of the string should approach the length of the arc. We now ex- 




Figure 7.24 
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press this idea in mathematical language and derive a practical method of cal¬ 
culating arc length by integration. 

Let us assume that the arc under discussion is the graph of a continuous func¬ 
tion y = /(x) for a < x < b. We partition the interval [a, b ] into n subintervals 
by using points xq = < 2 , x\, . . ., x k -u x b • • • * x. n = b, as shown in Fig. 7.25. We 
let P k be the point (x*, y*), where y* = f(x k ). The total length of the polygonal 
path PqP\ • • • P k -\Pk ' ’ ' P n is the sum of the lengths of the chords joining each 
point to the next. If we write 

Ax* = x k - x k -x and A y k = y* - y*- i, k = 1, 2, . . ., n, 

then it is clear by the Pythagorean theorem that we have 
length of kih chord = V(Ax*) 2 + (Ay*) 2 



(Ay*) 2 

(Ax *) 2 


(Ax*) 2 




by first factoring (Ax*) 2 out of the sum and then out of the square root sign. We 
now assume that y = /(x) is not only continuous but also differentiable. This per¬ 
mits us to replace the ratio inside the radical, which is the slope of the chord 
joining P k - \ to P k , by the value of the derivative at some point x* between x k -1 
and x k : 


A y k _ r, 


Ax* 


= /'( 4 ), 


Xk -1 < X* < X*. 


The justification for this step lies in the fact that the chord is parallel to the tan¬ 
gent at some point on the curve between P*- j and P k .^ This enables us to write 
(1) as 

length of fcth chord = Vl + [/'(x*)] 2 Ax*, 
so the total length of the polygonal path is 

X Vl + [/'(4)] 2 A**. (2) 

We now obtain our conclusion by forming the limit of these sums as n approaches 
infinity and the length of the longest subinterval approaches zero: 

n 

length of arc AB = lim Y Vl + [/'(x*)l 2 Ax* 

max Ax*—>-0 

= \ h Vl + [/'Wl 2 dx, (3) 

Ja 


provided /' (x) is continuous so that this integral exists. 

At first sight, formula (3) may appear to be rather hard to keep in mind How¬ 
ever, if we use the Leibniz notation dy/dx instead of/'(*), then the following in¬ 
tuitive approach makes this formula much easier to grasp and remember. Let the 


^This highly plausible assertion is called the Mean Value Theorem. This theorem is one of the cor¬ 
nerstones of the theory of calculus and is discussed in Section 2.6. 
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letter s denote the variable arc length from A to a variable point on the curve, as 
shown in Fig. 7.26. Let s be allowed to increase by a small amount ds, so that 
ds is the differential element of arc length, and let dx and dy be the correspond¬ 
ing changes in x and y. We think of ds as so small that this part of the curve is 
virtually straight, and therefore ds is the hypotenuse of a tiny right triangle called 
the differential triangle. For this triangle the Pythagorean theorem says that 

ds 2 = dx 2 -I- dy 2 , (4) 

and this simple equation is the source of all wisdom in the calculation of arc 
lengths. 1- If we solve (4) for ds, then factor dx 1 out of the sum and remove it from 
the square root sign, we clearly get 


ds = \/ dx 2 + dy 2 



(5) 


We now touch again on the basic theme of this chapter and point out that the to¬ 
tal length of the arc AB can be thought of as the sum—or integral—of all the 
elements of arc ds as ds sweeps along the curve from A to B. In view of (5), this 
yields 


length of arc AB = 



( 6 ) 


which is (3). This formula tells us that x is the variable of integration and that y 
is to be treated as a function of x. However, it is sometimes more convenient to 
use y as the variable of integration and treat x as a function of y. In this case we 
replace (5) by 

ds = ^ dx 2 + dy 2 

,71 

which is obtained by factoring dy instead of dx out of the square root sign. With 
y as the variable of integration, the integral for the length of the arc AB is then 

j*-r m 

which is sometimes easier to evaluate than (6). 

Most mathematicians remember formulas (6) and (8) not by memorizing them 
as they stand, but instead by starting with (4) and mentally performing as needed 
the simple manipulations in (5) and (7). This way, the whole package of ideas is 
harder to forget, and we win a small skirmish in the constant struggle of mem¬ 
ory against forgetting. 


Example Find the length of the curve y 2 = 4x 3 between the points (0, 0) and 
(2, 4V2). 


Tarentheses are usually omitted in writing squares of differentials. Thus, ds 2 means (ds) 2 and not 
d(s 2 ), etc. 
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Solution This curve is shown in Fig. 7.27, and the arc in question is the indi¬ 
cated piece of the curve in the first quadrant. If we solve for y, then we get 

y = 2x 3 ' 2 , so = 3* 1 ' 2 . 


Formula (6) now yields 

length of arc = £ V I + 9x dx = (1 + 9x) 1/2 9 dx 


_ i 

— 9 


• f(l + 9x) 3/2 


2 

0 


= ^(19VT9- 1). 


This calculation should serve as a warning, for if we try to find the length of an 
arc on almost any familiar curve, then because of the presence of the square root 
in (6) the resulting integral will probably be impossible for us to work out. At 
this stage we must choose our problems very carefully in order for the integrals 
to be computable. This should also make us aware of our urgent need for more 
integration techniques. Filling this need is the main purpose of the next three 
chapters. 



Figure 7.27 


Remark 1 It is possible to give an example of a continuous curve y = f(x), a < 
x < b, that does not have a length. This very surprising fact suggests that the un¬ 
derlying theory of arc length is more complicated than it seems. 1 " In the preced¬ 
ing discussion we found it necessary to assume that the function y = f(x) has a 
continuous derivative. Such a curve is called a smooth curve, and the word “arc” 
is usually restricted to mean a piece of a curve with this property. A smooth curve 
is often described geometrically by saying that it has a “continuously turning tan¬ 
gent.” 


Remark 2 Some students may have the impression that equations (4) and (5) 
— which are equivalent to each other—are only approximately correct, because 
the differential triangle in Fig. 7.26 is only a “quasi-triangle” whose “hypotenuse” 
is not even a straight-line segment. But this is not the case. These equations are 
exactly correct, as the following argument shows. We know that (3) is valid, so 
the arc length s in Fig. 7.26 can be written as 

S= PVl + lf'(t)] 2 dt, 

Ja 

using t as the dummy variable of integration. It is clear that 5 is a function of the 
upper limit x; and if we calculate the derivative of this function by using formula 
(13) in Section 6.7, then we get 

f-Vi + l/W-V> + (£)’■ 

which is equivalent to (5). 


Tor examples and a few additional ideas, see Appendix A.7. 
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PROBLEMS 

In Problems 1-8, find the length of the specified arc of the 
given curve. 

1 y 2 = x 3 between (0, 0) and (4, 8). 

2 >,= 4 ** + 8?’ 1 ~ x ~ 2 * 

^ 1,1, 

3 y = — x 3 + — 1 < x < 3. 

3 4x 

4 y = yVx(3 - x), 0<x< 3. 

5 x =L y > + ±^y^ 3 . 

6 y = -j|x 6/5 — fx 4/5 , 1 < x < 32. 

7 y = y(2 + JC 2 ) 3/2 , 0<x<3. 

8 y = f( 1 + JC 2 ) 3/2 , 0<JC<3. 

9 Let A, B, C be positive constants. Show that the length 
of an arc of the curve y = A (B + Cj c 2 ) 3/2 can be calcu¬ 
lated by means of an integral not involving a square root 
if 

(a) A = \ and 5 2 C = 1, in which case the curve is 
y = (2/3 B 3 ){B 3 + jc 2 ) 3/2 ; or 

(b) B = 2 and 3AVC = 1, in which case the curve is 
y = (1/3VC)(2 + Cjc 2 ) 3/2 . 


Show that each of these curves includes Problems 7 and 
8 as special cases. 

10 The curve jc 2/3 + y m = a 213 is called an astroid or a 
hypocycloid of four cusps. Sketch it and find its total 
length. 

11 IfO < a < b and n is not equal to 1 or — 1, show that the 
length of 

= x n+l 1 1 

y >2+1 4(>1 -1) ' x n ~ 1 

between x = a and x = b can be calculated by means of 
an integral not involving a square root. Notice that Prob¬ 
lems 2 and 3 are special cases of this result. 

12 In each case set up the integral for the arc length, but do 
not attempt to evaluate it (these integrals are beyond our 
capacity at the present stage of our work): 

(a) y = Vx, 1 < x < 4; 

(b) y = x 2 , 0 < x < 1; 

(c) y = x 3 , 0 < x < 1; 

(d) the part of y — -x 2 + 4x — 3 lying above the 
x-axis. 


7.6 

THE AREA OF A 
SURFACE OF 
REVOLUTION 


Let us consider a smooth curve lying above the x-axis, as shown on the left in 
Fig. 7.24. When this curve is revolved about the x-axis, it generates a surface of 
revolution. We now set ourselves the problem of calculating the area of such a 
surface. 

For reasons that will become clear, we begin by considering a very simple sur¬ 
face of revolution, the curved lateral part of a cone whose base has radius r and 
whose slant height is L. If this cone is cut down the side from the vertex to the 
base—that is, along a generator—and laid out flat, as shown in Fig. 7.28, then 
we get a sector of a circle of radius L whose curved edge has length 2irr, and 
the lateral area A of the cone equals the area of this sector. It is geometrically 
clear that the ratio of the area of the sector to the complete area of the circle 
equals the ratio of the length of the curved edge to the complete circumference 
of the circle, that is 




Figure 7.28 
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A = 2ttt_ 
ttL 2 2 7 tL ’ 


so A = irrL. 


The lateral surface of the cone can evidently be thought of as the surface of rev¬ 
olution swept out by a generator as it revolves around the axis. If the formula is 
written as 

A = L • 27r({r), 


then we see that the lateral area of a cone equals the product of the length of a 
generator and the distance traveled by the midpoint on its journey around the 
axis. 

Next, we generalize slightly and find the area of the surface of revolution gen¬ 
erated when a line segment of length L is revolved about an axis at a distance r 
from its midpoint.* This area is the lateral area of a frustum of a cone, as shown 
in Fig. 7.29. If we denote this area by A, then A is the difference between the 
lateral areas of the two cones in the figure, so 

A = irr\L\ — irr 2 L 2 = 7h>iLi — r 2 L 2 ). 

By similar triangles, it is clear that 

L 2 _ L\ r - r 

— — or r x L 2 — r 2 L\. 
r 2 n 

With the aid of a bit of algebraic ingenuity, this enables us to write A in the form 



Figure 7.29 


A - ir(r\L\ r { L 2 + r 2 L x - r 2 L 2 ) = n[r x (L\ - L 2 ) + r 2 (L\ L 2 )] 

' r_x+rf 


= tt(L\ - L 2 )(r i + r 2 ) = ( L\ L 2 ) • 2tt 


= L • 2irr. 



We therefore conclude that in this case as well, the area of the surface of revo¬ 
lution equals the product of the length of the segment and the distance traveled 
by the midpoint on its journey around the axis. 

We now apply these ideas to the general area problem stated at the beginning 
of this section. Our approach will be intuitive and geometric. 

We begin by approximating the smooth curve y = /(;c) by a polygonal path 
consisting of many short line segments connecting nearby points on the curve, 
as shown on the left in Fig. 7.30. The surface generated by revolving the curve 


*In the case of a cone, one end of the segment lies on the axis and forms the vertex of the cone. 



Figure 7.30 


— 1 - 1-4 
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about the x-axis will have approximately the same area as the surface generated 
by revolving this polygonal path about the x-axis (Fig. 7.30, center). The latter 
surface is evidently made up of a number of pieces, each of which is shaped like 
a frustum of a cone. This situation suggests the fundamental idea illustrated on 
the right in the figure. If the element of arc length ds is revolved about the x- 
axis, then it generates a thin ribbonlike element of area dA\ and if the midpoint 
of ds is at a distance y from the x-axis, then the above discussion tells us that 


dA = 27 ry ds = 27ry 



dx. 


We obtain the total area A of the surface by forming the sum—or integral—of 
all the elements of area dA as dA sweeps along the complete surface, 

A = j dA = j 2ny ds = [ 2wy dx, 

where y is assumed to be known as a function of x [y =/(*)]. If we choose in¬ 
stead to revolve our curve about the y-axis, and thereby to generate an entirely 
different surface of revolution, then in the same way its area is given by 

A = | 27 rx ds. 


The underlying idea in both of these formulas can be expressed by writing 
A = 27r(radius of revolution) ds. 


In using this formula to perform an actual calculation, the element of arc length 
ds must be written in terms of a convenient variable of integration and appro¬ 
priate limits of integration must be provided. 


Example Find the surface area of a sphere of radius a. 



Solution The 
tion generated 
7.31). Since 


surface of this sphere can be considered as the surface of revolu- 
by revolving the semicircle y = V a 2 — x 2 about the x-axis (Fig. 


& = A (a i- x i)m = 
dx dx 




and ds sweeps along the arc of the circle in the first quadrant as x increases from 
0 to a , we can use the left-right symmetry of the figure and write 




a dx = 477 u 2 . 


It is also possible to use y as the variable of integration, because ds also sweeps 
along the arc in the first quadrant as y increases from 0 to a. The calculation is 
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a little more complicated, but it may be instructive for students to see how it 
works. Since x = Va 2 — y 2 in the first quadrant, we have 


— = — (a 2 - v 2 ) m = — y 
dy dy }) Va^J 2 ' 


and therefore 

A = J 2rry ds = 2 J q 2iryy 


dx\ 2 

d y, 


+ 1 dy 


= 4 Wo'a/^7+ ldy = 4 ™lo 


ydy 


Vfl 2 - y 2 


= 47 J (,a 2 - y 2 ) 1/2 (-2 y dy) = ATra{—\)2\/a 2 - y 2 


- Aira 2 , 


as before. 


Remark In addition to discovering the volume of a sphere, Archimedes also 
found its surface area by means of a brilliant piece of insight that links these two 
quantities to each other. His idea was to split up the solid sphere into a large 
number of small “pyramids,” as follows. Imagine that the surface of our sphere 
of radius a is divided into many tiny “triangles,” as suggested in Fig. 7.32. Of 
course, these little figures are not actually triangles, since there are no straight 
lines on the surface of a sphere. However, they are so small that each figure is 
nearly flat and they are nearly triangles. Let each such triangle be used as the 
base of a pyramid of height a whose vertex is the center of the sphere. If A* is 
the area of the base of our tiny pyramid and V* is its volume, for k = 1,2 ,,n, 
then we know that V * = jA*&. (The fact that the volume of a pyramid is one- 
third the area of the base times the height was discovered by Democritus two 
centuries before the time of Archimedes.) By adding these equations for k = 
1, 2, . . . , n, we obtain 

k =1 k= 1 D \k =1 / 

Since all our pyramids fill the solid sphere, this tells us that the volume V and 
surface area A of the sphere are related by the equation 

V = jAa. 

But now Archimedes’ discovery that V = 77 ra 3 enables us to write this equation 
in the form 

jna 3 = jAa, 


so 


A = 477V2 2 , 



Figure 7.32 


just as in the example. 
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PROBLEMS 

In Problems 1-6, find the area of the surface of revolution 

generated by revolving the given arc about the indicated axis. 

1 y = -7 x 4 + — Kr, 1 < x < 2 , the y-axis. 

4 8jt 

2 y = yVx(3 - x), 0 < x < 3, the x-axis. 

3 y = y(2 + x 2 ) 3/2 , 0 < x < 2, the y-axis. 

4 y = x 2 , 0 < x < 2, the y-axis. 

5 y = x 3 , 0 < x < 1, the x-axis. 

6 y = iVx, 2 < x < 8 , the x-axis. 

7 The arc of the parabola x 2 = 4 py between (0, 0) and 
(2 p, p) is revolved about the y-axis. Find the area of the 
surface of revolution (a) by integrating with respect to x; 
(b) by integrating with respect to y. 

8 The loop of 9y 2 = x(3 -x) 2 is revolved about the y-axis. 
Find the area of the surface generated in this way. 

9 Find the area of the surface generated by revolving the 
astroid (or hypocycloid of four cusps) x 2/3 + y 2/3 = a 273 
about the y-axis. 

10 Consider a cylinder circumscribed about a sphere of ra¬ 
dius a. Let two planes perpendicular to the axis of the 
cylinder intersect the sphere. If these planes are at a dis¬ 
tance h apart, show that the area of the spherical zone 
that lies between them on the sphere is lirah. (It is a re¬ 
markable fact that this is the same as the area between 
these planes on the lateral surface of the cylinder. Note 
also that if h = 2a this result yields the formula for the 
total surface area of the sphere.) 

11 If a curve lies above the x-axis, its moment around the 
x-axis is defined to be Jy ds , where this integration is ex¬ 
tended over the complete length of the curve. The mo¬ 
ment of the curve around the y-axis is Jx ds. The point 
(x, y) is called the centroid of the curve (Fig. 7.33) if its 
coordinates are defined by 


_ _ Jx ds _ jx ds 

Jds length of curve 
and 

_ _ Jy ds _ Jy ds 
y fds length of curve ’ 

If the curve is thought of as a uniform metal wire, its 
centroid is its center of gravity, or balancing point. Find 
the centroid of the semicircle x 2 + y 2 = a 2 , y > 0. 



12 In Problem 11, let the curve be revolved about the x-axis 
and show that the area of the surface of revolution equals 
the length of the curve times the distance traveled by the 
centroid on its journey around the x-axis. 

13 Use the theorem proved in Problem 12 to find 

(a) y for the semicircular arc in Problem 11; 

(b) the surface area of the torus (a fancy name for a 
doughnut) obtained by revolving the circle x 2 + 
(y — b) 2 = a 2 , a < b , about the x-axis. 


7.7 

WORK AND ENERGY 


It is a common experience that in moving an object against a force acting on it, 
as in lifting a heavy stone, we have the sensation of expending effort or doing 
work. Even before we define the physical concept of work, we are convinced that 
it takes twice as much work to lift a 20-lb stone a given distance as it does to 
lift a 10-lb stone, and also that the work done in lifting a stone 3 ft is three times 
that done in lifting it 1 ft. These ideas point the way to our basic definition: If a 
constant force F acts through a distance d, then the work done during this process 
is the product of the force and the distance through which it acts, 


work = force ■ distance 


or 


W= F - d. 


( 1 ) 


It is understood here that the force acts i n the direction of the motion. 
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As we know, the “weight” of an object is the force with which the object is 
attracted to the earth by gravity. For a given object moving at or near the surface 
of the earth, this force remains essentially constant in magnitude and is always 
directed toward the center of the earth. Thus, if a box of groceries weighing 20 
lb is lifted 3 ft from the floor and placed on a table, then definition (1) tells us 
that 60 ft-lb of work are done; but if the box is then carried into another room 
and placed on a shelf without raising it or lowering it, then this action accom¬ 
plishes no work because the box was moved a distance zero in the direction of 
the force. And if a tractor drags a boulder 18 in by applying a constant force of 
2 tons, then the tractor does 36 in-tons (or 3 ft-tons) of work.* 

This definition is satisfactory as long as the force F is constant. However, many 
forces do not remain constant during the process of performing work. In a situ¬ 
ation like this we divide the process into many small parts and calculate the to¬ 
tal work by integrating the elements of work corresponding to these parts. 

This idea is illustrated by the operation of stretching a spring, as follows. 


Example 1 A certain spring has a natural length of 16 in. When it is stretched 
x inches beyond its natural length, Hooke’s law states that the spring pulls back 
with a restoring force of F= kx pounds, where k is a constant. The constant of 
proportionality k is called the spring constant, and can be thought of as a mea¬ 
sure of the stiffness of the spring. For the spring under discussion, 8 lb of force 
are required to hold it stretched 2 in. How much work is done in stretching this 
spring from its natural length to a length of 24 in? 


Solution First, the fact that F = 8 when x = 2 allows us to find k. We have 
8 = k • 2, so k = 4 and F = Ax. To clarify our ideas, we draw a picture of the 
spring in its unstretched condition, and also after it has been stretched x inches 
(Fig. 7.34). Now, if we imagine that the spring is stretched a very small addi¬ 
tional distance dx, then the force changes very little over this increment of dis¬ 
tance and can be treated as essentially constant. The work done against the pull 
of the spring over this increment of distance is 

dW= F dx = Axdx, (2) 


and the total work done during the complete stretching process is 


W= I dW= I Fdx = £ 4x dx = 2x 2 


= 128 in-lb, 



Figure 7.34 


since x increases from 0 to 8 as the length of the spring increases from 16 to 24. 


In a similar way, we can consider the work done by any variable force that 
acts in a given direction as its point of application moves in this direction. If 
we coordinatize the line of action by introducing an jc-axis, and if the point of 


Tf force is measured in pounds and distance in feet, work is measured in foot-pounds. This is the 
English or engineering system. In science the terms are as follows. In the centimeter-gram-second 
(cgs) system, the unit of force is the dyne, defined as the force that imparts an acceleration of 1 cm/s 2 
to a mass of 1 gram, while in the meter-kilogram-second (mks) system, the unit of force is the new¬ 
ton, defined as the force that imparts an acceleration of 1 m/s 2 to a mass of 1 kilogram. The corre¬ 
sponding units of work are the dyne-centimeter, or erg, and the newton-meter, or joule. For conver¬ 
sion I ft-lb = 1.356 joules and I joule = 10 7 ergs. 
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application of the variable force F(x ) moves from x = a to x = b, then dW = 
F(x) dx is the element of work and 

W= I dW = j b F(x)dx (3) 

gives the total work done during the process. This formula can be taken either 
as a definition or as a natural method of computing the work in accordance with 
the way of thinking described in Example 1. In our next example the same idea 
is applied to a different situation. 

Example 2 According to Newton’s law of gravitation, any two particles of mat¬ 
ter of masses M and m attract each other with a force F whose magnitude is di¬ 
rectly proportional to the product of the masses and inversely proportional to the 
square of the distance r between them, 


where G is the so-called constant of gravitation. If M is fixed at the origin, how 
much work is required to move m from r — a to r — b, where a < b ? 

Solution The element of work is 


dW= F dr = GMm 


(4) 


so the total work is 


W = 


dW = GMm 


f 


b dr^ 

2 


GMm 



GMm 



If we think of the final position r = b as being chosen farther and farther away, 
so that 6 —> then the work W approaches the limiting value GMm!a. This quan¬ 
tity is the work which must be done against the force of attraction to move m 
from r = a to an infinite distance, that is, to separate the masses completely; it 
is called the potential of the two particles. 



Each of the preceding examples is concerned with a variable force acting 
through a given distance. Our next example is very different. It involves a process 
in which the parts of a body—in this case, drops of water—are moved differ¬ 
ent distances against a constant force, and the total work is calculated as the sum 
of the various bits of work associated with the various parts. 

Example 3 Consider a cylindrical tank of radius r and height h y filled with wa¬ 
ter to a depth D (Fig. 7.35). How much work is done in pumping the water out 
over the rim of the tank? (As usual, we denote the weight-density of water, that 
is, the weight per unit volume, by w.) 

Solution The essence of this problem is the fact that each drop of water must 
be lifted from its initial position up to the rim of the tank and dumped over the 
side. The work done in this process is the same for all drops which are the same 
distance below the rim. This suggests that we consider all the water located in a 


Figure 7.35 
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thin horizontal layer of thickness dx at a height jc above the bottom of the tank, 
that we write down the element of work dW needed to lift this entire layer up to 
the rim of the tank, and that we calculate the total work in our usual way, by 
adding (or integrating) these elements of work as jc increases from 0 to D, so that 
our typical layer sweeps through all the water in the tank. It is clear from the fig¬ 
ure that the volume of the layer is irr 2 dx , so its weight is wirr 2 dx , and the work 
done in lifting this layer through the distance h — x to the top of the tank is 

dW = W7 Tr 2 dx - (h — jc). (5) 

The total work done in pumping out all the water is therefore 

W= I dW= wnr 2 j" (h - x) dx 


= w7Tr 2 (hx — 


W) 


WTrr 2 {hD - jD 2 ). 


We repeat: The crux of the method in this example is the fact that all drops of 
water in our typical layer are essentially the same distance below the rim of the 
tank, and can therefore be treated together in calculating the work. 


Students should observe that the use of definition (1) in a suitable form is the 
key to each of these examples. Specifically, formulas (2), (4), and (5) are simply 
the versions of (1) that are appropriate in each case. 

We devote the rest of this section to a brief discussion of the important con¬ 
cept of energy. 

Consider a variable force F that acts on a particle of mass m over a given dis¬ 
tance along a straight line, which we take to be the jc-axis. This force not only 
does work, but also imparts an acceleration dv/dt to the particle in accordance 
with Newton’s second law of motion, 

F = m where v = dxldt. (6) 

dt 

This acceleration produced by the force changes the velocity v of m, and there¬ 
fore also changes its kinetic energy —or energy due to motion—which is de¬ 
fined by the formula 

kinetic energy = \mv 2 . 

We are now in a position to prove the following important theorem of mechanics: 


The work done by the force F during the process described above equals the change 
in the kinetic energy of the particle; and in particular, if the particle starts from rest, 
then the work done on it equals the kinetic energy it attains. 


The proof is easy. We begin by writing (6) in the form 


„ dv dv dx dv 

F = m —— = m — -— = mv ——. 

dt dx dt dx 


Formula (3) now yields 


W = 



, ( b dv 
dx = mv—— dx 
Ja dx 


r 

Jv„ 


mvdv 
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1 2 
jmv 2 


1 2 1 2 
2 mv b z - 2 mv a z , 


so the work W equals the change in the kinetic energy, as stated. 


(7) 


Remark In certain types of physical situations—but not in all—it is possible 
to introduce the concept of potential energy. We do this very briefly as follows. 
In using formula (3) for the calculation in (7), we tacitly assumed that the un¬ 
specified force F is a continuous function depending only on the coordinate x 
over the interval a< x< b, say F = F(x). (Notice that a frictional force does not 
have this property; for it depends not only on the location of the particle m, but 
also on the direction in which it is moving.) By the discussion at the end of Sec¬ 
tion 6.7, this assumption guarantees that there exists a function V(x) such that 
dV/dx = —F(x). We can therefore evaluate the work W in (7) in another way, as 
follows: 


W= T F(x) dx = f a —F(x) dx = V(x) 
Ja Jb 

= V(a) - V(b). 


(8) 


This enables us to write (7) as 

\mv b 2 - {mV 2 = V(a) - V(b) 


or 


{mvb 2 + V{b) = \mv 2 + V(a). 


(9) 


On the left side of (9) we drop the subscript and replace V(b) by V(x) in order 
to emphasize that z;and V(x) are considered to be variables; and on the right side 
we hold v a and V(a) fixed. Equation (9) now takes the form 

\mv 2 + V(x) = \mv 2 + V(a) = E, (10) 



where the constant E is called the total energy of the particle. The function V(x) 
is called the potential energy of the particle, and (10) states that the sum of the 
kinetic energy and potential energy is constant. This is the law of conservation 
of energy, which is one of the basic principles of classical physics. 

We see from (10) that if F(x) does work and thereby increases kinetic energy, 
it does so at the expense of potential energy and can therefore be viewed as con¬ 
verting potential energy into an equal amount of kinetic energy. 

We point out that the definition of V(x) means that this function is determined 
only to within an additive constant, so in any specific situation the state of zero 
potential energy can be chosen to suit our convenience. Also, students may won¬ 
der about the mild trickery with algebraic signs that takes place in the definition 
of V(x) and in the calculation (8). The purpose of this is to guarantee the ap¬ 
pearance of plus signs instead of minus signs in (10) so that we can speak of the 
sum of the kinetic and potential energies as being constant instead of their dif¬ 
ference. 

Example 4 The working heart. From the point of view of physics, the human 
heart is a pump. Blood enters the left ventricle through the mitral valve (Fig. 
7.36) and is then pumped out to the body through the aortic valve as the heart 
muscle contracts. During each contraction the pressure exerted by the heart wall 


Figure 7.36 
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increases in a roughly linear manner from a diastolic pressure of about 80 mm 
Hg (millimeters of mercury) to a systolic pressure of about 120 mm Hg. We shall 
calculate the work done in the left ventricle during one heartbeat, assuming—a 
realistic figure—that the volume of this ventricle decreases by about 75 cm 3 4 dur¬ 
ing one contraction. We shall need to know that 100 mm Hg = 1.33 X 10 5 * 
dyne1 s/cm 2 . 

For convenience in working with the idea of a pump, we imagine the heart’s 
action to be carried on by the movement of a piston from x = 0 to x = a, as 
shown in the figure, instead of by muscular contraction. If A is the area of the 
piston head, then aA = 75. The pressure P(x) against which the piston works is 
easily seen from Fig. 7.37 to be 

40 

P(x) = — * + 80. 
a 


is 


W 


= P(x)A dx = A 1^- x -I- 80 j dx 


= a(—x 2 + &0x 
a 


= 100 a A 


o 


= (1.33 X 10 5 dynes/cm 2 ) ■ (75 cm 3 ) 
= 10 7 dyne-cm 
= 1 joule 
= 0.74 ft-lb. 



In the case of a person weighing 120 lb who has a pulse rate of 60, we can quickly 
learn by punching a few keys on our calculator that the heart does enough work 
in a 24-hour day to lift the person through a vertical distance of more than 500 
ft. The human heart is a remarkable organ and is shockingly underappreciated! 


PROBLEMS 


1 A spring has a natural length of 10 in, and a 12-lb force 
stretches it \ in. Find the work done in stretching the 
spring from 10 in to 18 in. 

2 A spring has a natural length of 12 in, and a 45-lb force 
stretches it to 15 in. Find the work done in stretching it 
from 15 in to 19 in. 

3 A spring supporting a railroad car has a natural length 
of 12 in, and a force of 8000 lb compresses it \ in. Find 
the work done in compressing it from 12 in to 9 in. 
(Hooke’s law is valid for compressing springs as well 
as for stretching them.) 

4 Find the natural length of a spring if the work done in 

stretching it from a length of 2 ft to a length of 3 ft is one- 

fourth the work done in stretching it from 3 ft to 5 ft. 


5 A bucket weighing 5 lb when empty is loaded with 60 
lb of sand. Unfortunately there is a hole in the bucket, 
and sand leaks out uniformly at such a rate that a third 
of the sand is lost when the bucket has been lifted 10 
ft. Find the work done in lifting the bucket this distance. 

6 A cable 100 ft long that weighs 4 lb/ft is hanging from 
a windlass. How much work is done in winding it up? 

7 Solve Problem 6 if a 300-lb weight is attached to the 
free end of the cable. 

8 A 5-lb monkey is attached to the end of a 30-ft hang¬ 
ing chain that weighs 0.2 lb/ft. It climbs the chain to 
the top. How much work does it do? 

9 Gas in a cylindrical chamber moves a piston by ex¬ 
panding or contracting. Let the cross-sectional area of 
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the cylinder be A, and let its variable volume and length be V 
and x (Fig. 7.38). If p is the pressure of the gas, then the force 
the gas exerts on the piston is pA. 



(a) If the gas expands from a volume Vi to a volume 
V 2 , show that the work done by the gas on the pis¬ 
ton is 

W=j*pdV. 

(b) If a force is exerted on the piston to compress the 
gas from a volume V\ to a volume V 2 , show that the 
work done on the gas is 

fv 7 

W=**A p dV. 

10 If air is compressed or expanded without any loss or 
gain of heat but with a possible change of temperature, 
then it obeys the adiabatic gas law pV x 4 = c, where c 
is a constant. If a cylinder contains 243 in 3 of air at a 
pressure of 14 lb/in 2 , find the work done by the piston 
on the air in compressing it adiabatically to a volume 
of 32 in 3 . (If this air is compressed slowly so that the 
heat generated is allowed to escape and the temperature 
remains constant, the compression is said to be isother¬ 
mal. In this case the pressure and volume are related by 
Boyle's lawpV = c, and in trying to calculate the work 
we are led to an integral of the form JdV/V, which is 
beyond our reach. One of the main purposes of Chap¬ 
ter 8 is to enable us to cope with integrals of this kind, 
which are important in many applications.) 

11 Consider a cylindrical buoy of cross-sectional area 8 ft 2 
which is floating upright in water whose weight-den¬ 
sity is vv = 62.5 lb/ft 3 . According to Archimedes’ prin¬ 
ciple, a floating body is acted on by an upward buoy¬ 
ant force equal to the weight of the displaced water, and 
in a state of equilibrium this upward force balances the 
downward force acting on the body due to gravity. 

(a) Show that there is an upward force of 62.5(8x) lb 


acting on the buoy when it is held x feet down from its equi¬ 
librium position. 

(b) How much work is done in pushing the buoy 1 ft 
down from its equilibrium position? 

*12 A conical buoy that weighs B pounds floats upright in 
water with its vertex a ft below the surface. A crane on 
a dock lifts the buoy until its vertex just clears the sur¬ 
face. How much work is done? Hint: When the crane 
has lifted the buoy x ft, then the force required to hold 
it in this position is the weight of the buoy minus the 
upward buoyant force due to the water still displaced, 
and this can be expressed as a function of x. 

13 If an iron ball is attracted to a magnet by a force of 
F = 15/x 2 pounds when the ball is x feet from the mag¬ 
net, find the work done in pulling the ball away from 
the magnet from a point where x = 2 to a point where 
x = 6. 

14 According to Coulomb’s law, two electrons repel each 
other with a force that is inversely proportional to the 
square of the distance between them. Suppose one elec¬ 
tron is held fixed at the origin on the x-axis. Find the 
work done in moving a second electron along the x-axis 
from x = 2 to x= 1. From x = a to x = b, where 
0 <b<a. 

15 If two particles of matter of masses M and m are a units 
apart, how much work must be done to move them twice 
as far apart? 

16 If R is the radius of the earth (about 4000 mi) and g is 
the acceleration due to gravity at the surface of the earth, 
then the force of attraction exerted by the earth on a 
body of mass m is F = mgR 2 /r 2 , where r is the distance 
from m to the center of the earth. If this body weighs 
100 lb at the surface of the earth, what does it weigh at 
an altitude of 1000 mi? At an altitude of 4000 mi? How 
much work is required to lift it from the surface to an 
altitude of 1000 mi? 

17 Generalize Problem 16 by finding how much work must 
be done by a rocket on a satellite of mass m in lifting 
it to an altitude h above the surface of the earth. 

18 Suppose that a hole is drilled straight through the cen¬ 
ter of the earth, and that a body of mass m is dropped 
into this hole. As the body falls, the force of attraction 
exerted on it by the earth is F — mgr/R , where r is the 
distance from m to the center of the earth. (The reason 
behind this law of force will become clear in a later 
chapter.) Find the work done by the earth in pulling m 
from the surface down to the center. 

19 A conical tank 10 ft deep and 8 ft across the top is full 
of water. Find the work done in pumping the water over 
the top of a nearby 12-ft fence. 

20 Find the work done in Problem 19 if the tank is initially 
filled only to a depth of 5 ft and if the water is pumped 
just to the top of the tank and over the edge. 
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21 A spherical tank of radius a is at the top of a tower with 
its bottom at a distance h above the ground. How much 
work is needed to fill the tank with water pumped from 
ground level? 

22 A great conical mound of height h is built by the slaves 
of an oriental monarch, to commemorate a victory over 
the barbarians. If the slaves simply heap up uniform ma¬ 
terial found at ground level, and if the total weight of 
the finished mound is M, show that the work they do is 

i, \hM. 

4 23 The Great Pyramid of Egypt is perhaps the greatest sin¬ 
gle building ever erected by man. It was originally 482 
ft high with a square base 765 ft on a side, and it cov¬ 
ered an area large enough so that St. Peter’s in Rome, 
the cathedrals of Milan and Florence, Westminster 
Abbey, and St. Paul’s Cathedral in London could all be 
grouped within it. It contained enough stone to build a 
wall 1 ft thick and 7 ft high all the way around France. 
The Greek historian Herodotus said that it was built in 
20 years by the labor of 100,000 men. Calculate the 
plausibility of this assertion as follows: Assume that the 
Great Pyramid is made of stone that weighs 150 lb/ft 3 , 
that each laborer worked 10 hours per day for 350 days 
each year, and that each laborer did 200 ft-lb of effec¬ 
tive work per hour in lifting stones from ground level 
to their final positions in the pyramid . 1 If Herodotus’ 
figure of 20 years is correct, approximately how many 
-j laborers were needed? 

HI 24 Geologists who study mountain building are able to cal¬ 
culate the energy needed to lift a mountain up from sea 
level. In the case of Mt. Everest, assume the mountain 
has the shape of a cone of height 30,000 ft and radius 
of base 60,000 ft, with uniform density 150 lb/ft 3 . 

(a) How much work was required to build Mt. Everest 
if all of its constituent rock was initially at sea level? 

(b) The atomic bomb at Hiroshima released energy 
equivalent to 20,000 tons of TNT, and some hy¬ 
drogen bombs tested in the 1950s had energy 500 
times as great, on the order of 10 megatons of TNT. 
How does the work needed to build Mt. Everest 
compare with the energy of a 10 -megaton bomb? 
(10 megatons is about 3 X 10 15 ft-lb.) 

25 If the same amount of work done on two particles start¬ 
ing from rest causes one to move twice as fast as the 
other, how are their masses related? 

26 When the mass m of a particle is constant, Newton’s 
second law of motion 

r' dv 

F = ma = m —— (*) 

dt 

f If 200 ft-lb per hour seems too small a figure, remember that much 
time and strength were spent in quarrying, cutting, and transporting 
the stones. 


can be written in the less familiar form 

F = - 7 - ( mv ), (**) 

dt 

which remains valid even when m is not constant. Ac¬ 
cording to Einstein’s special theory of relativity, the 
mass is indeed not constant: It increases as the veloc¬ 
ity v increases, and is determined as a function of v by 
the formula 

m 0 

m = j --, 

Vi - Vi? 

where c is the velocity of light (approximately 186,000 
mi/s, or 300,000 km/s) and mo is the so-called rest mass. 
When this expression for m is inserted in (**), the re¬ 
sult is Einstein’s law of motion 

i=^?) ( *** } 

If v is small compared with c so that Vl - v 2 tc 2 is 
close to 1, then Newton’s law (*) is a very close ap¬ 
proximation to Einstein’s law. This is what happens for 
all problems of classical physics (in fact, this is almost 
a definition of classical physics). However, if ^ is an ap¬ 
preciable fraction of c, as in most phenomena of atomic 
physics, then the two laws differ considerably, and all 
the experimental evidence supports Einstein’s version. 

(a) Show that a particle acted on by a constant force F 
can never achieve the velocity of light, no matter 
how long the force acts.* Hint: Integration of (***) 
gives Ft/mo = v/\\ - v 2 /c 2 if the particle starts 
from rest. 

(b) By differentiation write (***) in the form 

mpa 

* (I - v 2 /c 2 ) 3/2 

where a = dv/dt, and thereby obtain another way 
of comparing Einstein’s law with Newton’s law (*). 
27 (This problem continues the ideas of Problem 26.) Con¬ 
sider a particle of rest mass mo that starts from rest at 
the origin on the ;c-axis and moves in the positive di¬ 
rection under the influence of a positive force F. Ein¬ 
stein’s law of motion [part (b) of Problem 26] tells us 
that a is also positive, so the velocity is increasing. If 
the energy E of the particle is understood to be the work 
done on it by F, show that E is related to the increase 
in the mass, which is M = m - mo, by Einstein’s 

*This is in sharp contrast with the implication of Newton’s law (*), 
that if the force F is constant, then the acceleration a is also constant, 
so the velocity increases at a constant rate, and can therefore be made 
greater than c if the force acts long enough. 
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famous equation E = Me 2 . Hint: Write a = — = 

- 7 - = v~- and calculate E = f F dx. [The crux of 

dx dt dx Jo 

Einstein’s equation — not mentioned in this problem— 

is the much deeper fact that the rest mass m 0 also has 

energy associated with it, in the amount m 0 c 2 . This can 


be thought of as the “energy of being” of the particle, 
in the sense that mass possesses energy just by virtue of 
existing. If we add this to the kinetic energy calculated 
in the problem, we obtain the complete Einstein 
equation: total energy = Me 2 + m 0 c 2 = (m — m 0 + 
mo)c 2 = me 2 .] 


7.8 

HYDROSTATIC FORCE 



Figure 7.39 


In the previous sections of this chapter we have seen how integration can be used 
to answer many natural questions that arise in geometry and basic physics. 

In this section we undertake a brief excursion into the science of hydrostatics , 
which is concerned with the behavior of liquids at rest. In particular, we calcu¬ 
late the force exerted outward against the walls of an open container by water at 
rest inside the container. The containers we consider can be anything from a small 
fishbowl to the reservoir behind a gigantic dam. We do not undertake this ex¬ 
cursion for its own sake, but rather because it provides an additional excellent il¬ 
lustration of the main theme of this chapter—the idea that the whole of a quan¬ 
tity can be calculated by dividing it into many convenient small pieces and adding 
up these pieces by means of integration. 

If a tank with a rectangular bottom and vertical sides is filled with water to a 
depth h (Fig. 7.39), then the force exerted downward on the bottom is equal to 
the weight of the water contained in the tank. If A is the area of the bottom, then 
this force is given by the formula 

F = whA, ( 1 ) 

where w is the weight-density of the water, which is approximately 62.5 lb/ft 3 , 
or -J 2 ton/ft 3 . It is obviously necessary for the units of measurement in (1) to be 
compatible. In our work we measure h in feet, A in square feet, and w in pounds 
or tons per cubic foot. The force F is then expressed in pounds or tons. 

If we divide (1) by A, then the resulting quantity 

p = wh ( 2 ) 


is the pressure , or force per unit area , exerted by the water on the bottom of the 
tank. The pressure at a given depth h below the surface can therefore be thought 
of as the weight of a column of water h units high that rests on a horizontal base 
whose area is 1 square unit. Formula (2) is quite remarkable, for it states that the 
pressure is proportional to the depth alone, and that the size and shape of the 
container are completely irrelevant. For example, at a depth of 4 ft in a swim¬ 
ming pool the pressure is the same as it is at a depth of 4 ft in a nearby lake 
(namely, 250 lb/ft 2 ) regardless of the size of the lake; and we find the same pres¬ 
sure at the bottom of a vertical glass tube 1 inch in diameter if we plug the bot¬ 
tom with a cork and fill it with 4 ft of water. Furthermore, it can be verified ex¬ 
perimentally that at any point in a liquid the pressure is the same in all directions. 
This means that a flat plate below the surface has the same pressure acting on 
one face at a given depth whether it is placed horizontally, vertically, or at an an¬ 
gle, and this pressure is normal (perpendicular) to the face of the plate. As skin 
divers know from personal experience, the water pressure on the eardrums de¬ 
pends only on how deep they are, and not at all on the angle at which the head 
is tilted. 
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In order to find the total force exerted by the water against the bottom of the 
tank in Fig. 7.39, it is enough to multiply the pressure at the bottom by the area 
of the base, 


F = pA, 


which is merely formula (1). It is more difficult to find the force against one of 
the sides, because the pressure is not constant there but increases as the depth 
increases. Instead of pursuing this particular problem, we consider a more gen¬ 
eral situation. 

In Fig. 7.40 we show a flat plate of unspecified shape submerged vertically in 
a body of water. To find the total force exerted by the water against one face of 
this plate, we imagine this face to be divided into a large number of thin hori¬ 
zontal strips. The typical strip shown in the figure is at a depth h below the sur¬ 
face. Its width dh is so small compared with h that the pressure is essentially 
constant over the entire strip, and has the value p = wh. The area of the strip is 
dA — x dh , so the element of force dF acting against the strip is given by 

dF = p dA = wh • x dh. 

The total force F acting against the whole face of the plate is now obtained by 
integrating these elements of force as our typical strip sweeps across the plate 
from top to bottom, 

F= j dF= [wh-xdh. (3) 



In order to carry out the indicated integration in a specific problem, it is neces¬ 
sary to know x as a function of h, and this is determined geometrically from the 
shape of the plate. As in the preceding sections of this chapter, it is better to un¬ 
derstand and apply the ideas used in constructing formula (3) than to try to mem¬ 
orize this formula and use it without thinking. We repeat the crux of the method: 
Thin horizontal strips are used because the pressure can be treated as essentially 
constant over all of such a strip, and the force acting against this strip is then 
simply the pressure times the area. 


Example 1 A vertical gate in a dam has the shape of a square 4 ft on a side, the 
upper edge being 2 ft below the surface of the water (Fig. 7.41). Find the total 
force this gate must withstand. 


Solution In this case x = 4 and h increases from 2 to 6, so 


F = J wh • 4 dh = 2wh 2 


= 2w • 32 = 2 tons. 



Example 2 A triangular dam in a ditch is 10 ft across the top and 6 ft deep (Fig. 
7.42). Find the force of the water against this dam when the water is at the top 
and ready to spill over. 

Solution By similar triangles we see that 



x_ _ 6 — h 

To" 6 ’ 


so 


X = f(6 - h). 


Figure 7.42 
































254 


APPLICATIONS OF INTEGRATION 


Since h increases from 0 to 6, we have 

F = J* wh • f(6 - h) dh = |w(3/i 2 - i/i 3 ) 
= 60w = l| tons = 3750 lb. 


PROBLEMS 

In Problems 1-4, it is assumed that the face of a dam adja¬ 
cent to the water is vertical and has the stated shape. In each 
case find the total force against the dam. 

1 A rectangle 150 ft wide and 12 ft high; water 8 ft deep. 

2 An isosceles trapezoid 200 ft wide at the top, 100 ft 
wide at the bottom, and 20 ft high; reservoir full of wa¬ 
ter. 

3 An isosceles triangle 60 ft wide at the top and 20 ft high 
in the center; reservoir full of water. 

4 An isosceles trapezoid 90 ft across the top, 60 ft across 
the bottom, and 20 ft high; water 12 ft deep. 

In Problems 5-8, it is assumed that a vertical gate in the face 
of a dam has the stated shape. In each case find the total force 
of the water against the gate. 

5 A triangle 4 ft wide at the top and 5 ft high, with up¬ 
per edge 1 ft below the water surface. 

6 An isosceles trapezoid 6 ft wide at the top, 8 ft wide at 
the bottom, and 6 ft high, with upper edge 4 ft below 
the water surface. 

7 A triangle 4 ft wide at the bottom and 4 ft high, with 
the upper vertex 2 ft below the water surface. 

8 A semicircle 4 ft in diameter with its diameter at the 
water surface. 


CHAPTER 7 REVIEW: CONCEPTS, METHODS 

Define, state, or think through the following. 

1 Area by vertical or horizontal strips. 

2 Volume by disks, moving cross sections, or washers. 

3 Volume by cylindrical shells. 

4 Arc length by ds = Vdx 2 + dy 2 . 

ADDITIONAL PROBLEMS FOR CHAPTER 7 

SECTION 7.2 

In Problems 1-13, sketch the curves and find the areas of the 
regions they bound. 

1 y = x 2 , y = x. 

2 jc = 3y + y 2 , jc + y + 3 = 0. 


9 A cylindrical barrel 4 ft high and 3 ft in diameter stands 
upright and is half filled with oil that weighs 50 lb/ft 3 . 
What is the total force of the oil against the lateral wall 
of the barrel? 

10 If the barrel in Problem 9 lies on its side, what is the 
force of the oil against one of the circular ends? 

11 A rectangular gate in a vertical dam is 5 ft wide and 6 
ft high. Find the force against this gate when the water 
level is 8 ft above its top. How much higher must the 
water rise to double the force? 

12 The vertical ends of a water trough are isosceles trian¬ 
gles with base 3 ft and height 2 ft. Find the force against 
one end when the trough is full of water. 

13 The end of a swimming pool is a rectangle inclined 45° 
to the horizontal. If the edge at the surface is 12 ft long 
and the submerged edge 10 ft long, find the force the 
water exerts against this rectangle. 

*14 A rectangular tank is filled with two nonmixing liquids 
whose densities are w\ and W 2 , where w\ < W 2 - In one 
side of the tank there is a square window 3V^2 ft on a 
side with one of its diagonals vertical and the upper ver¬ 
tex 1 ft below the surface, and with the other diagonal 
on the boundary between the liquids. Find the force the 
liquids exert against the window. 


5 Area of surface of revolution. 

6 Work by dW = F dx. 

7 Kinetic and potential energy. 

8 Pressure. 

9 Hydrostatic force by dF = p dA. 


3 y = x 4 — 2x 2 , y = 7x 2 . 

4 y 2 = x 3 , x = 4. 

5 y = x 2 - 2x - 3, y = 2x + 2. 

*6 y = — 7 ==, x + 3y - 5 = 0. 
V* + 2 
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7 y = 6x — jc 2 , y = jc. 

8 y 2 = 4jc, 2jc - y = 4. 

9 y 2 = 2jc, x - y = 4. 

10 y = 4 - jc 2 , y = 4 - 4 jc. 

11 y 2 = ~4x + 4, y 2 = -2* + 4. 

12 y = 9 - x 2 , y = x 2 . 

13 y = 9 - jc 2 , (x + 3) 2 = -4y. 

14 Find the complete area enclosed by y 2 = 9x 2 - jc 4 . 

15 Find the area bounded by y = jc 2 , y = 4, y = 2 — x. 

16 Find c > 0 so that the area bounded by y = x 2 — c and 
y = c - x 2 equals 9. 

*17 Find the area of the region in the second quadrant 
b ounded by the jc-axis and the parabolas y = jc 2 , y = 

Vjc+ 18. 

*18 Find the area between 4 y = x 3 and its tangent at jc = -2. 
SECTION 7.3 

19 Find the volume of the solid of revolution generated 
when the area bounded by the given curves is revolved 
about the jc-axis: 

(a) y = 2 - * 2 , y = 1; 

(b) y = 3 jc - jc 2 , y = jc; 

(c) y 2 = 4jc, y = jc; 

(d) y = x 2 + 3, y = 4; 

(e) Vjc + Vy = Va, x = 0, y = 0. 

20 Find the volume generated by revolving the area 
bounded by jc = y 2 and jc = 4 about 

(a) the jc-axis; (b) the y-axis; 

(c) the line y = 2; (d) the line jc = 4; 

(e) the line jc = - 1. 

21 Find the volume generated by revolving the area 
bounded by jc = 4y — y 2 and jc = 0 about 

(a) the y-axis; (b) the jc-axis. 

22 Each plane perpendicular to the jc-axis intersects a cer¬ 
tain solid in a circular cross section whose diameter lies 
in the jcy-plane and extends from y = jc 2 to y = 8 — jc 2 . 
The solid lies between the points of intersection of these 
curves. Find its volume. 

23 The base of a certain solid is the circle jc 2 + y 2 = a 2 . 
Each plane perpendicular to the jc-axis intersects the 
solid in a cross section that is an isosceles right trian¬ 
gle with one leg in the base of the solid. Find the vol¬ 
ume. 

24 The base of a certain solid is the area bounded by jc 2 = 
4 ay and y = a. Each cross section perpendicular to the 
y-axis is an equilateral triangle with one side lying in 
the base. Find the volume of the solid. 

25 A plane which is perpendicular to the jc-axis and con¬ 
tains a circle of radius jc 2 moves from jc = a to jc = b. 
If the center of the circle moves along a curve y = /(jc), 
find the volume of the solid the circle generates. 

*26 A solid is generated by revolving about the jc-axis the 
area bounded by a curve y = /(jc), the jc-axis, and the 


lines jc = a and x = b. Its volume is ir{b 3 — b 2 a) for all 
b > a. Find/(jc). 

27 Find the volume generated by revolving the area 
bounded by the curves jc 2 = 4 ay, y = a, x = 0 about 
(a) the y-axis; (b) the jc-axis; (c) the line y = a. 

28 Let R be a region of area A in a horizontal plane, and 
suppose that R is bounded by a closed curve C that does 
not intersect itself. Let P be a point whose height above 
this plane is h, and form a generalized “cone” by draw¬ 
ing segments connecting P to the points of C. Show that 
the volume of this cone is V = \Ah. Hint: If A(jc) is the 
area of the horizontal cross section at a height jc above 
the plane, observe that A(jc) = [(h - x) 2 /h 2 ]A. 

29 A line passes through a vertex of a square of side a and 
is perpendicular to the plane in which the square lies. 
As this vertex moves a distance h along the line, the 
square turns through a complete revolution with the line 
as the axis. Find the volume of the screw-shaped solid 
the square generates. What is the volume if the square 
turns through two complete revolutions while moving 
the same distance along the line? 

30 The square bounded by the axes and the lines jc = 1, 
y = 1 is cut into two parts by the curve y = x n , where 
n is a positive constant. Find the value of n for which 
these two parts generate equal volumes when revolved 
about the y-axis. 

*31 Two oblique circular cylinders of equal height h have a 
circle of radius a as a common lower base and their up¬ 
per bases are tangent to each other. Find the common 
volume. 

SECTION 7.4 

In Problems 32-37, sketch the region bounded by the given 
curves and use the shell method to find the volume of the solid 
generated by revolving this region about the given axis. 

32 y = ^Vx, jc = 8, y = 0; the y-axis. 

33 jc = y 2 — 4y, jc = 0; the jc-axis. 

34 y = 5 jc - jc 2 , y = 0; the y-axis. 

35 jc = y 3 + 1, y + 2 jc = 2, y = 1; the jc-axis. 

36 y = jc 2 , y = jc 3 ; the y-axis. 

37 2jc - y = 12, jc — 2y = 3, jc = 4; the y-axis. 

38 The region bounded by the given curves is revolved 
about the y-axis. Find the volume of the solid of revo¬ 
lution by using both the shell method and the washer 
method. 

(a) y = 4 jc - jc 2 , y = 0. 

(b) y = x 3 , x = 2, y = 0. 

39 The region in the first quadrant between y = 3 jc 2 and 
y = ^-jc 2 + 1 is revolved about the y-axis. Find the vol¬ 
ume generated in this way. 

40 The region bounded by y 2 = 4 jc and y = jc is revolved 
about the jc-axis. Find the volume generated in this way 
(a) by the shell method; (b) by the washer method. 
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*41 Consider the torus generated by revolving the circle 
(x — b) 2 + y 2 = a 2 (0 < a < b ) about the y-axis. Use 
the shell method to show that the volume of this torus 
equals the area of the circle times the distance traveled 
by its center during the revolution. Hint: At the right 
moment, change the variable of integration from x to 
z = x - b. 

42 Find the volume generated by revolving about the 
y-axis the region bounded by y = (x — 1 )(x — 2)(x —3) 
and the x-axis between x = 1 and x = 2. 


SECTION 7.5 

In Problems 43-49, find the length of the specified arc of the 
given curve. 

43 9y 2 = 4x 3 between (0, 0) and (3, 2V3). 


44 


y = i * 4 + i’ ls *- 2 - 


45 

46 

47 

48 

49 

50 


1 3 , 1 

y= 6* + 2? 


3. 


x = Jo y5 + ^^ y ^ 2 - 
1 , 2 

y = — x 3 H—. 2 < x < 4. 

7 24 x 


y = ^Vx(4x - 3), 1 < x < 9. 
y = ^(l + 4x 4/5 ) 372 , 1 <x<32. 


Let A and B be positive constants. If 0 < a < b, show 
that the problem of finding the length of the arc of the 
curve 


y = Ax 3 + — 

X 

for a ^ x < b leads to the integral 

f (3Ax 2 + Bx~ 2 ) dx 
Ja 

if AB = 4 

51 Let A and B be positive constants. If 0 < a < b, find a 
simple condition relating A and B that makes it possi¬ 
ble to calculate the length of the arc of the curve 

y = Ax 4 + 4 
x z 

between x = a and x = b by means of an integral not 
involving a square root. 

52 Solve Problem 51 for the curve 


y = Ax 5 + 4- 
x 


SECTION 7.6 

In Problems 53-55, find the area of the surface of revolution 
generated by revolving the given arc about the indicated axis. 
53 y = f(l + x 2 ) 3/2 , 0 < x < 3, the y-axis. 


54 y = fx 372 — yx 172 , 0 < x < 4, the y-axis. 

55 y = 2V15 - x, 0 < x < 15. the x-axis. 

56 The loop of 18y 2 = x(6 -x) 2 is revolved about the 
x-axis. Find the area of the surface generated in this 
way. 

57 Sketch the graph of 8 a 2 y 2 = x 2 (a 2 - x 2 ) and find the 
area of the surface generated when this curve is revolved 
about the x-axis. 

SECTION 7.7 

58 A 4-lb force will stretch a spring 6 in. How much work 
is done in stretching it 3 ft? 

59 A spring pulls back with a force of 7 lb when it is 
stretched from its natural length of 12 in to a length of 
13 in. How much work is required to compress it from 
a length of 11 in to a length of 7 in? 

60 Show that the work done in stretching a spring of nat¬ 
ural length L from a length a to a length b (L < a< b) 
is equal to the amount of the stretch (b - a) times the 
tension in the spring when its length is + b ). 

61 A bag of sand is lifted at the constant rate of 3 ft/s for 
10 seconds. At the beginning the bag contains 100 lb 
of sand, but the sand leaks out at the rate of 4.5 lb/s. 
How much work is done in lifting this bag? 

62 If a certain gas in a cylinder obeys an adiabatic gas law 
of the form pV 5/3 = c , and if it initially occupies 64 in 3 
at a pressure of 128 lb/in 2 , find the work it does against 
the piston in expanding to 8 times its initial volume. 

63 Find the work done in compressing 1024 ft 3 of air at a 
pressure of 27 lb/in 2 down to 243 ft 3 if the air obeys 
the adiabatic gas law pV [ 4 = c. 

64 Generalize Problem 63 by finding the work done in 
compressing air of initial volume V\ and pressure p\ 
down to a volume V 2 , assuming the adiabatic gas law 
pV L4 = c . 

*65 A conical buoy that weighs B pounds floats upright in 
water with its vertex a feet below the surface. If the top 
of the buoy is ja feet out of the water, how much work 
is done in pushing the buoy down until its top is just at 
the surface of the water? 

*66 A spherical buoy of radius a feet that weighs B pounds 
has exactly the weight-density w of water, so that it 
floats with its top just touching the surface. A crane on 
a dock lifts the buoy until it just clears the water. How 
much work is done? 

67 If two electrons are held fixed at the points x = 0 and 
x = — 1 on the x-axis, find the work done in moving a 
third electron along the x-axis from x = 4 to x = 1. 

68 Imagine a very deep mine shaft, of depth D = \R, ex¬ 
tending halfway down to the center of the earth (ignore 
all practical difficulties caused by the internal constitu¬ 
tion of the earth). A person whose weight is vv at the 
surface is lifted from the bottom of the shaft to the top. 
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Under the assumption that the weight remains constant 
during the journey, the work done would be wD. Show 
that the work done during this process is actually fwD, 
by taking into account the fact that the force of gravity 
below the surface of the earth is proportional to the dis¬ 
tance from its center. 

69 A tank has the shape of the paraboloid of revolution ob¬ 
tained by revolving y = x 1 (0 < x < V5) about the 
y-axis. If it is full of water, how much work is required 
to empty it by pumping all the water out over the edge? 

70 Let a cylindrical barrel of diameter 3 ft and height 5 ft 
be filled to a depth of 2 ft with water and then, above 
the water, with 2 additional ft of oil that weighs 50 lb/ft 3 . 
Find the work done in pumping the water and oil over 
the edge of the barrel. 

71 A hemispherical tank of radius 8 ft is full of water. If 
a hole is punched in the bottom, find the work done by 
gravity in emptying the tank. 

72 Two cables are hanging side by side from the ceiling of 
a gymnasium. The first is an elastic cable of length L 
and the second is inelastic and has length 2 L. As two 
gymnasts of equal weight climb down these cables, the 
weight of the first stretches his cable to a total length 
of 2 L. Show that when the two gymnasts climb back 
up to the ceiling, the first does only f of the work done 
by the second. 

SECTION 7.8 

73 Find the force due to water pressure against a rectan¬ 
gular floodgate 10 ft wide and 8 ft deep whose upper 
edge is at the surface of the water. 


74 Find the force against the lower half of the floodgate in 
Problem 73. 

In Problems 75 and 76, it is assumed that a vertical gate in 
the face of a dam has the stated shape. In each case find the 
total force on the gate. 

75 A triangle 6 ft wide and 4 ft high, with upper edge at 
the water surface. 

76 A triangle with base B and height H, with its vertex at 
the water surface. 

77 A rectangular canal lock is 30 ft wide. When the water 
is 20 ft deep, what is the force of the water against the 
lock? 

78 A rudder has the shape of an isosceles right triangle 
whose equal legs are 2 ft long. It is submerged verti¬ 
cally in water with one of the equal legs vertical and 
the other horizontal, and with the horizontal leg 3 ft be¬ 
low the surface and the opposite vertex 1 ft below the 
surface. Find the force of the water against one face of 
the rudder. 

79 A rectangular gate in a dam has width 10 ft and height 
8 ft. Find the force against the gate when the water level 
is 20 ft above its top. 

80 Assume that the gate in Problem 79 cannot withstand 
a force greater than 100 tons. How high must the wa¬ 
ter be above the top of the gate in order to break 
through? 

*81 The vertical end of a vat is a segment of a parabola 
opening upward which is 4 ft across the top and 8 ft 
deep. What is the force against this end when the vat is 
full of beer weighing 60 lb/ft 3 ? 


To understand how Archimedes discovered the volume of a sphere, it is neces¬ 
sary to know a little about the level of knowledge from which he started 

As he states in one of his treatises, it was Democritus two centuries earlier 
who discovered that the volume of a cone is one-third the volume of a cylinder 
with the same height and the same base. 1 ’ We will need this fact. 

Also, the Greeks knew a little analytic geometry, but without our notation. 
They were acquainted with the idea that a locus in a plane can be studied by con¬ 
sidering the distances from a moving point to two perpendicular lines, and if the 
sum of the squares of these distances is constant, they knew that the locus is a 
circle. In our notation, this condition amounts to the equation x 2 + y 2 — a 2 . 


APPENDIX: 
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f Democritus (about 460-370 B.C.) was the founder of the atomic theory of matter and the greatest 
philosopher of physical science among the ancient Greeks. He wrote at least 75 works on almost 
every conceivable subject, from physics and mathematics to logic, ethics, magnets, fevers, diets, agri¬ 
culture, law, “the sacred writings in Babylon,” “the right use of history,” and even the growth of an¬ 
imal horns, spiders and their webs, and the eyes of owls. Of these works there remain only a few 
hundred fragments quoted by later writers; for example: “It is better to examine one’s own faults than 
those of others,” and “I would rather discover one cause than gain the kingdom of Persia.” Plato hated 
him and was jealous of him, and Aristotle praised his genius—two weighty recommendations. 
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Further, Archimedes himself virtually created Greek physics. It is well known 
that he discovered the law of floating bodies. More important for our present pur¬ 
poses, he also discovered the principle of the lever and many facts about centers 
of gravity. 

We are now ready to follow Archimedes in his search for the volume of a 
sphere. He considered the sphere to be generated by revolving a circle about its 
diameter. In modern notation we start with the circle 

x 2 + y 2 = 2ax, (1) 

which has radius a and i s tangent to the y-axis at the origin. This circle is shown 
on the left in Fig. 7.43, which is almost identical with Archimedes’ original fig¬ 
ure. Equation (1) contains the term y 2 , and since Try 2 is the area of the variable 
cross section of the sphere jc units to the right of the origin, it is natural to mul¬ 
tiply through by tt and write (1) in the form 

7 tx 2 + Try 2 = 7t2 ax. (2) 

This leads us to interpret 77X 2 as the area of the variable cross section of the cone 
generated by revolving about the x-axis the right triangle under the line y = x be¬ 
tween x = 0 and x = 2a. This in turn suggests that we seek a similar interpreta¬ 
tion for the term ir2ax on the right side of (2). If we persist in this search, we 
might perhaps think of multiplying by 2 a and thus rewriting (2) as 

2a(Trx 2 + 77y 2 ) = xir(2a) 2 . (3) 

The motivation for this change clearly lies in the fact that 7r(2a) 2 can now be in¬ 
terpreted as the area of the cross section of the cylinder with the same height and 
base as the cone. 

We therefore have on the left in Fig. 7.43 three circular disks viewed edge on, 
of areas Try 2 , ttx 2 , and Tr(2a) 2 , which are the intersections of a single plane with 
three solids of revolution. This plane is perpendicular to the x-axis at a distance 
x units to the right of the origin, and the solids are the sphere, the cone, and the 
cylinder, as indicated in the figure. 

On the left side of equation (3) the sum of the first two areas is multiplied by 
2a, and on the right side the third area is multiplied by x. This observation led 



Figure 7.43 Archimedes’ balancing 
argument. 
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Archimedes to the following great idea, as shown on the right in the figure. He 
left the disk with radius 2 a where it is, in a vertical position x units to the right 
of the origin; and he moved the disks with radii y and x to a point 2 a units to the 
left of the origin, where he hung them in a horizontal position with their centers 
(centers of gravity) under this point, suspended by a weightless string. The pur¬ 
pose of this maneuver can be understood only if we think of the x-axis as a lever 
and the origin as its fulcrum or balancing point. It can now be seen that equa¬ 
tion (3) deals with moments. (A moment is the product of the suspended weight 
and the length of the lever arm.) From this point of view, equation (3) states that 
the combined moments of the two disks on the left equals the moment of the sin¬ 
gle disk on the right, and so, by Archimedes’ own principle of the lever, this lever 
is in equilibrium. 

We now carry out the final step of the reasoning. As x increases from 0 to 2a, 
the three cross sections sweep through their respective solids and fill these solids. 
Since the three cross sections are in equilibrium throughout this process, the solids 
themselves are also in equilibrium. Let V denote the volume of the sphere, which 
was unknown until Archimedes finished this calculation. If we use Democritus’ 
formula for the volume of the cone, and also the volume of the cylinder and the 
obvious location of its center of gravity, then the equilibrium of the solids in the 
positions shown in the figure yields the equation 

2a[jTT(2a) 2 (2a) + V] = air(2a)H2a). (4) 

It is now easy to solve (4) for V and obtain 

V = JTTCI 3 . 

The ideas discussed here were created by a man who has been described— 
with good reason—as “the greatest genius of the ancient world.” Indeed, nowhere 
can one find a more striking display of intellectual power combined with imag¬ 
ination of the highest order. Archimedes himself was so pleased with his dis¬ 
covery that he asked for a figure to be cut on his tombstone showing a sphere 
inscribed in a cylinder. And it was done.* 

*The Roman orator Cicero wrote the following about two centuries after the death of Archimedes: 

I shall call up from the dust on which he drew his figures [the dust on the ground was the black¬ 
board of the ancient mathematicians] an obscure, insignificant citizen of Syracuse, Archimedes. 
When I was quaestor I sought out his grave, which was unknown to the Syracusans (as they to¬ 
tally denied its existence), and found it enclosed all round and covered with brambles and thick¬ 
ets; for I remembered certain doggerel lines inscribed, as I had heard, upon his tomb, which stated 
that a sphere along with a cylinder had been set up on top of his grave. Accordingly, after taking 
a good look all round (for there are a great quantity of graves at the Agrigentine Gate), I noticed 
a small column rising a little above the bushes, on which there was the figure of a sphere and a 
cylinder. And so I at once said to the Syracusans (I had their leading men with me) that I believed 
it was the very thing of which I was in search. Slaves were sent in with sickles who cleared the 
ground of obstacles, and when a passage to the place was opened we approached the pedestal 
fronting us; the epigram was traceable with about half the lines legible, as the latter portion was 
worn away. [Cicero’s Tusculan Disputations (Loeb Classical Library, p. 491).] 

The present writer visited Syracuse in 1987, and if there had been any hope of finding this tomb — 
more than 2000 years later— he would have stayed until he found it. 

Archimedes (287-212 B.C.) died in the conquest of Syracuse by the Romans during the Second 
Punic (Carthaginian) War. In the general confusion following the fall of the city, he was found con¬ 
centrating on some diagrams he had drawn in the sand, and was killed by a marauding soldier who 
did not know who he was. In one version of the story he said to the intruder, who came too close, 
“Do not disturb my circles,” whereupon the enraged soldier ran a sword through his body. 
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8.1 

INTRODUCTION 


EXPONENTIAL 
AND LOGARITHM 

FUNCTIONS 


Our main purpose in this chapter is to learn how to work successfully with the 
indefinite integral 



As we shall see, this purpose compels us to study the special exponential and 
logarithm functions 

y = e x and y = log e x. (2) 

The letter e used in these functions denotes the most important special number 
in mathematics after i r. In decimal form it is an infinite nonrepeating decimal 
that is known to hundreds of thousands of decimal places; the first few digits are 

2.71828, . . . 

The ultimate reason for our interest in these matters is that the integral (1) and 
the functions (2) arise in a great variety of problems involving population growth, 
radioactive decay, chemical reaction rates, electric circuits, and many other phe¬ 
nomena in physics, chemistry, biology, geology, and virtually every science that 
uses quantitative methods, including meteorology, oceanography, and even ar¬ 
chaeology. This integral and these functions are also indispensable in many 
branches of pure mathematics. 

In order to reach a clear understanding of why the number e and the functions 
(2) matter so much, it is desirable to broaden the context a bit and consider the 
more general exponential and logarithm functions 

y = a x and y = log a Jt, 

where a is a positive constant 1. This is where we begin, and by adopting this 
approach we hope to make it perfectly clear that we choose a equal to e for com¬ 
pelling reasons of convenience and simplicity. 
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Students who have managed to get this far in this book certainly have a work¬ 
ing grasp of exponents, and perhaps also of logarithms as defined in terms of ex¬ 
ponents. Nevertheless, we briefly review the main definitions and facts as they 
appear in the traditional approach. 

We consider expressions of the form a x where a > 0 and x is any real num¬ 
ber. It is easy to explain exactly what a x means if x is an integer n , and we as¬ 
sume students understand this explanation. The following is a brief reminder: 


8.2 

REVIEW OF 
EXPONENTS AND 
LOGARITHMS 


If n > 0, then a n = a • a • • • a (n factors), 


a* = 1, 


a m a n = a 


= a m ~\ 


a " 

(. a m ) n = a" 


e.g., 

e.g., 

e.g., 


a 2 a 3 = (a • a)(a • a • a) = a • a • a • a • a = a 5 ; 

a 5 a • a • a • a • a « • « • a a • a ~ 

—r =---;— = a • a = a z ; 

a J a - a - a a • a • a 1 

(a 3 ) 2 = (a - a - a)(a • a - a) = a • a • a • a - a - a = a 6 . 


Next, in Section 3.5 we summarized the meaning of fractional exponents, and 
we repeat the essence of this summary here. If r — plq is a fraction in lowest 
terms with q > 0, then by definition 

a r = a p!n = ( 1 ) 

where S/a is the unique positive number whose gth power is a. 

If the exponent x is an irrational number, then difficulties appear that students 
might not notice if we didn’t mention them. For instance, what is meant by the 
expression 2 V ^7 Clearly, it doesn’t make sense to multiply 2 by itself V2 times. 
Also, since V2 can’t be written as a fraction, definition (1) is useless. Is 2^ re¬ 
ally a definite number with a specific value? The answer is Yes, but this is not 
at all obvious. A natural way to proceed is to use the fact that any irrational num¬ 
ber can be approximated as closely as we please by rational numbers. We can 
therefore define a x by 

a x — lim o r , 

r— 

where r approaches jc through rational values. This way of defining a x when jc is 
irrational is satisfactory from the logical point of view; however, it is a long and 
tedious chore to prove rigorously that everything works out as we expect and that 
the familiar laws of exponents remain valid. We skip over these boring details 
and merely state the final result, that the laws of exponents continue to hold in 
the following form: 

a x 'a x i = a x > + \ — = a x >~\ (a x 'Y* = a x '\ 

a x 2 

where x\ and x 2 are arbitrary real numbers. 

The next natural step in this development is to examine the properties of the 
general exponential function y = a x . Here again we simply state the important 
facts without making any attempt to discuss the logical details of how these facts 
are established. As above, we assume that a is a positive constant, and also that 
a ¥= 1. The case a = 1 is of no interest because \ x — 1 for all x. Let us suppose 
first that a > 1. Then y = a x is a continuous function of x; it is increasing; its 
values are all positive; and it has the further obvious properties that 

lim a x = 0 and lim a x = <». 

X—^ — oc x— >00 


(2) 
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To sketch the graph, we plot a few points corresponding to several integral val¬ 
ues of x, both positive and negative, and then connect these points by a smooth 
curve, as shown in Fig. 8.1. If a < 1, then y = a x is a decreasing function and 
its graph has the shape shown in Fig. 8.2. 

When this much information about exponents is known or assumed, it is very 
easy to define logarithms and obtain some of their properties. On the most prim¬ 
itive level, a logarithm is an exponent. Thus, the fact that 100 = 10 2 says that 2 
is the logarithm of 100 to the base 10 (written 2 = logi 0 100); and 4 = 64 1/3 
says that y is the logarithm of 4 to the base 64 (y = log6 4 4). 

More generally, the properties of exponents discussed above show clearly that 
if a is a positive constant =£ 1, then to each positive x there corresponds a unique 
y such that x = a y . This y is written in the form y = \og a x, and is called the log¬ 
arithm of x to the base a. Accordingly, 

y = log fl x has the same meaning as x = a y , (3) 

in the sense that each equation expresses the same relation between x and y, with 
the first written in a form solved for y and the second in a form solved for x. We 
can state this somewhat differently by saying that the symbol “log a ” is created 
for the specific purpose of enabling us to solve x = a y for y in terms of x. 

The basic properties of logarithms are direct translations of corresponding 
properties of exponents. Thus, if x\ = a y > and x 2 — a y \ then X\X 2 = a y \a y ^ = 
a yi+y 2 . But y\ = loga x\ and y 2 — \og a x 2 , so we have 


l0 g a *\X 2 = l0g fl X\ + l0g a X 2 . 


Similarly, 


and 


log a —= log a x\ - log fl x 2 
x 2 


logo x b = b log* x, 


where b is any real number. Further, (3) tells us that 

a' og ° x = x and log a a x = x. 

We note also that the particular facts 

log a 1 = 0 and log a a = 1 

are equivalent to 1 = a 0 and a = aK 
In studying the logarithm function 

y = log fl x, (4) 

we consciously think of x and y as variables instead of mere numbers. Our start¬ 
ing point is the fact that (4) is equivalent to x — a y . It is clear from this that x 
must be positive in order for y to exist, so (4) is defined only for x > 0. The graph 
of (4) is easy to obtain from the graph of x = a y by interchanging the axes, as 
we show in Fig. 8.3 for the case a > 1.* In this case y = \og a x is evidently an 


*By “interchanging the axes” we mean the following: on the left in Fig. 8.3, imagine the two axes 
and the curve to be three pieces of stiff wire glued together into a rigid frame; lift this frame off the 
page and flip it over in space so that the x- and y-axes are revolved into their normal positions; and 
finally, return the frame to the page as it appears on the right in the figure. 
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Figure 8.3 


increasing continuous function of x. The features of this function that correspond 
to the properties (2) are 

lim log* x = -°o and lim log* x = <*>. 

jc -* 0 + x —>°° 

The most convenient logarithm for actual numerical calculations is the loga¬ 
rithm to the base 10, the so-called common logarithm. Common logarithms were 
once widely used by engineers and scientists and students in high school 
trigonometry courses, but such uses have virtually disappeared in these days of 
calculators and computers. However, modern technological changes in the way 
people do calculations have had no influence whatever on the importance of the 
logarithm as a function ; it remains indispensable in the theoretical parts of math¬ 
ematics and its applications, and these theoretical uses are what concern us in 
this chapter. 


PROBLEMS 

1 Express in terms of logarithms: 


(c) 81 0 ' 5 = 9; 

2 Express in terms c 
(a) log jo 10 = 1; 
(c) log 5 15 = -2; 

3 Evaluate: 

(a) logio 10,000; 
(c) logio 0.0001; 

4 Solve for x: 

(a) log 4 x = 3.5; 
(c) log 3 x = 5; 


(d) 32 4/5 = 16. 
exponents: 

(b) log 2 8 = 3; 
(d) lo g 6 216 = 3. 

(b) log 2 64; 

(d) logs 4. 

(b) log 8 x = 

(d) log 32 x = 0.6. 


5 Find the base a: 

(a) log* 4 = 0.4; (b) log* 8 = -f; 

(c) \og a 36 = 2; _(d) log* 7 = f 

6 If y = log* (x + Vx 2 - 1), show that x = \{a y + a~ y ). 

— I 7 Show that log* (x + Vx 2 - 1) = -log* (x - Vx 2 - 1). 

8 The magnitude M of an earthquake on the Richter scale 
is a number that ranges from M - 0 for the smallest 
earthquake that can be detected by instruments to 
M = 8.9 for the greatest known earthquake. M is given 
by the empirical formula 

M= f logio i? 
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where E is the energy released by the earthquake in kilo- 
watthours and £ 0 = 7 X 10“ 3 ,* 

(a) Suppose the magnitudes o f two earthquakes differ by 
1 on the Richter scale. Show that the ratio of the en¬ 
ergy of the larger earthquake to that of the smaller is 
10 3/2 = 31.62. 

(b) How much energy is released by an earthquake of 
magnitude 6? 

(c) A city whose population is 300,000 uses about 
3 X 10 5 kilowatthours (kWh) of electric energy 
every day. If the energy of an earthquake could some¬ 
how be transformed into electric energy, how many 
days’ supply for this city would be provided by the 
earthquake in part (b)? 

(d) The great Alaskan earthquake of 1964 had a Richter 
magnitude of 8.4. Answer the question in part (c) for 
this earthquake. 

9 In chemistry the pH of a solution is defined by the for¬ 
mula pH = —logio[H + ], where [H + ] denotes the hydro¬ 
gen ion concentration as measured in moles per liter. 1 " 

"Charles Richter (1900-1985) was a professor of seismology at Cal¬ 
Tech. He invented his magnitude measure in 1935. The following are 
Richter magnitudes, and numbers of people killed, in other memo¬ 
rable earthquakes of recent history: 

8.9—Japan: Mar. 2, 1933, 2990 dead; 

8.3 — San Francisco, Calif.: Apr. 18, 1906, 2000 dead; 

8.2—Tangshan, China: July 28, 1976, 242,000-800,000 dead; 

8.1— Mexico City: Sept. 19, 1985, 9500 dead; 

7.7—Peru: May 31, 1970, 66,794 dead; 

7.2— Italy: Nov. 23, 1980, 3000 dead; 

6.9 — Armenia: Dec. 7, 1988, 25,000 dead. 

f The symbol “pH” is an abbreviation of the French expression puis¬ 
sance d’Hydrogene (power of hydrogen). 


(One mole—or gram molecular weight—of a substance 
consists of 6 X 10 23 molecules of the substance.) The 
value of [H + ] for pure water is found by experiment to 
be 1.00 X 10 -7 . 

(a) What is the pH of pure water? 

(b) A solution is called acidic or basic (alkaline) ac¬ 
cording as its value of [H + ] is greater or less than 
that for pure water. What pH’s characterize acidic 
and basic solutions? 

10 Show that the number log 3 2 is irrational. Hint: Assume 
the contrary, that log 3 2 = p/q where p and q are posi¬ 
tive integers, and express this in terms of exponents. Can 
an integral power of 3 equal an integral power of 2? 

11 Find the flaw in the following “proof” that { < mul¬ 
tiply both sides of the inequality 1 < 2 by log y to get, 
successively, 

1 ■ log 7 < 2 • log i 

log y < log ({) 2 , 

log y < log i 

i < 1 

2 ^ 4 - 

12 A prime number is an integer p > 1 that has no factors 
except itself and 1. The first few primes are p — 2, 3, 5, 
7, 11,.... In 1992 the largest known prime (discovered 
by people who like to play with supercomputers) was 

2756,839 _ \ 

(a) When written out in decimal form, how many digits 
will this number have? Hint: Solve the equation 
10 * = 2 756 ’ 839 . 

(b) How many pages of this book will be needed to print 
this number? (One page holds about 4600 digits.) 


8.3 

THE NUMBER e AND 
THE FUNCTION y = e x 


The number e is often defined by the limit 

e = lim [\ + . (1) 

n—>°° \ n J 

This definition has the advantage of brevity but the serious disadvantage of shed¬ 
ding no light whatever on the significance of this crucial number. We prefer to 
define e differently, in a manner that reveals as clearly as possible why this num¬ 
ber is so important. We then obtain (1) later, as merely one among many explicit 
formulas for e that can be used in a variety of ways. 

Our aim in this section is to study a function y — f(x) that is unchanged by 
differentiation: 


t/ ( x)=/(x) - 


( 2 ) 
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It is far from obvious that any such function exists [we don’t count the trivial 
case/O) = 0]. As we shall see, the desired function turns out to be one of the 
exponential functions y = a x for a > 1. The central meaning of the number e can 
now be stated as follows: It is the specific value of the base a that causes the 
function/(x) = a x to have the property (2). In this way we understand what pur¬ 
pose e serves. However, we must still give a satisfactory definition and show as 
simply as possible that this definition accomplishes the stated purpose. 

Let us calculate the derivative of f(x) = a x and see what happens. As usual 
when differentiating a new type of function, we go back to the definition of the 
derivative, 


—/W = lim 
dx Ax ->0 


fix + Ax) -fix) 
\x 


It will be convenient here to denote the increment by the single letter h instead 
of the familiar Ax (Fig. 8.4): 

_d_ 
dx ' 


= lim 

a x-rn _ a x 

= lim 

h-> 0 

h 

h —>0 


( a h — 1 

\ 

= lim 

I aX i 

= a x 

/i—>0 

V h 

) 


n h — 


h —>0 


(3) 


p h — 


e is the number for which lim 
/«-» o 


= 1. 



y = a x 

Slope of 
chord 
a h - 1 

h 

/ 1 

/ i 

/ \ i 

(0, 1) 

' 1 

^. 4 4 - . w. 

cr 

h x x + h 


As Fig. 8.4 shows, the quantity in parentheses on the right side of (3) is the slope 
of the tangent line to the curve y = a x at the point (0, 1). If this slope equals 1, 
then the right side of (3) reduces to a x and this particular function a x has the 
property (2). This brings us to our definition: e is the specific value of the base 
a that produces this result, that is, 


Figure 8.4 


(4) 


We can obtain considerable insight into the nature of the number e by sketch¬ 
ing y = a x for the cases a = 1.5, a = 2, a = 3, and a = 10, as shown in Fig. 8.5. 
These curves tell us that as the base a increases continuously from numbers close 
to 1 to larger numbers, the slope of the tangent to y = a x at the point (0, 1) in¬ 
creases continuously from values close to 0 to larger values, and therefore this 
slope is exactly equal to 1 for some intermediate value of a. This intermediate 
value is e\ and as we hope students will agree, it is geometrically clear from these 
remarks that e exists. Next, we plot the points on the first three of these curves 
corresponding to x = 1 in order to stress the fact that the slopes of the chords 
joining these points to (0, 1) are y, 1, and 2. This is conclusive geometric evi¬ 
dence that the slope of the tangent at (0, 1) is <1 for the cases a = 1.5 and a = 
2, and plausible evidence that this slope is >1 for the case a — 3; and therefore 
e is certainly >2 and probably <3. 

In Fig. 8.6 we show the graph of y = e x with emphasis placed on its defining 
characteristic: It is the single member of the family of exponential functions y = 
a x (a > 1) whose tangent line at the point (0, 1) has slope 1. The function y = 
e x is often called the exponential function, to distinguish it from its compara¬ 
tively unimportant relatives. 

We can investigate the number e more closely by noting that (4) tells us that 

e h - 1 

—-— is approximately equal to 1, 
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and that this approximation gets better and better as h approaches 0. Thus, by 
simple manipulations we obtain 


e h - j 


h 


= 1 , 


e h -\ =h, e h =l +h, e = (1 + h) ]/ \ 


and finally, 

e = lim (1 + h) VH . (5) 

/i—>o 

In words, this says that e is the limit of 1 plus a small number, raised to the power 
of the reciprocal of the small number, as that small number approaches 0. If we 
write h — Mn where n is understood to be a positive integer that —> as h —» 0, 
then (5) yields 

e = Lim f 1 + —V, 

y n ) 

which is (1). This formula enables us to compute rough approximations to e fairly 
easily, as the following table shows: 



1 2 

2 | = 2\ = 2.25 

3 £ = = 2.370 

4 625 = 2 — = 2 441 

H 256 z 256 


However, this is a slow process and the value of e has been computed to great 
accuracy by other and more efficient methods. To 15 decimal places it is 

e = 2.718281828459045. . . .* 


The number e, like the number 77, is woven inseparably into the fabric of both 
nature and mathematics. Many remarkable properties of e have been discovered 
over the centuries. For example, e is irrational; indeed, it is not even a root of 
any polynomial equation with rational coefficients. 

However, we must not forget our original purpose in this section, which was 
to study a function that is unchanged by differentiation. We have now made a 
good start on this task, in the sense that we have explained the meaning of the 
following statement and established its validity: 


d_ 

dx 


e x 


= e x . 


( 6 ) 


An equivalent statement is that y = e x satisfies the differential equation 


dy_ 

dx 


= y- 


Every function y = ce x also satisfies this equation, because 


*Many people remember this much of e by grouping the digits this way, 

2.7 1828 1828 45 90 45, 

in order to visualize the repeated 1828 followed by 45, then twice 45, then 45 again. 
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y- = fr ( ce x ) = c ~ e 
dx ax ax 


X — = 


Further, we assert that these are the only functions that are unchanged by differ¬ 
entiation. To prove this, suppose that y =/(;c) is any function with this property. 
Then by the quotient rule, 


_d_ 

dx 


fix) 


e x f\x) ~f(x)e x e x f(x) ~ f(x)e x 




= 0 . 


This implies thatf(x)/e x = c for some constant c, so f(x) = ce x , as stated. 
By the chain rule, (6) generalizes immediately to 


d_ 

dx 



(7) 


where u = u{x) is understood to be any differentiable function of x. 


Example 1 


In view of (7), the following derivatives are obvious: 


— e *x = 4 ^, 

dx 


—— e xl — 2xe * 2 , 
dx 




e 


l/x 


If we write (7) in differential form, as d(e u ) = e u du, then by reading this back¬ 
wards we obtain the integration formula 

J e u du = e u + c. (8) 


Example 2 To inte g rate I e5x dx, we write 

J e 5x dx = j J e 5x d{5x) = \e 5x + c, 

where 5x plays the role of u in formula (8). This problem is so simple that there 
is no need to make explicit use of the method of substitution. It suffices to keep 
in mind what (8) says and make minor adjustments accordingly, as indicated. 


Example 3 integral 

f 9xe X 3j2+2 dx 

' V3jc 2 + 2 

is more complicated. Our only hope is that (8) will see us through, so we write 
u = V3jc 2 + 2 = (3* 2 + 2) 1/2 
and 

3x dx 

du = 2O* 2 + 2) -1/2 6x dx = / 

V3x 2 + 2 

This substitution (or change of variable) enables us to express the given integral 
in a much simpler form, and thereby to finish the calculation, 

f = 3 [ e*du = 3e» + c = 3e^ + c. 

J V3?T2 1 
















268 


EXPONENTIAL AND LOGARITHM FUNCTIONS 


Students should observe that the complicated appearance of the given integral is 
only a disguise concealing the relatively simple form displayed in (8). Learning 
the art of integration is mostly learning to see the underlying form through the 
disguise. 


Example 4 Continuously compounded interest. If P dollars is deposited in a bank 
that pays an interest rate of 8 percent per year, compounded semiannually, then 
after t years the accumulated amount is 

A = P(\ + 0.04) 2 '. 

More generally, if the interest rate is 100* percent (* = 0.08 for 8 percent), and 
if this interest is compounded n times a year, then after t years the accumulated 
amount is 


A 




nr 


If n is now increased indefinitely, so that the interest is compounded more and 
more frequently, then we approach the limiting case of continuously compounded 
interest. To find the formula for A under these circumstances, we observe that 
(5) yields 


w 

17 jc 


1 + - 

/ 

\ W 


so 


A = Pe xt . (9) 

Ordinary compound interest produces growth in spurts or jumps at the end of 
each interest period. In contrast to this, we see from (9) that continuously com¬ 
pounded interest produces steady continuous growth of a type called exponen¬ 
tial growth. In Sections 8.5 and 8.6 we discuss many additional examples of ex¬ 
ponential growth as it occurs in the natural sciences. 


Remark 1 The function e x grows very rapidly as * increases; in fact, it grows 
faster than x p for any fixed positive exponent p , no matter how large, in the sense 
that 


e x 

lim —— = oo. 

A'—>oo X P 

An outline of a proof for the case in which p is a positive integer n is given in 
Additional Problems 18 to 20. 

Remark 2 We have deduced the existence of the limits in (1) and (5) from 
the definition of e given in (4). However, this definition itself is highly geo¬ 
metric in nature, and some mathematicians might be inclined to dismiss 
our entire approach to these ideas as “reasoning by wishful thinking.” To 
mollify such critics, and also for the occasional students who might be inter¬ 
ested, we provide an independent proof of the existence of these limits in 
Appendix A.8. 
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PROBLEMS 


In Problems 1-10, find the derivative dy/dx of the given func¬ 
tion. 

y = l(e x - e~ x ). 
y = x 2 e~ x2 . 
y = x e + e x . 


1 

y = i(e x + e~ x ). 

2 

3 

y = x 2 e x . 

4 

5 

y = e e \ 

6 


ax — 1 


7 

y = 2 eax - 

cr 

8 

9 

y = (2jc 2 — 2jc + l)e 2x . 

10 

Evaluate the integrals in Problem 

11 

fe 3x dx. 

12 

13 

j e^ l/5)x dx. 

14 

15 

j 6x 2 e x3 dx. 

16 

17 

Sketch the graph of each of 


fxe x2 dx. 
f 3 dx 


e^ x dx 


Vx 


find its maximum and minimum points and points of in¬ 
flection: 

(a) y = e~ x2 \ (b) y = xe x,2> . 

18 Find the base of the largest rectangle that rests on the 
;t-axis and has its upper vertices on the curve y = e~ xl . 

19 Sketch the curve y = + e~ x ) and find its length from 

jc = 0 to jc = b (b > 0). 

20 The arc in Problem 19 is revolved about the jc-axis. Find 
the area of the surface of revolution generated in this way. 

21 If a particle moves on the jc-axis in such a way that its 
position * at time t is given by jc = Ae kt + Be~ kt , where 
A, B, and k are constants, show that the particle is re¬ 
pelled from the origin with a force proportional to its dis¬ 
tance from the origin. Hint: Use Newton’s second law of 
motion, F = ma. 

22 If the tangent to y = e x at the point jc = jco intersects the 
;c-axis at jc = jci, show that xo — x\ = 1. 

23 Graph y = e~ x , find the area under this curve from jc = 
0 to jc = b (b > 0), and find the limit approached by this 
area as b —» «>, 


24 

25 


26 


27 


28 


29 


Verify that y = e~ x and y = e 2x are both solutions of the 
differential equation y" — y* — 2y = 0. 

Evaluate the following limits: 

(a) lim (1 + 4-) 2 "; 

„_>oo y 2/2 / 

/ 1 \3rt+l 

( b) i™ (' + 3^n) ; 

(c) lim (1 + -LY"; 

rt-->oo y yf J 

(d) lim (l + Tf". 

(e) lim f 1 + TV 

\ 2/2 / 

Use the argument in Example 4 to obtain the formula 

e* = lim (1 + -V. 

n->oo \ YIJ 

Use the Intermediate Value Theorem (Section 2.6) to 
show that the equation jc + e x = 0 has a root. Why does 
this equation have only one root? 

Use Newton’s method (Section 4.6) to calculate the root 
of the equation in the preceding problem correct to six 
decimal places. 

When Benjamin Franklin died in 1790, it was found that 
he had directed in his will for the sum of $10,000 to be 
given jointly to the cities of Philadelphia and Boston, to 
be invested at compound interest for the benefit of the 
people of those cities. In a recent TV broadcast by CNN 
it was stated that in 1990, 200 years later, the accumu¬ 
lated amount would have been 90 billion dollars 
($90,000,000,000) if invested at a “reasonable” rate of 
interest. What rate of interest would have been required 
to achieve this astounding result? 


Logarithms to the base 10—common logarithms—are often taught in high 
school, starting with the following familiar definition: For any positive number 
jc, logio jc is that number y such that jc = 10^. In just the same way, for any pos¬ 
itive number jc, log*, jc is defined to be that number y such that jc = e y . This is il¬ 
lustrated on the left in Fig. 8.7. 

The number log*, jc is called the natural logarithm of jc, for reasons that will 
become clear in Remark 2. In deference to standard practice at this level, we de¬ 
note this number by the simpler notation In jc, pronounced “ell enn ex.” Thus, 


8.4 

THE NATURAL 
LOGARITHM FUNCTION 
y = In x EULER 


y = In jc has the same meaning as jc = e y , 


in the sense that we are dealing here with a single equation, first written in a 
form solved fory and then written in a form solved for jc. The graph of y = In jc 
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Figure 8.7 



is obtained by simply turning over the graph of x = e y so as to interchange the 
positions of the axes (Fig. 8.7, right). Just as in Section 8.2, the natural logarithm 
function y = In x is defined only for positive values of x and has the following 
familiar properties: 

In X\X 2 = In x\ + In X 2 and In — = In x\ — In X 2 \ 

*2 

In x b = b In x; 

e \nx = x an d In e x = x; 
lim In x = — °o and lim In x = °o 

X— > 0 + x—too 


Also, In 1 = 0 and In e = 1. 

We can compute the derivative dy/dx of the function y = \n x very easily, by 
differentiating x = e y implicitly with respect to x: 


1 



dy_ = J_ = I 
dx e y x 


This yields the formula 



and we immediately have the chain rule extension 


d_ 

dx 


. 1 du 

In u = — —, 
u dx 


( 1 ) 


where u is understood to be any differentiable function of x. 


Example 1 


As direct applications of (1) we have 
1 d{ 3x+ 1) 

-mOx+D-^r^—= 


d . t 

^ ln( 


1 - x 2 ) = ■ 


1 d( 1-x 2 ) 
- x 2 dx 


3 

3x + 1 ’ 
—2x 
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d_ / 3x \ 1 (2x + 1) • 3 - 3x • 2 

dx n \2x + 1 ) 3x/(2x + 1) (2x + I) 2 

= _ 1 _ 

~~ x(2x + 1)' 

We point out that the last calculation can be simplified by first writing 
In [3 jc/(2jc + 1)] = In 3 + In jc — In (2x + 1), so that 

d_ / 3x \ = J_ _ 2 = 1 

dx n \ 2x + \ ) x 2x + 1 x(2x + 1)’ 


The differential version of (1) is d (In u) = du/u, which leads at once to the 
main formula of this chapter, 

f — = In u + c. (2) 

J u 

It is understood in (2) that u is positive, because only in this case does In u have 
a meaning. However, it is easy to see that the integrand can always be written 
with a positive denominator, by juggling the signs. Thus, if u < 0 we can write 

J u J —u 

Many writers cover all cases by writing (2) in the form 

/ ^ = In |n| + c. 

However, we shall not do this, for the reason that most of the applications re¬ 
quire a quick transition from logs to exponentials, and the presence of the ab¬ 
solute value sign interferes with the smooth operation of this process. We prefer 
to use (2) as it is, and to remember as we do this that u must be positive. In sit¬ 
uations where u is negative, we easily make the minor adjustments indicated in 

(3). 

Students will recall that the fundamental integration formula 

f u n+1 

\ u n du = —— + c, n # — 1, 

j n + 1 

failed to cover one exceptional case, namely, n = — 1. Formula (2) now fills this 
gap, since it tells us that 

J u~ ] du = - In w + c. 


Example 2 The following applications of (2) are easy to carry out by inspec¬ 
tion: 
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In more complicated problems it is desirable to make an explicit substitution or 
change of variable, in order to diminish the likelihood of accidental error. 


In Section 5.4 we discussed the method of separation of variables for solving 
differential equations. The equation 



(4) 


is one of the simplest and most important to which this method can be applied. 
We give the details of this procedure here because the same ideas will be used 
over and over again in the next two sections, and the sooner students become 
thoroughly familiar with them, the better: 


and finally, 


dy_ _ 


k dx, 


J — = J k dx. In y = kx + ci, 


y 

y — e 


lex. 

y = ce , 


where c is simply a more convenient notation for the constant e C] . From our point 
of view, the exponential and logarithm functions find their main reason for be¬ 
ing in the fact that they enable us to solve the differential equation (4) in this 
smooth and straightforward manner. It is also clear from the calculations just 
given that these functions go together like the two sides of a coin: you can’t spend 
one side without also spending the other. 

The next two sections are filled with many far-reaching applications of equa¬ 
tion (4) to various fields of science. We hope students will agree that these ap¬ 
plications fully justify the attention we have given to this differential equation 
and to the functions that are necessary for solving it. 


Remark 1 We know that In x — > oo as x — > «>. This property of the logarithm is 
illustrated on the right in Fig. 8.7. However, the graph of y = In x rises very 
slowly, since it is the mirror image of the rapidly rising graph of x = e y . Just 
how slowly y = In x increases can be understood by noticing that it doesn’t reach 
the level y = 10 until x = e w = 22,000. The fact that In x grows more slowly 
than x can be expressed by writing 

lim — = 0. (5) 

X 


We might try to estimate more accurately how slowly In x grows by comparing 
it with an even smaller function than x, say Vx or ^x. The remarkable fact is 
that In x grows more slowly than any positive power of x: 


,n * n 

lim —— = 0, 

x p 


(6) 


where p is any positive constant. Proofs of (5) and (6) are indicated in Problem 
13 and Additional Problem 26. 


Remark 2 We mention here another way of seeing—with additional clarity— 
how the number e arises in calculus. The idea is to calculate the derivative of 
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loga x as if we were doing this for the very first time in history, in an exploratory 
spirit, without any preconception of what the base a “ought” to be. We begin by 
applying the definition of the derivative, 


d_ 

dx 


log,, x = lim 

Ac—>0 


log«U + Ax) 
Ax 


log« * 


(7) 


Our next step is to manipulate the expression following the limit sign into a more 
convenient form by using the properties of logarithms discussed in Section 8.2, 


log a(x + Ax) - loga x = 
Ax 


^'°g a 


x + Ax 


= A7 log ‘- 


Ax 


=7iU og u*+- 


= 7 log " 


AxV 


x/Ax 


The definition (7) now yields 


-7- \og a x = lim 
dx Ax —>0 


7 log " 


= — lim 

X Ax—>0 


= -'°g a 


log,, 


lim 

Ax-»0 


/ Ax\ 

x/Ax" 

( 1 + v) 

1 

/ Ax\ 

x/Ax~ 

( 1 + v) 



AxY* 


If we maintain our spirit of research, then the distinctive limit in brackets here 
attracts our attention. It is natural to simplify its structure a bit by putting h = 
Ax/x, and to recognize that Ax ^ 0 is equivalent to h 0. We now define a new 
mathematical constant e by means of the resulting limit, 

e = lim (1 + h) l/h , (8) 

h —>0 


and we at once obtain 



- loga e. 


(9) 


One of our continuing purposes in calculus—though students may find this hard 
to believe—is to make the formulas we work with as simple as possible. Since 
\og e e = 1, it is clear that (9) takes its simplest form if the base a is chosen to 
be the number e\ 



( 10 ) 


The function log*, x (or In x) is called the “natural” logarithm because formula 
(10) makes it the most convenient logarithm to use in calculus and its applica¬ 
tions. 

The ideas described here are those by means of which the Swiss mathemati¬ 
cian Euler (pronounced “OlL-er”) essentially discovered both e and the functions 
In x and e* in the early eighteenth century. 
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Remark 3 Students should be informed that some writers define the function 
In x by the formula 

In x = j*!y- ( 11 ) 

These writers are then committed to deriving all the properties of the logarithm 
from the properties of this integral. Also, it is necessary to define the exponen¬ 
tial function in terms of the logarithm instead of the other way around. This ap¬ 
proach to the ideas of this chapter has its merits from the point of view of the 
theory of calculus. However, for most students, exponents come before loga¬ 
rithms as naturally as milk comes before cheese; and regardless of the fine points 
of logic, it is bound to seem perverse and unnatural to begin our subject with 
(11) — however much it may delight the soul of a mathematician. 


NOTE ON EULER 

Leonard Euler (1707-1783) was Switzer¬ 
land’s foremost scientist and one of the three greatest math¬ 
ematicians of modem times—the other two being Gauss 
and Riemann. 

He was perhaps the most prolific author of all time in any 
field. From 1727 to 1783 his writings poured forth in a seem¬ 
ingly endless flood, constantly adding knowledge to every 
known branch of pure and applied mathematics, and also to 
many that were not known until he created them. He aver¬ 
aged about 800 printed pages a year throughout his long life, 
and yet he almost always had something worthwhile to say 
and never seems long-winded. The publication of his com¬ 
plete works was started in 1911, and the end is not yet in 
sight. This edition was planned to include 887 titles in 72 
volumes. However, since that time extensive new deposits 
of previously unknown manuscripts have been unearthed. It 
is now estimated that more than 100 large volumes will be 
required for completion of the project, well into the twenty- 
first century. Euler evidently wrote mathematics with the 
ease and fluency of a skilled speaker discoursing on sub¬ 
jects with which he is intimately familiar. His writings are 
models of relaxed clarity. He never condensed, and he rev¬ 
eled in the rich abundance of his ideas and the vast scope 
of his interests. The French physicist Arago, in speaking of 
Euler’s incomparable mathematical facility, remarked that 
“He calculated without apparent effort, as men breathe, or 
as eagles sustain themselves in the wind.” He suffered total 
blindness during the last 17 years of his life, but with the 
aid of his powerful memory and fertile imagination, and with 
helpers to write his books and scientific papers from dicta¬ 
tion, he actually increased his already prodigious output of 
work. 


Euler was a native of the city of Basel in Switzerland and 
a student of John Bernoulli at the University—himself one 
of the most eminent mathematicians of the time—but he 
soon outstripped his teacher. His working life was spent as 
a member of the Academies of Science at Berlin and St. Pe¬ 
tersburg, and most of his papers were published in the jour¬ 
nals of these organizations. After the launching of calculus 
by Newton and Leibniz in the seventeenth century, math¬ 
ematics developed rapidly but without much order or co¬ 
herence. Euler tamed this mathematical wilderness as the 
explorers and settlers tamed the wilderness that became the 
United States of America. He was also a man of broad cul¬ 
ture, well versed in the classical languages and literatures 
(he knew the Aeneid by heart), many modem languages, 
physiology, medicine, botany, geography, and the entire 
body of physical science as it was known in his time. How¬ 
ever, he had little talent for metaphysics or disputation, and 
came out second best in many good-natured verbal encoun¬ 
ters with Voltaire at the court of Frederick the Great. His 
personal life was as placid and uneventful as is possible for 
a man with 13 children. 

Though he was not himself a teacher, Euler has had a 
deeper influence on the teaching of mathematics than any 
other person. This came about chiefly through his three great 
treatises: Intmductio in Analysin Infinitorum (1748); Insti- 
tutiones Calculi Differentialis (1755); and Institutiones Cal¬ 
culi Integralis (1768-1794). There is considerable truth in 
the old saying that all elementary and advanced calculus 
textbooks since 1748 are essentially copies of Euler or copies 
of copies of Euler. These works summed up and codified 
the discoveries of his predecessors, and are full of Euler’s 
own ideas. He extended and perfected plane and solid ana- 
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lytic geometry, introduced the analytic approach to 
trigonometry, and was responsible for the modern treatment 
of the functions In x (= log* * x) and e x . He created a con¬ 
sistent theory of logarithms of negative and imaginary num¬ 
bers and discovered that In x has an infinite number of val¬ 
ues. It was through his work that the symbols e, 77, and i 
(= V-7) became common currency for all mathematicians, 
and it was he who linked them together in the astonishing 
equation e ni = — 1. This is merely a special case (put 
6 = 7r) of his famous formula e ie = cos 0 + i sin 9 —see 
Section 14.8—which connects the exponential and trigono¬ 
metric functions and is absolutely indispensable in higher 
analysis.* Among his other contributions to standard math¬ 
ematical notation were sin x, cos x, the use of f(x) for an un¬ 
specified function, and the use of 2 for summation. 1 ” Good 
notations are important, but the ideas behind them are what 
really count, and in this respect, Euler’s fertility was almost 
beyond belief. He preferred concrete special problems to the 
general theories in vogue today, and his unique insight into 
the connections among apparently unrelated formulas blazed 
many trails into new fields of mathematics which he left for 
his successors to cultivate. 

He was the first and greatest master of infinite series and 
infinite products, and his works are crammed with striking 
discoveries in these fields. James Bernoulli (John’s older 
brother) found the sums of several infinite series, but he was 
not able to find the sum of the reciprocals of the squares, 
1 + i + i + i^+---.He wrote, “If someone should suc¬ 
ceed in finding this sum, and will tell me about it, I shall be 
much obliged to him.’’ In 1736, long after James’s death, 
Euler made the wonderful discovery that 

.111 7r 2 

1 *4 + 9"T6-T- 

He also found the sums of the reciprocals of the fourth and 
sixth powers, 


1 +^4+^4 + 


“ 1 + i? + ir + '"“ w 


and 


1 +36+36 + 


■ 1 + 6? + ra + 


77 

945 ' 


When John heard about these feats, he wrote, “If only my 
brother were alive now.”* Few would believe that these for¬ 
mulas are related—as they are—to Wallis’s infinite prod¬ 
uct (1656), 

7t_2 2 4 4 6 6 

T~T‘T * 7’ 7’7’7 


Euler was the first to explain this in a satisfactory way, in 
terms of his infinite product expansion of the sine, 

v2 " 1 




5LY,-XV.. 


9n 2 


The ideas described here are explained more fully in the ap¬ 
pendices to Chapters 13 and 14. 

His work in all departments of mathematics strongly in¬ 
fluenced the further development of this subject through the 
next two centuries. He contributed many important ideas to 
differential equations, including substantial parts of the the¬ 
ory of second-order linear equations and the method of so¬ 
lution by power series. He gave the first systematic discus¬ 
sion of the calculus of variations, which he founded on his 
basic differential equation for a minimizing curve. He in¬ 
troduced the number now known as Euler s constant, 

y = lim [ 1 + 4 + I + . . . + i - i n „ ] = 0.5772 ... , 

1 n->°° \ 2 3 n ) 

which is the most important special number in mathematics 
after 77 and e (see Section 13.6). He discovered the integral 
defining the gamma function. 


T(x) = J o f l e * dt, 


*An even more astonishing consequence of his formula is the fact 
that an imaginary power of an imaginary number can be real, in 
particular i‘ = £ _7l/2 ; for if we put 6 = 7 t / 2 , we obtain e ml2 = i, so 

ji = ( e m/2 y — ^7n-/2 = 

Euler further showed that i l has infinitely many values, of which 
this calculation produces only one. 

*566 F. Cajori, A History of Mathematical Notations (Open Court, 
1929). 


which is often the first of the so-called higher transcenden¬ 
tal functions that students meet beyond the level of calcu¬ 
lus, and he developed many of its applications and special 
properties. He also worked with Fourier series, encountered 
the Bessel functions in his study of the vibrations of a 
stretched circular membrane, and applied Laplace trans¬ 
forms to solve differential equations—all before Fourier, 
Bessel, and Laplace were bom. In almost every direction 
people traveled in exploring the world of classical math¬ 
ematics, they met Euler coming back—for he had been that 
way before them. 


*The world is still waiting—more than 250 years later—for some¬ 
one to discover the sum of the reciprocals of the cubes. 






276 


EXPONENTIAL AND LOGARITHM FUNCTIONS 


Just as William Shakespeare’s business was the writing of 
plays for the use of his theatrical company, Euler’s business 
was mathematical research and publication for the acade¬ 
mies that employed him; and each was the greatest master 
of his business that the world has ever seen. We know that 
it is almost impossible to speak English without using 
Shakespeare’s words and phrases; and in the same way, it is 


almost impossible to think about mathematics without us¬ 
ing Euler’s thoughts.* 

*For students who wish to enter into the mind of this great math¬ 
ematician and experience some of his most interesting work in num¬ 
ber theory at first hand—and in a context not requiring much pre¬ 
vious knowledge—we recommend Chapter VI of G. Polya’s fine 
book. Induction and Analogy in Mathematics (Princeton University 
Press, 1954). 


PROBLEMS 


Simplify each of the following: 

(a) e ln 2 ; (b) In e 3 \ 

(d) In e ]/x \ (e) In (1 /e x )\ 

(g) e ~M\/x). ( h ) e \n3 + \nx. 

(j) \ne^/e\ (k)e ln4 “ ln3 ; 

(m) 6 3 In x+2 In y. (n) £>3 In 2 ; 

( р ) £^+2\x\x 

Find dy/dx in each case: 

(a) y = In (3* + 2); 

(с) y = In (e x + 1); 

(e) y = x In x — jc; 

(g) y = (In x) 2 \ 

In x 


(c) e~ lnx \ 

(f) e w/x \ 
(i) In e ]n l ; 
(1) In (In e ); 
(o) e 3 + ,n2 ; 


(b) y = In (x 2 + 1); 

(d) y = In (e x ) 3 ; 

(f) y = In x 2 ; 

(h) y = In (3x 2 - 4x + 5); 


(i)T = — 


(k) y = In (x + Vx 2 + 1 ). 
Find dy/dx in each case: 

(a) In xy + 2x - 3y = 4; 


(j) y = In (In jc); 


(b) In — — xy = 2. 


Find dy/dx in each case. Whenever possible, use proper¬ 
ties of logarithms to simplify the function before differ¬ 
entiating. See (a) and (b). 

(a) y = In (jcVjc 2 + 1) = In x + \ In (jc 2 + 1 ). 


(b) y - In 


' * + | = J [In (x - 1) - In (x + 1)]. 


(c) y = In (3x - 2) 4 . 

(d) y = ln (t TJ 

(e) y = 3 In x 4 . 

(f) y = In K 

(g) y “ 3 In 152^r. 

(h) y = 5 In 2 Ijc + 4 In 31 x. 

(i) >> = In 1. 

1 JC 3 

(j) ^=3 ln m 

(k) y = In [(3x - l)\2x + 5) 3 ] 


5 


6 


7 


8 

9 

10 

11 

12 

13 


Integrate each of the following: 



dx 

3x + r 

3x 2 + 2 J 

- dx\ 

x 

x dx 
x+ V 
x dx 

In a: dx 
x 

dx 

V^(V^ + d’ 



x dx 
3x 2 + 2’ 


x 

x dx 

x 2 + r 

(2x — 1) dx 
' x(x-\) 

dx 


x In x ’ 

P x _ p -x 

, . dx. 


If c is a positive constant, show that the equation cx + 
In x = 0 has exactly one solution. Hint: Sketch the graph 
of y = cx + In jc with special attention to the behavior of 
dy/dx. 

Show that the equation jc = In jc has no solution 

(a) by minimizing y = jc — In jc; 

(b) geometrically, by considering the graphs ofy = jcand 
y = In jc. 

Find the length of the curve y = {jc 2 — \ In jc between 
jc = 1 and jc = 8. 

Sketch the graph of y = jc 2 - 18 In jc. Locate all max¬ 
ima, minima, and points of inflection. 

The area under y = e~ x from jc = 0 to jc = In 3 is re¬ 
volved about the jc-axis. Find the volume generated in 
this way. 

The area under y = 1/Vxfromjc = 1 tojc = 4 is revolved 
about the jc-axis. Find the volume generated in this way. 
Show that the area under y = 1/jc from x = a to x = b 
(0 < a < b) is the same as the area under this curve from 
x = ka to x = kb for any k > 0. 

Prove that 


x )°° JC 


by first showing that for jc > 1 
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In x = 



jc*, where both the base and the exponent are variable. 
Thus, we can write 


Hint: Compare the graphs of y = 1 It and y = 1/Vr for 
t > 1. 

14 Use the result of Problem 13 to show that 

lim jc In jc = 0. 

*-> 0 + 


Hint: Change the variable to u = 1 /jc. 

15 Use the result of Problem 14 to sketch the graph of y = 
jc In jc for all jc > 0. Locate its minimum and verify that 
the graph is concave up everywhere. 

16 Sketch the graph of y = (In jc)/jc for all jc > 0, and locate 
its maximum and point of inflection. 

17 The speed at which a signal is transmitted along a cable 
on the bottom of the ocean is proportional to jc 2 In 1 /jc, 
where jc is the ratio of the radius of the core of the cable 
to the radius of the entire cable. What value of jc maxi¬ 
mizes the speed of transmission? 

18 Logarithmic differentiation is a technique for computing 
the derivative of a function like 

V = V(* + 1 )(.v - 2)(2.v + 7), 

which is fairly complicated but whose logarithm can be 
written in a much simpler form: 

In y = j[ln (jc + 1) + In (jc - 2) + In (2jc + 7)]. 


19 


20 


Find dy/dx by differentiating this equation implicitly with 
respect to jc. 

Use the method of Problem 18 to find dy/dx if 


(a) y = 


e x (x 2 - 1) 


(b) y = 


x 2 + 3 


V6x — 2 \l x + 5 

The method of logarithmic differentiation (see Problem 
18) can also be used to differentiate functions like y = 


In y = In jc* = jc In jc, 

or equivalently, 

y — e x\nx 

Find dy/dx from both equations and use this derivative 
to find the minimum value of y = jc* for jc > 0. Sketch 
the graph. 

21 Use the method of Problem 20 to find dy/dx if 

(a) y = jc* v ; (b) y = A 7 * = x l/x . 

Sketch the graph of the function in (b) and find its max¬ 
imum value. 

22 In Problem 21(b), the behavior of the function y = A 7 * 
for large jc shows that 

lim X/rc = 1. 

n—>°° 


Find the limits of the following expressions as n —> °°: 
(a) (In ri) x/n \ (b) (n In n) x,n \ 


(c) 


In n \ 


l In 




«oi4 


n ) 

23 Obtain the limit formula lim A 
the fact that 


+o(l + jc) 17 * = e by using 


(1 -f x) X/x = £ ln U+•*)/* — g[l.n(l-hjO— In l]/x. 

24 If a is a positive number, show that 

a x - 1 

lim-= In a . 

x —>0 JC 

Hint: The limit is a value of a certain derivative. 

25 Show that 

lim — 1) = In a. 

n —>oo 

Hint: Put jc = Mn in Problem 24. 


As we emphasized in Section 8.1, our main purpose in this chapter is to develop 
the mathematical machinery that is necessary for treating a variety of related ap¬ 
plications. This machinery is now in place, and the time has come to see what it 
can do. 

Example 1 Population growth. Consider a laboratory culture of bacteria with 
unlimited food and no enemies. If N = N(t) denotes the number of bacteria pre¬ 
sent at time t, it is natural to assume that the rate of change of N is proportional 
to N itself.* If the number of bacteria present at the beginning is and this 
number doubles after 2 hours (the “doubling time”), how many are there after 6 
hours? After t hours? 


8.5 

APPLICATIONS. 
POPULATION GROWTH 
AND RADIOACTIVE 
DECAY 


^Briefly, we expect twice as many “births” in a given short interval of time when twice as many bac¬ 
teria are present. 
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Solution Even though bacteria come in units and are not continuously divisi¬ 
ble, there are so many present, and they are produced at such tiny time intervals, 
that it is reasonable to treat N(t) as a continuous, even differentiable, function. 
The assumed law of growth tells us that 

—t- = kN (k>0), (1) 

at 


or, separating variables, 


dN , j 
~N =kdt - 


Integration yields 


In N = kt + c. 


( 2 ) 


To determine the value of the constant of integration c, we use the fact that ini¬ 
tially (at t — 0) we have N — Nq. Thus, in equation (2) we have In No = 0 + c 
or c = In No, so (2) becomes 

In N = kt + In Nq 


or 


and therefore 


In N — In No = kt, In — = kt, 
N 0 


N = N 0 e kt . 



(3) 


To find k we use the fact that the population doubles in 2 hours. This gives 
2 N 0 = N 0 e lk , e 2k =2, 2k = In 2, k = \ In 2, 


so (3) becomes 


N = iV 0 e (Mn2)/2 , 


(4) 


which gives the population after t hours. Finally, putting t = 6 in (4) gives N = 
N 0 e 3]l)2 = N(ye lnS = 8A^ 0 , so the population increases by a factor of 8 in 6 hours. 


The situation just described is another example of exponential growth. This 
type of growth is characterized by a function of the form (3) where the constant 
k is positive. 

Example 2 Radioactive decay. After 3 days, 50 percent of the radioactivity 
produced by a nuclear explosion has disappeared. How long does it take for 99 
percent of this radioactivity to disappear? 

Solution We assume for the sake of simplicity that the radioactivity is entirely 
due to a single radioactive substance. This substance undergoes radioactive de¬ 
cay into nonradioactive substances by means of the spontaneous decomposition 
of its atoms, at a steady rate that is a characteristic property of the substance it¬ 
self. Each such decomposition is accompanied by a small burst of radiation, and 
these bursts are detected and counted by Geiger counters. We are not concerned 
here with the inner complexities of these remarkable events, but only with the 
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fact that the rate of change of the mass of our substance is negative and is pro¬ 
portional at each moment to the mass of the substance at that moment.* This 
statement means that if x = x(t) is the mass of the radioactive substance at time 
t y then 

= -kx (k > 0), (5) 

at 

where the minus sign says that x is decreasing. The positive constant k is called 
the rate constant ; it clearly measures the speed of the decay process. As before, 
we separate the variables and integrate, 

— — -k dty In x = —kt + c. (6) 

x 


If xq is the amount of the substance produced by the explosion, so that x = xq 
when t = 0, then we see that c — In xq, so (6) becomes 


In x = —kt + In xq 


or 


In x — In x 0 = —kt, In — = —kt, — = e kt , 
x 0 *o 


and consequently 


— let 

x = xoe . 


(7) 


In principle at least, x is never zero, because the exponential e~ kt never vanishes. 
It is therefore inappropriate to speak of the “total lifetime” of a radioactive sub¬ 
stance. However, it is both convenient and customary to use the concept of half- 
life: The half-life of a radioactive substance is the time required for the substance 
to decay to half its original amount (Fig. 8.8). If we denote the half-life by T, 
then (7) yields {xo = x§e~ kT , so e kT = 2 and 

kT = In 2. (8) 


This equation relates the half-life to the rate constant k y and enables us to find 
either if the other is known. 

In the specific problem we started with, 50 percent of the radioactivity disap¬ 
pears in 3 days. This tells us that the half-life of the substance is 3 days, so by 
(8) we see that 3k = In 2 or k = } In 2; and in this particular case, (7) becomes 

x = x 0 e~ (t ln 2 ^ /3 . 


The disappearance of 99 percent of the radioactivity means that 1 percent re¬ 
mains, and therefore x = -^xo. This happens when t satisfies the equation 

75o*0 = xoe~ (r ln 2)/3 , 

which is equivalent to 

e (r 10 2)13 = 100 or i-!si. = In 100. 


Finally, by using tables of natural logarithms (or a calculator) we find that 


Thus, if the mass of our substance were doubled, we would expect to lose twice as many atoms by 
decomposition in a given short interval of time. 



Figure 8.8 
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t 


3 In 100 
In 2 


6 In 10 
In 2 


= 20 days. 


It should be understood that this example is greatly oversimplified, because an 
actual nuclear explosion produces many different radioactive by-products with 
half-lives varying from a fraction of a second to many years. Thus polonium 212 
(3 ten-millionths of a second) and krypton 91(10 seconds) would disappear al¬ 
most immediately, whereas strontium 90 (28 years) lingers for decades and con¬ 
tributes substantially to the dangers of nuclear fallout.* 


The situation just discussed is an example of exponential decay. This phrase 
refers only to the form of the function (7) and the manner in which the quantity 
x diminishes, and not necessarily to the idea that something is disintegrating. 

Remark The concepts explained in Example 2 are the basis for a scientific tool 
of fairly recent development which has been of great significance for geology 
and archaeology. In essence, radioactive elements occurring in nature (with known 
half-lives) can be used to assign dates to events that took place from a few thou¬ 
sand to a few billion years ago. For example, the common isotope of uranium 
(uranium 238) decays through several stages into helium and an isotope of lead 
(lead 206), with a half-life of 4.5 billion years. When rock containing uranium 
is in a molten state, as in lava flowing from the mouth of a volcano, the lead cre¬ 
ated by this decay process is dispersed by currents in the lava; but after the rock 
solidifies, the lead is locked in place and steadily accumulates alongside the par¬ 
ent uranium. A piece of granite can be analyzed to determine the ratio of lead to 
uranium, and this ratio permits an estimate of the time that has elapsed since the 
critical moment when the granite crystallized. Several methods of age determi¬ 
nation involving the decay of thorium and the isotopes of uranium into the var¬ 
ious isotopes of lead are in current use. Another method depends on the decay 
of potassium into argon, with a half-life of 1.3 billion years; and yet another, pre¬ 
ferred for dating the oldest rocks, is based on the decay of rubidium into stron¬ 
tium, with a half-life of 50 billion years. These studies are complex and suscep¬ 
tible to errors of many kinds; but they can often be checked against one another, 
and are capable of yielding reliable dates for many events in geological history 
linked to the formation of igneous rocks. Rocks tens of millions of years old are 
quite young, ages ranging into hundreds of millions of years are common, and 
the oldest rocks yet discovered are upward of 3 billion years old. This of course 
is a lower limit for the age of the earth’s crust, and so for the age of the earth it¬ 
self. Other investigations, using various types of astronomical data, age determi¬ 
nations for minerals in meteorites, and so on, have suggested a probable age for 
the earth of about 4.5 billion years.^ 

These radioactive elements decay so slowly that the methods of age determi¬ 
nation based on them are not suitable for dating events that took place relatively 

*For students who have not met these ideas before, the number following the name of each of the 
chemical elements mentioned is the mass number (= total number of protons and neutrons in the nu¬ 
cleus) of the particular isotope referred to. For example, strontium as it occurs in nature has four sta¬ 
ble isotopes of mass numbers (in the order of their abundance) 88, 86, 87, 84. Several unstable iso¬ 
topes are produced in nuclear reactions, of which strontium 90 is the best known. 

+ For a full discussion of these matters, as well as many other methods and results of the science of 
geochronology, see F. E. Zeuner, Dating the Past, 4th ed. (Methuen, 1958). 
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recently. This gap was filled by Willard Libby’s discovery in the late 1940s of 
radiocarbon , a radioactive isotope of carbon (carbon 14) with a half-life of about 
5600 years. By 1950 Libby and his associates had developed the technique of 
radiocarbon dating , which added a second hand to the slow-moving geological 
clocks just described and made it possible to date events in the later stages of the 
ice age and some of the movements and activities of prehistoric people. The con¬ 
tributions of this technique to late Pleistocene geology and archaeology have been 
spectacular. 

In brief outline, the facts and principles involved are these. Radiocarbon is pro¬ 
duced in the upper atmosphere by the action of cosmic ray neutrons on nitrogen. 
This radiocarbon is oxidized to carbon dioxide, which in turn is mixed by the 
winds with the nonradioactive carbon dioxide already present. Since radiocarbon 
is constantly being formed and constantly decomposing back into nitrogen, its 
proportion to ordinary carbon in the atmosphere has long since reached an equi¬ 
librium state. All air-breathing plants incorporate this proportion of radiocarbon 
into their tissues, as do the animals that eat these plants. This proportion remains 
constant as long as a plant or animal lives; but when it dies it ceases to absorb 
new radiocarbon, while the supply it has at the time of death continues the steady 
process of decay. Thus, if a piece of old wood has half the radioactivity of a liv¬ 
ing tree, it lived about 5600 years ago, and if it has only one-fourth this ra¬ 
dioactivity, it lived about 11,200 years ago. This principle provides a method for 
dating any ancient object of organic origin, for instance, wood, charcoal, veg¬ 
etable fiber, flesh, skin, bone, or horn. The reliability of the method has been ver¬ 
ified by applying it to the heartwood of giant sequoia trees whose growth rings 
record 3000 to 4000 years of life, and to furniture from Egyptian tombs whose 
age is also known independently. There are technical difficulties, but the method 
is now felt to be capable of reasonable accuracy as long as the periods of time 
involved are not too great (up to about 50,000 years). 

Radiocarbon dating has been applied to thousands of samples, and laborato¬ 
ries for carrying on this work number in the dozens. Among the more interest¬ 
ing age estimates are these: linen wrappings from the Dead Sea scrolls of the 
Book of Isaiah, recently found in a cave in Palestine and thought to be first or 
second century B.C., 1917 ± 200 years; charcoal from the Lascaux cave in south¬ 
ern France, site of the remarkable prehistoric paintings, 15,516 ± 900 years; char¬ 
coal from the prehistoric monument at Stonehenge, in southern England, 3798 
± 275 years; charcoal from a tree burned at the time of the volcanic explosion 
that formed Crater Lake in Oregon, 6453 ± 250 years. Campsites of ancient peo¬ 
ple throughout the western hemisphere have been dated by using pieces of char¬ 
coal, fiber sandals, fragments of burned bison bone, and the like. The results sug¬ 
gest that human beings did not arrive in the New World until about the period of 
the last Ice Age, some 11,500 years ago, when the level of the water in the oceans 
was substantially lower than it now is and they could have walked across the 
Bering Straits from Siberia to Alaska.* 


*Libby won the 1960 Nobel Prize for chemistry as a consequence of the work described here. His 
own account of the method, with its pitfalls and conclusions, can be found in his book Radiocarbon 
Dating, 2nd ed. (Univ. of Chicago Press, 1955). 
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PROBLEMS 

1 The bacteria in a certain culture increase according to 
the law dN/dt = kN. If N = 2000 at the beginning and 
N = 4000 when t = 3, find (a) the value of iV when 
t = 1; and (b) the value of t when N = 48,000. 

2 If the rate of increase of the population of a country is 3 
percent per year, by what factor does it increase every 
10 years? What percentage increase will double the pop¬ 
ulation every 10 years? 

3 Sleepy ville has 5 times the population of Boomtown. The 
first is growing at the rate of 2 percent per year, and the 
second at 10 percent per year. In how many years will 
they have equal populations? 

4 It is often assumed that \ acre of land is needed to pro¬ 
vide food for one person. It is also estimated that there 
are 10 billion acres of arable land in the world, and there¬ 
fore a maximum population of 30 billion people can be 
sustained if no other sources of food are known. The to¬ 
tal world population at the beginning of 1970 was 3.6 
billion. Assuming that the population continues to in¬ 
crease at the rate of 2 percent per year, when will the 
maximum population be reached? What will be the pop¬ 
ulation in the year 2000? 

5 The half-life of radium is 1620 years. What percentage 
of a given quantity of radium will remain after 100 years? 

6 Cobalt 60, with a half-life of 5.3 years, is extensively 
used in medical radiology. How long does it take for 90 
percent of a given quantity to decay? 

7 In a certain chemical reaction a compound C decomposes 
at a rate proportional to the amount of C that remains. It 
is found by experiment that 8 g of C diminish to 4 g in 
2 hours. At what time will only 1 g be left? 

8 “A fool and his money are soon parted.” One particular 
fool loses money in gambling at a rate (in dollars per 
hour) equal to one-third of the amount he has at any given 
time. How long will it take him to lose half of his orig¬ 
inal stake? 

9 A cylindrical tank of radius 4 ft and height 10 ft, with 
its axis vertical, is full of water but has a small hole in 
the bottom. Assuming that water squirts out of the hole 
at a speed proportional to the pressure at the bottom of 
the tank, and that one-fifth of the water leaks out in the 
first hour, find a formula for the depth of the water left 
in the tank after t hours. 

10 According to Lambert's law of absorption, the percent¬ 
age of incident light absorbed by a thin layer of translu¬ 
cent material is proportional to the thickness of the layer. 
If sunlight falling vertically on ocean water is reduced to 
one-half its initial intensity 7 0 at a depth of 10 m, show 
that the formula 

/ = / 0 £-(*ln2)/10 

gives the intensity / at a depth of x meters. 


11 According to Newton's law of cooling, a body at tem¬ 
perature T cools at a rate proportional to the difference 
between T and the temperature of the surrounding air. A 
vat of boiling soup at 100°C is brought into a room where 
the air is 20°C, and is left to cool. After 1 hour its tem¬ 
perature is 60°C. How much additional time is required 
for it to cool to 30°C? 

12 Consider a column of air of cross-sectional area 1 in 2 ex¬ 
tending from sea level up to “infinity.” The atmospheric 
pressure p at an altitude h above sea level is the weight 
of the air in this column above the altitude h . Assuming 
that the density of the air is proportional to the pressure 
(this is a consequence of Boyle’s law pV = k at constant 
temperature), show that p satisfies the differential equa¬ 
tion 

dp 

jf = " cp> 

where c is a positive constant, and deduce that 
P = Poe~ c \ 

where p 0 is the atmospheric pressure at sea level. Hint: 
If h increases by a small amount dh and dp is the corre¬ 
sponding change in p (see Fig. 8.9), then —dp is the 
weight of the air in the small portion of the column whose 
height is dh, and this weight is the density times the vol¬ 
ume, so —dp = (cp){\ • dh). 



Figure 8.9 


13 The radiocarbon in living wood decays at the rate of 
15.30 disintegrations per minute (dpm) per gram of con¬ 
tained carbon. Using 5600 years as the half-life of ra¬ 
diocarbon, estimate the age of each of the following spec¬ 
imens discovered by archaeologists and tested for 
radioactivity in 1950: 

(a) a piece of a chair leg from the tomb of King Tu¬ 
tankhamen, 10.14 dpm; 
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(b) a piece of a beam of a house built in Babylon dur¬ 
ing the reign of King Hammurabi, 9.52 dpm; 

(c) dung of a giant sloth found 6 ft 4 in under the sur¬ 
face of the ground inside Gypsum Cave in Nevada, 
4.17 dpm; 

(d) a hardwood atlatl (spear-thrower) found in Leonard 
Rock Shelter in Nevada, 6.42 dpm. 

14 Suppose that two chemical substances in solution react 
together to form a compound. If the reaction occurs by 
means of the collision and interaction of the molecules 
of the substances, then we expect the rate of formation 
of the compound to be proportional to the number of col¬ 
lisions per unit time, which in turn is jointly proportional 
to the amounts of the substances that are untransformed. 
A chemical reaction that proceeds in this manner is called 
a second-order reaction, and this law of reaction is of¬ 
ten referred to as the law of mass actionf Consider a sec¬ 
ond-order reaction in which x grams of the compound 
contain ax grams of the first substance and bx grams of 
the second, where a + b = 1. If there are aA grams of 
the first substance present initially, and bB grams of the 
second, then the law of mass action says that 

= k(aA — ax)(bB - bx) = kab(A — x)(B —x). 
dt 

If A =£ B, show that 


fAA _- pkab(A—B)t /* *\ 

A(B ~ x) 6 ( ) 

provides a solution for which x = 0 when t = 0.* Hint: 
Take the logarithm of both sides and differentiate with 
respect to t. 

15 In Problem 14, find lim,_>oo *(f) 

(a) by solving equation (*) for* as an explicit function 
of t and using this function; 

(b) by merely inspecting equation (*). 

16 A switch is suddenly closed in an electric circuit, con¬ 
necting a battery of voltage £ to a resistance R and in¬ 
ductance L in series (Fig. 8.10). The battery causes a vari¬ 
able current I = I(t) to flow in the circuit. By elementary 


f For a first-order reaction, see Problem 7. 

*In Chapter 10 we develop a method for discovering this solution. 


physics, the voltage drop across the resistance is £/and across 
the inductance is L dl/dt, and the sum of these two voltage 
drops must equal the applied voltage E : 

dl 

L — + /?/= E> 
dt 

By separating the variables and integrating, and using the 
fact that 7 = 0 when t = 0, find the current / as a func¬ 
tion of r. Graph this function. 



17 Consider a given quantity of gas that undergoes an adi¬ 
abatic expansion or compression, which means that no 
heat is gained or lost during the process. The French sci¬ 
entist Poisson showed in 1823 that the pressure and vol¬ 
ume of this gas satisfy the differential equation 


^ + 7 ^ = o, 

P V 


where y is a constant whose value depends on whether 
the gas is monatomic, diatomic, etc.^ Integrate this equa¬ 
tion to obtain 


pV y = c. 


This is called Poisson's gas equation or the adiabatic gas 
law, and is of fundamental importance in meteorology. 

§ Students who are unfamiliar with electric circuits may find it help¬ 
ful to think of the current I as analogous to the rate of flow of water 
in a pipe. The battery plays the role of a pump producing pressure 
(voltage) that causes the water to flow. The resistance is analogous to 
friction in the pipe, which opposes the flow by producing a drop in 
the pressure; and the inductance opposes any change in the flow by 
producing a drop in pressure if the flow is increasing, and an increase 
in pressure if the flow is decreasing. 

^For more details on the physical background, see pp. 275-276 of 
R. A. Millikan, D. Roller, and E. C. Watson, Mechanics, Molecular 
Physics, Heat, and Sound (The M.l.T. Press, 1965). 


As the reader is certainly aware, the problem of realistically analyzing the growth 
of a population is not adequately dealt with in Example I of Section 8.5. The 
difficulty with this discussion is that the basic equation, 

= kN (k > 0), 
dt 

describes only the simplest ideal situation, in which the inner impulse of the pop¬ 
ulation to expand is given a completely free rein; it does not take into account 


8.6 

MORE APPLICATIONS. 
INHIBITED 
POPULATION 
GROWTH, etc. 
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any of the inhibiting factors that put a ceiling on the possible size of a real pop¬ 
ulation. It is obvious, for example, that the human population of the earth can 
never expand to the stage where there will be only a small fraction of an acre of 
usable land per person. Long before the point is reached at which the whole sur¬ 
face of the earth becomes a teeming slum, the rate of population growth will be 
forced down; social, psychological, and economic effects will depress the 
birthrate, and there will also be an increase in the death rate due to the starva¬ 
tion, disease, and warfare that are the inescapable companions of overpopulation. 
In our next example we try to recognize some of these factors, and thereby mir¬ 
ror reality a little more closely. 


Example 1 Inhibited population growth. Consider a small colony of rabbits of 
population No that is “planted” at time t = 0 on a grassy island where they have 
no enemies. When the population N = N(t) is small, it tends to grow at a rate 
proportional to itself; but when it becomes larger, there is more and more com¬ 
petition for the limited food and living space, and N grows at a smaller rate. If 
N\ is the largest population the island can support, and if the rate of growth of 
the population N is assumed to be jointly proportional to N and to N\ - N, so 
that 


— = kN(N i - AO (k > 0), 
at 


(1) 


find /V as a function of t. 


Solution It should be noticed explicitly at the outset that N increases slowly — 
that is, dNldt is small—when N is small, and also when N is large but close to 
A^i, so that N\ - N is small. To solve (1), we separate variables and integrate, 


i 


dN 

N{N\ - N) 



( 2 ) 


The calculation of the integral on the left side of (2) requires the easily verified 
algebraic fact that 

-!-= -U± + —!—Y ,3) 

N{N\ -N) N\ \N N { - N) v ' 

With the aid of (3), we can write (2) in the form 



which yields 

-L [In /V - In (W, - AO] = kt + c { 
N\ 


or 


_L i n 

N\ — N 


— kt + c i. 


If we multiply through by N\, this becomes 


In 


N 

N\ — N 


= N\kt + c, 
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where c = N\C\. Since N = N 0 when t = 0, we see that c = In [Nq/(N\ - No)], 
so we have 


111 Tf - AT = N ' kt + ^ 

N\ - N 


No 

N\ — No' 


which is equivalent to 

N = No Ntkt 
N\ - N N\ — N 0 


We solve this equation for N by writing 

AW - No) = N 0 N]e N]kt - NN 0 e N ' kt , 
N[N 0 e N ' kt + (Ah - N 0 )] = NoN { e N ' kt , 


and 


NpN\e Nikt 

N 0 e N ' kt + (Ah - No )' 


We can write this in a more convenient form, and thereby obtain our final result, 
by multiplying the numerator and denominator on the right by e~ N]kt \ 


N pN \ 

No + (Ah - No)e~ N ' kt ‘ 


(4) 


It should be observed that (4) gives N = Nq when t = 0, and also that N —> A^ 
as r ^ oo, as we expect. The graph of (4) is shown in Fig. 8.11. In ecology and 
mathematical biology this curve is called the inhibited growth curve, or some¬ 
times the sigmoid growth curve. 


In Example 1 of Section 5.5, we discussed the idealized problem of a freely 
falling body, in which we ignored the effect of air resistance and assumed that 
the only force acting on the body was the force of gravity. We are now in a po¬ 
sition to improve our discussion of this problem by taking air resistance into ac¬ 
count. 



Example 2 Falling body with air resistance. Consider a stone of mass m that 
is dropped from rest from a great height in the earth’s atmosphere. If the only 
forces acting on the stone are the earth’s gravitational attraction mg (where g is 
the acceleration due to gravity, assumed to be constant) and a retarding force due 
to air resistance, which is assumed to be proportional to the velocity v, find v as 
a function of the time t. 


Solution Let s be the distance the stone falls in time t, so that the velocity v = 
ds/dt and the acceleration a = dvldt = d 2 s/dt 2 . There are two forces acting on 
the falling stone, a downward force mg due to gravity, and an upward force kv 
due to air resistance, where k is a positive constant. Newton’s second law of mo¬ 
tion F = ma says that the total force acting on the stone at any moment equals 
the product of its mass and its acceleration. With our assumptions, the equation 
ma = F becomes 


dv 

m —— = mg — kv, 
at 
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or dividing through by m, 


dv 

~dt 


= g " cv. 


( 5 ) 


where c = k/m. We solve (5) by separating variables and integrating, which gives 

dv 

8 ~ cv 

or 


i—-i 

J J2 — CV J 


dt 


-In (g - cv) = t + c\‘, 

c 


and by changing the notation for constants in a familiar way, we can write this 
in the form 


or 


In (g — cv) — —ct + C 2 


g — cv = c$e 


( 6 ) 


The initial condition v = 0 when t = 0 tells us that C 3 = g, so ( 6 ) becomes 


8 ~ cv = ge ct 


or 


v=-{\ - e~ ct ). 
c 


(7) 


Since c is positive, this formula tells us that v^> glc as t —> 00 . It is a surprising 
fact that the velocity of our falling stone does not increase indefinitely, but in¬ 
stead approaches a finite limiting value. This limiting value of v is called the ter¬ 
minal velocity . If we differentiate (7), we find that the acceleration is given by 
the formula a = ge ~ ct , so a —> 0 as t —» 00 . From the physical point of view, this 
means that as time goes on the air resistance tends to balance out the force of 
gravity, so that the total force acting on the stone approaches zero. 


Our next example is typical of many problems involving continuously chang¬ 
ing mixtures. 

Example 3 Mixing. Brine containing 2 lb of salt per gallon flows into a tank 
that initially holds 200 gal of water in which 100 lb of salt are dissolved. If the 
brine enters the tank at the rate of 10 gal/min, and if the mixture (which is kept 
uniform by stirring) flows out at the same rate, how much salt is in the tank af¬ 
ter 20 minutes? After 100 minutes? 

Solution Let x be the number of pounds of salt in the tank after t minutes. The 
key to thinking about this problem is the following fact: 

rate of change of x = rate at which salt enters tank — rate at which salt leaves tank. (8) 

It is clear that salt enters the tank at the rate of 2 • 10 = 20 lb/min. The concen¬ 
tration of salt at any time is x /200 lb/gal, so the rate at which it leaves the tank 
is (jc/200) -10 = x/20 lb/min. Accordingly, ( 8 ) becomes 

dx_ x 400 —x 
~dt “ 20 ~~ 20 


20 
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By the familiar process of separating variables and integrating, and using the ini¬ 
tial condition x = 100 when t = 0 , we obtain 

x = 400 300e~ t/2Q . (9) 

(As usual when we omit computational details, students should carry these de¬ 
tails through for themselves.) By using a calculator, we now find that x = 289.7 
when t — 20, and that x = 398.0 when t = 100. Also, it is obvious from (9) that 
x —> 400 as t —» 


PROBLEMS 


1 In Example 1, what is the population when its rate of 
growth is largest? 

2 In a genetics experiment, 50 fruit flies are placed in a 
glass jar that will support a maximum population of 1000 
flies. If 30 days later the population has grown to 200 
flies, when will the fly population reach half of the jar’s 
capacity? 

3 Let x be the number of people in a community of total 
population x\ who have heard a certain rumor t days af¬ 
ter the rumor was launched. Common sense suggests that 
the rate of increase of that is, the rate at which this ru¬ 
mor spreads through the community, is proportional to 
the frequency of contact between those who have heard 
the rumor and those who have not, and this in turn is 
jointly proportional to the number of people who have 
heard the rumor and the number of those who have not. 
This yields the differential equation 

= cx(x 1 - *), 

where c is a constant expressing the level of social ac¬ 
tivity. If the rumor is initially imparted to xo individuals 
(jc = jc 0 when t = 0), find x as a function of r. Use this 
function to show that x —> x\ as t —> «>. Sketch the graph. 

4 Rework Example 2 under the more general assumption 
that the initial velocity is vo. Show that the terminal ve¬ 
locity is still g/c, and therefore does not depend on vo. 
Convince yourself that this is reasonable. 

5 A motorboat moving in still water is resisted by the wa¬ 
ter with a force proportional to its velocity v. Show that 
the velocity t seconds after the power is shut off is given 
by the formula v = voe ~ ct , where c is a positive constant 


and vo is the velocity at the moment the power is shut 
off. Also, if 5 is the distance the boat coasts in time t, 
find 5 as a function of t and sketch the graph of this func¬ 
tion. Hint: Use Newton’s second law of motion. 

6 Consider the situation described in Problem 5, with the 
difference that the resisting force is proportional to the 
square of the velocity v. Find v and s as functions of t, 
and sketch the graph of the latter function. 

7 By the result of Problem 5, the distance 5 approaches a 
finite limit as t increases; but in Problem 6 this distance 
becomes infinite. Because the resisting force seems to be 
greater in the second case, we would expect the distance 
traveled to be less than in the first case. Explain this 
seeming contradiction. 

8 A tank initially contains 400 gal of brine in which 100 
lb of salt are dissolved. Pure water is run into the tank 
at the rate of 20 gal/min, and the mixture (which is kept 
uniform by stirring) is drained off at the same rate. How 
many pounds of salt remain in the tank after 30 minutes? 

9 Rework Problem 8 if instead of pure water, brine con¬ 
taining -j|j lb of salt per gallon is run into the tank at 20 
gal/min, the mixture being drained off at the same rate. 

10 A country has 5 billion dollars of paper money in circu¬ 
lation. Each day 30 million dollars is brought into the 
banks for deposit and the same amount is paid out. Be¬ 
cause of a change of regime, the government decides to 
issue new paper money displaying pictures of different 
people, so whenever the old money comes into the banks 
it is destroyed and replaced by the new money. How long 
will it take for the paper money in circulation to become 
90 percent new? 


CHAPTER 8 REVIEW: CONCEPTS, FORMULAS 


Define and think through the following. 

1 Exponential and logarithm functions. 

2 Definition of e. The exponential function y = e x . 

3 -j- e u = e u -j" and f e u du = e u + c. 

dx dx J 


4 The natural logarithm function y = In x. 

F J , 1 du . f du 

5 — In u = -— and — = In u + c. 

dx u dx J u 

6 Exponential growth and decay. 
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ADDITIONAL PROBLEMS FOR CHAPTER 8 


SECTION 8.3 

In Problems 1-6, find the derivative dy/dx of the given func¬ 
tion. 


1 

C 1 

II 

2 

y = 0 - 

3 

y = e x2 ~ 2x+l . 

4 

H 

II 

5 

II 

+ 

$1 

6 

II 

Evaluate the integrals in 

Problems 7-11. 

7 

J e 3x dx. 

8 

j e ax+b 

9 

f e x,x dx 

1 X 2 ' 

10 

! 4 dx 

J V?' 


dx. 


II 


e x dx 


\Te x + 1 ' 

*12 Find the area between y = e x and the chord y = ex - 
x + 1. 

13 Find the point on the graph of y = e ax at which the tan¬ 
gent line passes through the origin. 

14 Evaluate the following limits: 

/ 1 \ 4/7 +9 

(a) i™( 1+ 4^) ; 

(b) lim (l + -V'” 2 ; (c) lim ( 1 + LV"; 

n—>°° \ n \ n 


(d) lim 1 + 


_ 1 _\" 
3 n 


(e) L m 4 ,+ ir 


15 Verify that y = e x2 is a solution of the differential equa¬ 
tion y" - 2xy' - 2y = 0. 

16 Verify that y = ( e 2x - \ )/{e 2x + 1) is a solution of the 
differential equation dy/dx = 1 - y 2 . 

17 The area under y = e x from a = 0 to a = 3 is revolved 
about the A-axis. Find the volume generated in this way. 

*18 Prove that for all a > 0 and all positive integers n, 

y-2 y3 y ^ 

e* > 1 + x + — + — +••• + — 

2! 3! n\ 

where the symbol n\ (read ‘Vi factorial”) denotes the 
product 1-2-3 ■ ■ ■ n. Hint: Since e l > 1 for t > 0, 


+ 


= 1 + f e r dt > 1 + f dt = 

Jo Jo 

: = 1 + £ dt > 1 + jj (1 + t) dt 


= 1 + X + 


2 ’ 


and so on. 

*19 If n is any given positive integer, prove that e x > x n for 
all sufficiently large values of a. Hint: Use Problem 18 
for n + 1. 


*20 Prove that 

e x 

lim — = °° 
x —>°° X n 

for any positive integer n. 

*21 If n is a positive integer, show that y = x n e~ x assumes 
its maximum value at a = n, so that its values at a = 
n — 1 and x = n + \ are less than the maximum. Use 
this fact to show that 

n + 1 \n 


< e < 

n ) \n 

and use this in turn to show that 




for every n. When n = 5, the second inequality here 
yields e < 3. Verify this. 

SECTION 8.4 

22 Find dy/dx in each case: 

(a) y = x In a 2 - 2a; (b) y = \ In (a 2 + 2a); 

(c) y = a 2 In a; (d) y = In (5a 4 - 7a 3 + 3); 

(f ) y = In a 5 ; 

1 


In a 

(e) y = T : 


(g) y = (In x) 5 ; (h) y = 

(i) y = Vln x. 

23 Find dy/dx in each case: 

(a) 3a - y 2 + In xy = 1; 

(b) a 2 -f In — + 3y + 2 = 0. 

y 


In a’ 


(a) 

y = 

In 

Vx; 

(b) 

y = 

= In 

xV^; 


(c) 

y = 

In 

(x 2 +4\ 
\2x + 3/’ 

(d) 

y = 

= In 

V2x 3 - 

- 4x; 

(e) 

y = 

In 

U+ l) 5 ; 

(f) 

y = 

= In 

(x 2 V^ 

+ i); 

(g) 

y = 

In 

A 

3 - 2x’ 






(h) 

y = 

In 

V6x 2 + 3x; 







(i) y = In 

(j) y = In 


/ 4 + a 2 
4 — x 2 ’ 

A 


1 + Vl + X 2 ; 

(k) y = a Vx 2 - 3 - 3 In (a + Vx 2 - 3); 


(1) y= --In 


2 + Vx 2 + 4 
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25 Integrate each of the following: 


(a) 

r dx 

(b) j 

f dx 

J 1 + 2jc’ 

’ 1 - 3jc’ 

(c) 

fi x 2 dx 

(d) j 

P x dx 

Jo 2 -X 3 ’ 

'o x 2 + r 

(e) 

! 6 x dx 

(f) 

f dx 

Jo x + 3 ’ 

' xV\nx' 

(g) 

p x m dx 

(h) j 

f x dx 

Jo 1 + 3x 4/3 ’ 

1 l -x 2 ’ 

(i) 

p In (x + 1) dx 

Jo X + 1 

(j) j 

(• (2x — l) dx 

(k) 

r e x dx 

« J 

(• (2x + 3) dx 

J e x + r 

' (x+ l)(;c+ 2)’ 

(m) 

f (In xfdx 

j X 

(n) 

f In vGc dx 

l X 

(o) 

f In (In x) dx 

J x\n x 

(P) j 



26 If p is a positive constant, show that 


lim 

X —>oo 



Hint: Replace x by the variable y = x p. 

27 If a and b are positive constants, show that 


lim 

x— 


(In x) a 
x h 


= 0 . 


28 In Problem 27, find the largest value of 


(In x) a 
y = — 


for x > 1. 


29 If a is a positive constant, find the length of the curve 


x z a , 

y = —-- In x 

7 2a 4 


between x = 1 and x = 2. For what value of <3 is this 
length a minimum? 

30 If <3 and b are positive constants, find the length of the 
curve 



from x = a to x = 3a. 

31 Use the fact that a = e ]na to find dy/dx in each of the 
following cases: 

(a) y = 10*; (b) y = 3*; 

(c) y = 7r v ; (d) y = 7 3 *; 

(e) y = 6*~ 2x \ (f) y = 5^. 

32 Use the idea of Problem 31 to integrate each of the fol¬ 

lowing: 

(a) £ 2 X dx\ (b) 1C Ydx\ 

fV2 , fl „ , 

(c) J ] x3 x ~ dx\ (d) l 2x 1 dx\ 


(e) j 3 ~ x dx; 

(f) | x9 2x2 dx\ 

ri 

f 10^ dx 

(g) J o 5 3t dx\ 

(h) 1U _ . 

1 V^c 


33 Sketch the graph of y = x 2 /5 x , and locate its maximum 
and two points of inflection. 

34 (a) In changing logarithms from the base a to the 

base b, one needs the equations log*, x = (log^ a) • 
(log n *) and (log a b)(\og b a) = 1. Prove them. 

(b) Compute f — ~r— -. 

J X log , 0 x 

(c) For each choice of the constant a > I, show that 

y = (log fl x)lx has a maximum at x = e and a point 

of inflection at ;t = e\Je. Sketch the graph. 

35 Find dy/dx if 

(a) y = (\nx) x \ (b) y = x lnx \ 

(c) y = (In jc) ln (d) y = 

(e) y = x^ x . 


SECTION 8.5 

36 The number of bacteria in a culture doubles every hour. 
How long does it take for a thousand bacteria to pro¬ 
duce a billion? 

37 The world population at the beginning of 1970 was 3.6 
billion. The weight of the earth is 6586 X 10 18 tons. If 
the population of the world continues increasing at a 
rate of 2 percent per year, and if the average person 
weighs 120 lb, in what year will the weight of all the 
people equal the weight of the earth? 

38 Cesium 137 is used in medical and industrial radiology. 
Estimate its half-life if 20 percent decays in 10 years. 

39 In a certain chemical reaction a substance S decomposes 
at a rate proportional to the amount of S not decom¬ 
posed. If 25 g of this substance is reduced to 10 g in 4 
hours, when will 21 g be decomposed? 

40 A certain object cools from 120°F to 95°F in half an 
hour when surrounded by air whose temperature is 
70°F. Use Newton’s law of cooling to find its temper¬ 
ature at the end of another half hour. 

41 A cup of coffee is made with boiling water at 212°F 
and taken into a room whose air temperature is 72°F, 
After 20 minutes it has cooled to 100°F. What is its tem¬ 
perature after cooling for a full hour? 

42 Assume that the atmospheric pressure p is related to the 
altitude h above sea level by the differential equation 


where c is a positive constant. If p is 15 lb/in 2 at sea 
level and 10 lb/in 2 at 10,000 ft, find the atmospheric 
pressure at the top of Mount Everest, where h = 30,000 
ft. 
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43 A rocket of total mass m is traveling with velocity v in 
a distant region of space where the force of gravity is 
negligible. Its thrust is provided by burning an appro¬ 
priate fuel and expelling the exhaust products backward 
at a constant velocity a relative to the rocket. The mass 
m is therefore variable, and Newton’s second law of mo¬ 
tion is 


F = -(mv), 


which in this case becomes 


"if) 


. . . dv dm 

(a) Show that m — = -a ——. 

dt dt 

(b) Use part (a) to show that —~r = —- m. 

dv a 


(c) 


if v = 0 and 


Use part (b) to show that m = m 0 e~ 
m = m 0 when t = 0. 

(d) The mass m clearly diminishes as the flight pro¬ 
gresses, so the velocity v increases. If m\ is the mass 
of the initial fuel supply and v is the maximum 
velocity, show that 

m 0 

v = a In-. 

m 0 - m i 

Notice that m 0 — m \ is the so-called structural mass 
of the rocket, i.e., its mass exclusive of fuel. 

44 The presence of a certain antibiotic destroys a type of 
bacteria at a rate jointly proportional to the number N 
of bacteria and the amount of antibiotic. If there were 
no antibiotic present, the bacteria would grow at a rate 
proportional to their number. Assume that the amount 
of antibiotic is 0 at t = 0 and increases at a constant 
rate. Construct a suitable differential equation for N, 
solve this equation, and sketch the solution. 

45 Assume for the sake of simplicity that uranium 238 de¬ 
cays directly into lead 206 with a half-life of T — 4.5 
billion years. 

(a) If a given quantity of just-solidified volcanic rock 
contains xo atoms of uranium and no lead, show that 
t years later there are ;t = xoe~ kt atoms of uranium 
and y = x 0 (l - e~ kt ) atoms of lead, where kT = 
In 2. 

(b) If we can measure the ratio r = ylx in an ancient 
volcanic rock, and if we have reasonable grounds 
for believing that all the lead comes from uranium 
that was locked in the rock when it solidified, then 
we can calculate the age of the rock with a fair de¬ 
gree of confidence. Show that this age is given by 
the formula 


r = — In (1 + r) 


T Tr 

— In (1 + r) = — 
In 2 In 2 


when r is small. Hint: Examine the graph of 
In (1 + r) for small values of r 

(c) In a certain rock, r is found to be 0.082. Show that 
this rock may be about 530 million years old. 

46 In the branch of psychology called psychophysics, an 
attempt is made to establish a quantitative connection 
between the sensation S experienced by a person and 
the stimulus R that causes this sensation, as in the sen¬ 
sation of heaviness produced by a weight held in the 
hand. If a small change dR in the stimulus from R to 
R + dR produces a corresponding change dS in the sen¬ 
sation, then dS is not proportional to dR. Thus, if a 
weight we hold in our hand is increased from 5 lb to 6 
lb, we detect much more of a difference in heaviness 
than when it is increased from 20 lb to 21 lb. The Fech- 
ner-Weber law was first formulated by E. H. Weber in 
1834 and expounded in detail by G. T. Fechner in 1860, 
and it played a substantial role in early experimental 
psychology through the influence of Wilhelm Wundt. 
This law states that dS is proportional, not to the actual 
amount dR the stimulus is changed, but to the relative 
amount it is changed. 


Find 5 as a function of R if S = 0 when R = 1. 


SECTION 8.6 

47 A flu epidemic hits a city and spreads at a rate jointly 
proportional to the number of people who are infected 
and the number of those who are not. If the number of 
people stricken grows from 10 percent to 20 percent of 
the population in the first 10 days, how many more days 
will be required for half the population to be infected? 

*48 Volterra's prey-predator equations describe an ecolog¬ 
ical community of the following kind. On an island with 
plenty of grass, there live * rabbits (the prey) and y foxes 
(the predator). The number of encounters per unit time 
between rabbits and foxes is proportional to the prod¬ 
uct xy of their populations. The rabbits tend to increase 
at a rate proportional to their number and to decrease 
at a rate proportional to the product xy. The foxes tend 
to decrease at a rate proportional to their number and 
to increase at a rate proportional to xy. This gives the 
system of differential equations 

dx dy 

— = ax — bxy, — = — cy + dxy, 

dt dt 

where a, b, c, d are positive constants. 

(a) Show that ;t = dd and y = alb is a solution of the 
system. These are called the equilibrium popula¬ 
tions. 
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(b) Show that any solution x = x(t), y = y(t) satisfies 
the equation ( x c e~ dx ) (y a e~ by ) = k, where k is a pos¬ 
itive constant. Hint: Eliminate dt from the system 
by division, separate variables, and integrate. 

(c) Use the equation in part (b) to show that neither x(t) 
nory(t) can —> °o as 

49 Consider a falling body of mass m and assume that the 
retarding force due to air resistance is proportional to 
the square of the velocity. If the body falls from rest, 
find a formula for the velocity in terms of the distance 
fallen, and thereby find the terminal velocity in this 
case. Hint: dvldt = ( dv/ds)(ds/dt ) = v dv/ds. 

*50 A torpedo is traveling at a velocity of 60 km/h at the 
moment it runs out of fuel. If the water resists its mo¬ 
tion with a force proportional to the velocity v, and if 
1 km of travel reduces v to 40 km/h, find the distance 
5 the torpedo coasts in t hours, and also the total dis¬ 
tance it coasts. 

51 Brine containing 1 lb of salt per gallon flows at the rate 
of 10 gal/min into a tank initially filled with 120 gal of 
pure water. If the concentration is kept uniform by stir¬ 
ring, and the mixture flows out at the same rate, when 
will the tank contain 40 lb of salt? When will it contain 
100 lb of salt? 

52 A large tank initially contains 45 lb of salt dissolved in 
50 gal of water. Pure water flows in at the rate of 3 
gal/min, and the mixture (which is kept uniform by stir¬ 
ring) flows out at the rate of 2 gal/min. When will the 


tank contain 5 lb of salt? How many gallons of water 
will be in the tank at that time? 

53 An aquarium contains 10 gal of polluted water. A filter 
is attached to this aquarium which drains off the pol¬ 
luted water at the rate of 5 gal/h and replaces it at the 
same rate by pure water. How long does it take to re¬ 
duce the pollution to half its initial level? 

54 Let e be a small positive number. The differential equa¬ 
tion 


where k is a positive constant, is called the doomsday 
equation because the “growth term” kN Ue is slightly 
larger than that for normal—or natural—exponential 
growth (that is, kN). 

(a) Solve this equation \f N = No when t = 0. 

(b) Show that there is a finite time t = to such that 
j\l —» °° as t —» t 0 . 

(c) The fast-growing population of a certain inhabited 
planet has a growth term kN iA . If there are 5 bil¬ 
lion people initially and 6 billion people 10 years 

— later, when is doomsday? 

■A!!J (d) In part (c), when is doomsday if the growth term is 
kN l0] and the other conditions are unchanged? (No 
matter how small the positive number € may be, 
there is an inescapable doomsday.) 
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9.1 

REVIEW OF 
TRIGONOMETRY 


TRIGONOMETRIC 

FUNCTIONS 


We continue the program started in Chapter 8 of extending the scope of our work 
to include broader and broader classes of functions, this time the full range of 
trigonometric functions. In science, we have already pointed out that these func¬ 
tions are indispensable tools for the study of periodic phenomena of all kinds, 
ranging from the back-and-forth movement of the bob of a pendulum clock to 
the revolution of the planets in their orbits around the sun. And in mathematics 
—as we shall see in Chapter 10—almost all of the more advanced methods of 
integration lean heavily on the trigonometric functions and their properties. 

We assume that students have studied trigonometry in high school. Also, in 
Section 1.7 we provided a brief account of a few of the simpler properties of the 
sine and cosine functions. Nevertheless, no matter how well the basic facts have 
been learned, they are easy to forget unless they are needed and used on a day- 
to-day basis, which they will be through most of the rest of this book. We there¬ 
fore devote this section to a review of the subject from the beginning, ignoring 
the fact that students already have a little knowledge of sines and cosines. There 
are a number of fundamental formulas built into this exposition, and these are so 
important for the purposes of calculus that students should relearn them system¬ 
atically and thoroughly. Even though our treatment is very concise, it is essen¬ 
tially self-contained; and hard-working students who have little previous experi¬ 
ence with trigonometry should be able to get along comfortably in the following 
chapters with only what they find in these pages. 

RADIAN MEASURE 

The most common unit for measuring angles is the degree (1 right angle = 90 
degrees = 90°). However, the standard unit for angle measurement in calculus is 
the radian . One radian is the angle which, placed at the center of a circle, sub¬ 
tends an arc whose length equals the radius (Fig. 9.1, left). More generally, the 
number of radians in an arbitrary central angle (Fig. 9.1, center) is defined to be 
the ratio of the length of the subtended arc to the radius, 6 = s/r, so that s = rO. 
Since the circumference of the circle is c = 2nr, a complete central angle of 360° 
is equivalent to lirrlr = 2 tt radians. Thus, 

277 radians = 360°, 77 radians = 180°, 

1 radian = — = 57.296°, 1° = a 0.0175 radian. 

77 180 
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Figure 9.1 


Further, 90° = 7t/2, 60° = 7t/3, 45° = 7t/4, and 30° = 7t/6, where we follow the 
convention of omitting the word “radian” in using radian measure. 

Just as the calculus of logarithms is simplified by using the base e , the calcu¬ 
lus of the trigonometric functions is simplified by using radian measure. We will 
point out the specific reason for this in Section 9.2. Throughout our work we will 
use radian measure routinely and mention degrees only occasionally. 

It is sometimes useful to know that the area A of the sector whose central an¬ 
gle is 6 (Fig. 9.1, right) is given by the formula 

A = jrs = jr 2 0, 

since s = rO. This is easy to prove by using the fact that the area of the sector is 
to the area of the circle as the arc s is to the circumference: 


This is easy to remember by thinking of the sector as if it were a triangle with 
height r and base s. 


THE TRIGONOMETRIC FUNCTIONS 


Consider the unit circle in the xy-plane (Fig. 9.2). If 6 is a positive number, let 
the radius OP start in the position OA and revolve counterclockwise through 6 
radians. Thus, 6 = it produces half a revolution and 6 = 2tt produces a complete 
revolution, both counterclockwise. If 6 is negative, we let OP revolve clockwise 
through —6 radians. See Fig. 9.3. In this way, each real number 6 (positive, neg¬ 
ative, or zero) determines a unique position of OP in Fig. 9.2, and therefore a 
unique point P = (jc, y) with the property that x 2 + y 2 = 1. The sine and cosine 
of 6 are now defined by 


sin 6 = y and cos 6 = x. 

It is evident from the definition that — 1 < sin 0 < 1, and similarly for cos 0; and 
the algebraic signs of these quantities depend on which quadrant of the plane the 
point P happens to lie in. For every 6 , the numbers 6 and 6 + 2 tt clearly deter¬ 
mine the same point P, so 



( 1 , 0 ) 


sin (6 + 27r) = sin 6 and 


COS (6 + 277) = cos 6 . 
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Figure 9.3 



b 

Figure 9.4 



Thus the values of sin 6 and cos 6 repeat when 6 increases by 2tt. We express 
this property of sin 6 and cos 6 by saying that these functions are periodic with 
period 2 tt. 

The remaining four trigonometric functions—the tangent, cotangent, secant, 
and cosecant—are defined by 

tan 6 = —, cot 6 = —, sec 6 = —, esc 6 = —. 

* y * y 

The sine and cosine are the basic functions, and the others can be expressed in 
terms of these two [see identities (1) to (4) below]. 

When 6 is a positive number <7 t/2 , the right triangle interpretations of the 
sine, cosine, and tangent are as follows (see Fig. 9.4): 

. . opposite side a 

sin 0 = -- = — 

hypotenuse h 

. adjacent side b 

cos 6 = --= — 

hypotenuse h 

n opposite side a 

tan 6 = — 77 --— = 

adjacent side b 

We have drawn the right triangle here with base angle equal to the angle 6 shown 
in Fig. 9.2, and the validity of these statements rests on the similarity of the two 
triangles in the figures (since sin 6 = y = y/ 1, etc.). In the equivalent forms 

a = h sin 6 , b = h cos 0 , a = b tan 6 , 

the right triangle interpretations have many uses in physics and geometry. Nev¬ 
ertheless, the purposes of calculus require that 6 be an unrestricted real variable, 
and for this reason the unit circle definitions are preferable. 


IDENTITIES 

Several simple relations among our functions are direct consequences of the de¬ 
finitions: 
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tan 6 = 


sin 6 
cos S' 


cot 6 = 


cos 0 
sin O' 


sec 6 = 


1 

cos S' 


esc 6 = 


1 

sin S' 


tan 6 = 


1 

cot S’ 


(1) 

( 2 ) 

(3) 

(4) 

(5) 


Altogether, there are 21 fundamental identities that express the main properties 
of the trigonomeric functions and constitute the core of the subject. These iden¬ 
tities fall into several natural groups, and are therefore easier to remember than 
we might expect. We emphasize these groups by enclosing them in boxes. 

Our next identities state the effect of replacing 6 by —6. From Fig. 9.5 and 
the obvious fact that the endpoints of the two radii lie on the same vertical line 
for all values of 6 , we at once have the first two of the identities 

( 6 ) 

(7) 

( 8 ) 

The third follows easily from (1) combined with (6) and (7).* 

Our next group consists of three equivalent versions of the equation x 2 + y 2 = 
1. Before stating these, we must explain that the symbols sin 2 6 and cos 2 6 are 
standard notations for the numbers (sin 6) 2 and (cos 6) 2 . If we write x 2 + y 2 = 
1 in the form y 2 + x 2 = 1, then this yields the first of the identities 

(9) 
( 10 ) 
(ID 

The second and third in this group are obtained by dividing (9) first by cos 2 6 , 
and then by sin 2 6. 

For obvious reasons, the following are called the addition formulas: 

( 12 ) 

(13) 

(14) 


sin (6 + 0) = sin 6 cos (f) -I- cos 6 sin c/>, 
cos (6 + 0) = cos 0 cos 0 - sin 6 sin c/>, 
tan S + tan 0 


tan (6 + 0) = ■ 


tan 6 tan 


sin 2 6 + cos 2 6 = 1, 
tan 2 6 + 1 = sec 2 0, 
1 + cot 2 6 = esc 2 6 . 


sin (- 6 ) = -sin 6 , 
cos ( — 6) = cos 6 , 
tan (-6) = -tan 6 , 



*It is clear that there are similar identities for the cotangent, secant, and cosecant. However, these 
are of little significance; and in keeping with our purpose of presenting a stripped-down version of 
trigonometry, we ignore them. 
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We indicate in Problem 10 a method for proving the first two of these—which 
are not obvious and ought to be memorized—and the third follows from the first 
two by a straightforward argument. Write 

sin (0 + 0) sin 0 cos 0 + cos 0 sin 0 

tan (0 + 0) = - - — = ---r-: ——— ~r. 

cos (0+0) cos 0 cos 0 — sin 0 sin 0 

Now, by dividing both numerator and denominator on the right by cos 0 cos </>» 
we obtain 


sin 0/cos 0 + sin 0/cos 0 
tan + 1 — (sin 0/cos 0)(sin 0/cos 0) ’ 

which is essentially (14). The corresponding subtraction formulas are 

(15) 

(16) 

(17) 

These follow directly from the addition formulas by replacing 4> by — 4> and us¬ 
ing (6), (7), and (8). 

The double-angle formulas are 

(18) 

(19) 

These are the special cases of (12) and (13) obtained by replacing 4> by 0. (There 
is also an obvious double-angle formula for the tangent; but this is of minor im¬ 
portance and we omit it.) 

The half-angle formulas are 

( 20 ) 
( 21 ) 

These are easy to prove by writing (9) and (19) together, as 

cos 2 0 + sin 2 0=1, 
cos 2 0 - sin 2 0 = cos 20. 

Adding yields (20), and subtracting yields (21). 


2 cos 2 0 = 1 + cos 20, 
2 sin 2 0 = 1 - cos 20. 


sin 20 = 2 sin 0 cos 0, 
cos 20 = cos 2 0 - sin 2 0. 


sin (0 — 0) = sin 0 cos 0 — cos 0 sin 0, 
cos (0 — 0) = cos 0 cos 0 + sin 0 sin 0, 
tan 0 — tan 0 


tan (0 - 0) 


1 + tan 0 tan 0 ’ 


VALUES 

If we keep firmly in mind the definitions of sin 0, cos 0, and tan 0, then there 
are several first-quadrant values of 0 for which the exact values of these func¬ 
tions are easy to find. All that is necessary is to remember the Pythagorean the¬ 
orem and look carefully at the three parts of Fig. 9.6: 
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Also, an inspection of Fig. 9.2 with OP in various positions gives us similar in¬ 
formation for the cases 0 = 0 , tt/2, tt, 37 t/ 2 , and 2ir (the entry * means that the 
quantity is undefined): 

7 t 3 77 

sin 0 = 0 sin — = 1 sin 77 = 0 sin —— = —1 sin 2 tt = 0 

2 2 

77 3 77 

cos 0 = 1 cos — = 0 cos 77 = —1 cos 2=0 cos 2 tt = 1 

tan 0 = 0 tan = * tan 77 = 0 tan = * tan 277 = 0 

2 2 

In our subsequent work, facts of this kind will often be needed at a moment’s 
notice. They are best learned, not by an effort of memory, but rather by an act 
of understanding—knowing the definitions of the trigonometric functions and 
visualizing (or quickly sketching) appropriate pictures. We also emphasize the 
way the algebraic signs of our functions vary from one quadrant to another. The 
facts are obvious from the definitions and Fig. 9.2, and are stated in the follow¬ 
ing table: 


Quadrant 

1 

2 3 4 

sin 6 

+ 

+ - - 

cos 6 

+ 

+ 

tan 6 

+ 

+ - 


GRAPHS 

The graph of sin 0 is easy to sketch by looking at Fig. 9.2 and following the way 
y varies as 0 increases from 0 to 277, that is, as the radius swings around through 
one complete counterclockwise revolution. It is clear that sin 0 starts at 0, in¬ 
creases to 1, decreases to 0, decreases further to — 1, and increases to 0. This 
gives one complete cycle of sin 0, as shown on the left in Fig. 9.7. The complete 
graph (on the right in Fig. 9.7) consists of infinitely many repetitions of this cy¬ 
cle, to the right and to the left. The graph of cos 0 can be sketched in essentially 


Figure 9.6 
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Figure 9.7 


Figure 9.8 



the same way (Fig. 9.8), the main difference being that cos 6 starts at 1, decreases 
to 0, decreases further to — 1, increases to 0, and increases further to 1. 

The graph of tan 6 is quite different from the graphs of sin 6 and cos 6. We 
point out first that tan 6 is periodic with period i r: 


tan (0 + 7r) — 


sin (0 + 77) 
cos (0 + 7r) 


— sin 0 
-cos 0 


= tan 0. 


This permits us to get the full range of values of tan 6 by visualizing the ratio 
y/x in Fig. 9.2 and allowing 6 to increase from —tt/2 to 7 t/ 2. The result is the 
central curve shown in Fig. 9.9, and the complete graph of tan 6 consists of in¬ 
finitely many repetitions of this curve to the right and to the left. The fact that 
tan 6 —» oo as 6— > 7t/2 (from the left) is often loosely expressed by writing 
tan 7t/ 2 = oo. 



Figure 9.9 
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LAW OF COSINES 

This is a useful tool in a variety of situations in mathematics and physics. It ex¬ 
presses the third side of a triangle (Fig. 9.10) in terms of two given sides a and 
b and the included angle 6: 

c 2 = a 2 + b 2 ~ 2ab cos 9. 

The proof is routine if we place the triangle in the jcy-plane as shown in the fig¬ 
ure and apply the distance formula to the vertices (a cos 6 , a sin 6) and ( b , 0). 
The square of the side c is clearly 

c 2 = (a cos 9 - b) 2 + (a sin 9 - 0) 2 



= <2 2 (cos 2 0 + sin 2 6) + b 2 - lab cos 6 


= a 2 + b 2 — lab cos 6 , 


and the argument is complete. An important application of the law of cosines is 
made in Problem 10, where it is used to prove identity (16), and thereby identi¬ 
ties (12) and (13). 


PROBLEMS 


1 Convert from degrees to radians: 


(a) 

15°; 

(b) 

d " 

»n 

O 

(C) 

120°; 

(d) 

75°; 

(e) 

150°; 

(f) 

135°; 

(g) 

225°; 

(h) 

210°; 

(i) 

630°; 

(j) 

900°. 





Convert from 

radians to degrees: 


(a) 

5 77/3; 

(b) 

777/6; 

(c) 

2 77/9; 

(d) 

3 77/2; 

(e) 

477/3; 

(f) 

377; 

(g) 

777/15; 

(h) 

77/36; 

(i) 

77/5; 

(j) 

25 77/3. 






3 A decorative garden is to have the shape of a circular 
sector of radius r and central angle 6. If the perimeter 
is fixed in advance, what value of 6 will maximize the 
area of the garden? 

4 Find the values of sin 6 , cos 6, and tan 6 when 6 equals 

(a) -tt/ 6; (b) 3 tt/ 4; (c) 4 tt/3; 

(d) -5 77/4; (e) 2 tt/ 3; (f) 17 tt; 

(g) -10277. 

5 If the base of an isosceles triangle is 10, express its area 
A as a function of the vertex angle 6 . 

6 If the height of an isosceles triangle is h, express its 
perimeter p as a function of the base angle 9. 

7 Express the height H of a flagpole in terms of the length 
L of its shadow and the angle of elevation 9 of the sun. 

8 A hunter sits on a platform built in a tree 30 m above 
the ground. He sees a tiger at an angle of 30° below the 
horizontal. How far is he from the tiger? 

9 Sketch the graph of 

(a) sin 29 (hint: this curve runs through one complete 
cycle as 29 increases from 0 to 2 77); 


(b) 3 sin 20; (c) sin 40; 

(d) sin y0; (e) 2 cos 39. 

10 In this problem we outline a method of proving identi¬ 
ties (12) and (13) by first establishing (16). Figure 9.11 
shows the unit circle with two arbitrary angles 9 and 4> 
and their corresponding points P e = (cos 9 , sin 9) and 
P$ = (cos 4 >, sin 4>)- 



Figure 9.11 


(a) Calculate the square of the distance between these 
points in two ways, by using the distance formula 
and the law of cosines, and thus prove identity (16), 

cos (9 — 4>) = cos 9 cos 4> + sin 9 sin 4>- 

(b) Use part (a) to prove identity (13), 

cos (6 + 4>) = cos $ cos 4> ~ sin 9 sin 4>■ 
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(c) Use part (a) to show that cos (tt/2 — </>) = sin </>. 

(d) Use part (c) to show that sin (7 t/ 2 — </>) = cos $. 
Hint: Replace </> by 7 t/2 - 4 >• 

(e) Use parts (a), (c), and (d) to prove identity (12), 

sin (0 + 0) = sin 0 cos 4> + cos 0 sin 4>- 

Hint: sin (0 + 4>) = cos - (0 + </>)] 

= cos [(tt/2 — 0) — (/)] = ■■■ . 


11 Derive formulas for sin 30 in terms of sin 0, and for 
cos 30 in terms of cos 0. 

12 Derive a formula for cos 40 in terms of cos 0. 

13 Derive a formula for sin 40 in terms of sin 0 and cos 0. 

14 If a and b are any constants, show that there exist con¬ 
stants A and B with the property that a sin 0 + b cos 0 
can be written in the form A sin (0 + B). 

15 Compute sin 15° by using 

(a) 15° = 45° - 30°; (b) 15° = y(30°). 

16 Find all solutions of the given equation in the interval 
0 < 0 < 2 ?t: 

(a) sin 0 = 0 ; (b) sin 0 = 1 ; 

(c) sin 0 = — 1 . 

17 Find all solutions of the given equation in the interval 
0 < 0 < 27r: 

(a) cos 0 = 1 ; (b) cos 0 = 0 ; 

(c) cos 0 = - 1 . 

18 Find all values of 0 in the interval 0 < 0 < 2 7 r for which 

(a) sin 0 = -{; (b) cos 0 = 7 V 2 ; 

(c) sin 0 = yV3; (d) sin 0 = cos 0 . 

19 Find all solutions of each of the following equations: 

(a) sin 30 = \\fl\ (b) cos {0 = — 1; 

(c) sin 50 = -7. 

20 Show that 

(a) sin 0 sin 4> = {[cos (0 - $) - cos (0 + </>)]; 

(b) cos 0 cos 4> = {[cos (0 -</>) + cos (0 + </>)]; 

(c) sin 0 cos 4> = {[sin (0 + </>) + sin (0 - </>)]. 

21 Show that 

, X • n . . „ . /0 + 4>\ I 6- 4>\ 

(a) sin 0 + sm </> = 2 sm —-— cos —-— ; 


(b) sin 0 - sin 4> = 2 cos 

(c) cos 0 + cos 4> = 2 cos 





(d) cos 0 — cos 4> = ~2 sin 




sin 


¥)■ 


Hint: These identities can be established laboriously, by 
working from the right sides to the left sides, or easily, 
by an ingenious use of Problem 20. 


Establish the identities in Problems 22-32. 
2 tan 0 


22 sin 20 = 


1 + tan 2 O' 


23 


cos 20 = 


1 - tan 2 0 
1 + tan 2 0' 


24 

25 

26 

27 

28 

29 

30 

31 

32 

33 


34 


35 


*36 


37 


*38 


tan 0 = 


sin 0 + sin 20 
1 + cos 0 + cos 20 ’ 


tan 0 = 


sin 20 
1 + cos 20* 


cot 0 = 


sin 20 
1 — cos 20* 


tan 2 


1 — cos 20 
1 + cos 20' 


tan 0 tan {0 = sec 0 - 1. 


tan y0 = 


sin 0 

1 + cos 0* 


1 + sin 0 + cos 0 
1 + sin 0 — cos 0 


cot {0. 


1 - 4 sin 4 0 = cos 20(1 +2 sin 2 0). 
tan y0 + cot {0 = 2 esc 0. 

Sketch the graph of the function 

(a) y = sin —; (b) y = x sin —; 

x x 


(c) y = x 2 sin —. 

x 

Notice particularly that each of these functions is un¬ 
defined at x = 0. 

In Problem 33, supplement the definition of each func¬ 
tion by specifying that y = 0 when Jt = 0. With these 
changes, show that at the point jc = 0 the function (a) 
is discontinuous, the function (b) is continuous but not 
differentiable, and the function (c) is differentiable. 
If one side and the opposite angle of a triangle are fixed, 
and the other two sides are variable, use the law of 
cosines to show that the area is a maximum when the 
triangle is isosceles. (Can you prove this by using 
geometry alone?) 

A conical paper cup is formed from a circular sheet of 
paper by cutting out a circular sector and joining the 
two straight edges of the remaining piece. What should 
the central angle of the sector be in order to maximize 
the volume of the cup? 

A light hangs above the center of a circular table whose 
radius is 3 ft. The illumination at any point on the table 
is directly proportional to the sine of the angle between 
the table and the ray of light to that point, and inversely 
proportional to the square of the distance from the point 
to the light source. How high should the light be hung 
in order to maximize the illumination at the edge of 
the table? 

A heavy spherical ball is lowered carefully into a full 
conical wine glass whose depth is a and whose gener¬ 
ating angle (between the axis and a generator) is a. 
Show that the greatest overflow occurs when the radius 
of the ball is 


a sin a 


sin a + cos 2 a ' 
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The calculus of the trigonometric functions begins with two of the most impor¬ 
tant formulas in all of mathematics, 

d 

— sin x = cos x (1) 


and 


9.2 

THE DERIVATIVES OF 
THE SINE AND COSINE 


— cos ;t = —sin x, (2) 

dx 


We have already discussed these formulas in Section 3.4. However, no harm will 
be done—and perhaps much good—if we pretend that we have never heard of 
them before and discuss them all over again from the beginning. We emphasize 
that the letter x used here is simply an ordinary real variable, as in any function 
y — /(jt); and if it is thought of as an angle, then this angle is always to be un¬ 
derstood in radian measure. 

Formulas (1) and (2) can be proved by straightforward applications of the de¬ 
finition of the derivative, 


d_ 

dx 


f(x) = lim 
Ajt—»• 


f(x + /ix)~f(x) 
Av 


(3) 


In order to carry through these calculations, it turns out that we need to know 
the following two special limits, 


r sin 9 
lim —— = 1 
0 ->o 9 


(4) 


and 


lim = 0 .* 
0 ->o 9 


(5) 


The validity of these statements can be understood directly from geometry, by 
thinking of 6 as a small angle and looking at the unit circle on the left in Fig. 
9.12, where the definitions given in the previous section tell us that PQ = sin 0 , 
PR = 6 , and QR = 1 - cos 6 . It is easy to see that the ratio (sin 6)/6 = PQ/PR 
is <1 and close to 1, and it visibly approaches 1 as 6 approaches 0. This be¬ 
havior of the ratio (sin 6)19 is further emphasized by the magnified version of 


"These limits were both established in Section 2.5. For the sake of variety, we offer supporting ar¬ 
guments here that are somewhat different from those used before. 




Figure 9.12 
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part of the picture that is shown on the right in Fig. 9.12, where 0 is taken to be 
very small and the origin is understood to be several feet to the left. The same 
type of “proof by inspection” can also be applied to (5). This time the figure tells 
us that the ratio (1 - cos 0)10 = QR/PR is a small number that clearly approaches 
0 as 0 approaches 0. 

To establish formula (1), we apply (3) to the function/(x) = sin x, 

sin (x + Ax) - sin x 


— sin x = lim 
ax a*—» o 


Ax 


Since sin (x + Ax) = sin jc cos Ax + cos jc sin Ax, we exercise a little ingenuity 
and write 

d sin jc cos Ax + cos x sin Ax — sin x 

— sin x = lim --- 

dx A*->o Ax 


= lim 

Ax —>0 


COS X 


sin Ax 
Ax 


— sin x 


- cos Ax 
Ax 


Using (4) and (5) with 0 replaced by Ax now yields 
d 


dx 


sin x = (cos x) ■ 1 — (sin x) ■ 0 = cos x, 


which concludes the proof of (1). To prove formula (2), we begin with 

d cos (x + Ax) — cosx 

— cos x = lim - t-. 

dx Ax— >o Ax 

Since cos (x + Ax) = cos x cos Ax — sin x sin Ax, we have 

d ,. cos x cos Ax — sin x sin Ax — cos x 

— cos x = lim -;- 

dx Ax->o Ax 


= lim 

Ax —>0 


—sin x 


sin Ax 
Ax 


— COS X 


cos Ax 
"Ax 


= ( —sin x) • 1 — (cos x) ■ 0 = —sin x, 


and the proof of (2) is complete. The addition formulas for the sine and cosine 
obviously play essential roles in these arguments, and this is their main use in 
mathematics. 

We now generalize (1) and (2) by means of the chain rule, and obtain the ex¬ 
tremely useful formulas 


d . du 

—— sin u — cos u —— 
dx dx 


( 6 ) 


and 


d du 

— cos u— - sin u —. 
dx dx 


(7) 


As usual, u is understood to be any differentiable function of x. 


Example 1 Find the derivative of each of the following functions: 
(a) >’ = sin 5x; (b) y = sin Vx; (c) y = cos (2 - 3.x 4 ). 


Solution For (a), we use (6) with u = 5x, so 
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= cos 5 jc - 7 - (5jc) = 5 cos 5 jc. 
ax ax 

For (b), u = Vx = jc 1/2 , so 

= cos Vjc -7 (jc 1/2 ) = — cos Vx 
dx dx 2 Vjc 

For (c), we use (7) with u — 2 — 3x 4 , so 

^ = -sin (2 - 3* 4 )-f (2 - 3jc 4 ) = 12 a: 3 sin (2 - 3a 4 ). 


Students must learn to use formulas ( 6 ) and (7) in combination with all pre¬ 
vious rules of differentiation. In this connection it is necessary to remember the 
standard notation for powers of trigonometric functions: sin n x means (sin x) n . 
There is one exception to this usage, for (sin jc) -1 is never written sin -1 jc; the 
latter expression is reserved exclusively for the inverse sine function discussed 
in Section 9.5. 


Example 2 Find the derivative of each of the following functions: 
(a) y - sin 3 4jc; (b) y — e cosx \ 

(c) y = In (sin jc); (d) y = sin (In x). 


Solution 
d) 
dx 


(a) = 3(sin Ax) 2 - 7 - (sin 4jc) = 3 (sin 4jc) 2 (cos 4jc) • 4 

dx 


= 12 sin 2 4jc cos 4jc. 

,COS X 


(b) — = e cosx - 7 - (cos jc) = — sin jc e 

dx dx 

, x dy 1 d / . . cos x 

(c) — = —-— (sin jc) = — -= cot jc. 

dx sin jc dx sin x 

/ j \ dy n , d „ , cos (In x) 

(d) — = cos (In jc) (In jc) =- . 

dx dx x 


Example 3 Show that {did) c)(j cos 3 jc — cos jc) = sin 3 jc. 
Solution 

j cos 3 jc — cos jcj = -j • 3 cos 2 jc( —sin jc) + sin jc 
= sin jc(1 — cos 2 jc) = sin 3 jc. 



Remark 1 Formulas ( 6 ) and (7) enable us to understand very clearly why ra¬ 
dian measure is preferred to degree measure when working with the trigonometric 
functions in calculus. Let sin jc° and cos jc° denote the sine and cosine of an an¬ 
gle of jc degrees. We know that jc degrees equals ttx/ 1 80 radians, so 
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By formula (6), 


d . 0 d . TTX 

-s mx =^s,n — 


180 


COS 


TTX 

780 


or 


d . 0 TT 

~dx sm x = m cosx 


.o. 


and similarly 


dx 


cos x 


180 


These formulas make it obvious why we use radian measure routinely in calcu¬ 
lus—for the sake of simplicity, in order to avoid the repeated occurrence of the 
nuisance factor 7 t/180. 



Remark 2 There is another way of obtaining the basic derivative formulas (1) 
and (2) for the sine and cosine. This alternate approach has the advantage of pro¬ 
viding direct insight into why these formulas are true. We start with the defini¬ 
tions from Section 9.1 (see Fig. 9.13), 

y = sin 0 and t - cos 0, 


in which we use 6 for the independent variable because t and y are the coordi¬ 
nates of P. We change 0 by a small amount AO and examine the resulting changes 
Ay and At in y and jc, as shown in the figure. If we think of PQR as a tiny “right 
triangle” with hypotenuse” A0, then the crux of the present argument is the fact 
that the triangles PQR and POS are similar. By using proportional sides of sim¬ 
ilar triangles we obtain the approximate equations 


Ay 

AO 



and 


-At _ y_ 
AO = 7 


As A 0 approaches 0, these approximations get better and better, and we conclude 
that 


dy _ ■■ Ay 

de he 


and 


dx Ax 

~de ~ Afl™o ~Ke ~ -;v ’ 


or equivalently, 


— sin 0 = cos 0 
dO 


and 


dO 


cos 0 = - sin 0. 


This geometric reasoning may be somewhat lacking in the precision of the tra¬ 
ditional proofs of formulas (1) and (2) given at the beginning of this section. 
Nevertheless, it has the great merits of simplicity and directness. 


PROBLEMS 


In each of Problems 1-18, find the derivative dy/dx of the 
given function. 

1 y = sin (3 t — 2). 

2 y = cos (1 — lx). 


3 y = 3 sin 16 t. 

4 y = sin 2 x. 

5 y = sin x 2 . 

6 y = sin 2 6x. 






7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 
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y = 5 sin 3x + 3 cos 5x. 

y = sin 2 x + cos 2 x. 

y = x sin x. 

y = x 3 sin 3x. 

y = sin 2 3x cos 2 3x. 

y = cos 4 x — sin 4 x. 

y = j sin 5 x — j sin 3 x + sin x. 

y = sin (sin x). 

y = e 2x sin 3x. 

y = sin (In x 2 ). 

y = In (cos x). 

y — gX 2 +sinx 

If a is a positive constant, verify that y = c\ sin ax + C 2 * 
cos ax is a solution of the differential equation 


for every choice of the constants c\ and c 2 . (In the Ad¬ 
ditional Problems we outline a proof of the important 
fact that every solution of this differential equation has 
the stated form. For this reason, y = c\ sin ax + c 2 • 
cos ax is called the general solution of the differential 
equation.) 

Show that (d/dx) cos x = — sin x by using the identity 
cos x = sin (7t/2 — x) and formula (6). 

Find the angle at which the curve y = } sin 3 jc crosses 
the ;t-axis. 

Sketch the graph of y = sin x + cos x on the interval 0 < 
x < 27 t, and find the maximum height of this curve above 
the x-axis. 

Find the maximum height of the curve y = 4 sin x — 

3 cos x above the x-axis. 

Obtain the second of the following identities by differ¬ 
entiating the first: sin 2x = 2 sin x cos x, cos 2x = 
cos 2 x — sin 2 x. 

Obtain the second of the following identities by differ¬ 
entiating the first: sin 3x = 3 sin x - 4 sin 3 x, cos 3x = 

4 cos 3 x — 3 cos x. 

Obtain the second of the following identities by differ¬ 
entiating the first with respect to either of the variables, 
keeping the other fixed: 

sin (x + y) = sin x cos y + cos x sin y, 

cos (x + y) = cos x cos y — sin x sin y. 

Show that y = sin x and y = tan x have the same tangent 
at x = 0. 

Show that the function y = x + sin x (x > 0) has no max¬ 
imum or minimum values even though there are many 
points where dy/dx = 0. Sketch the graph. 


29 A regular polygon with n sides is inscribed in a circle of 
radius r. 

(a) Show that the perimeter of this polygon is p n = 2nr ■ 
sin (7 Tin). 

(b) Find lim n _ >00 /? n , and verify by elementary geometry 
that your answer is correct. 

30 If a, b, c are constants with ab =£ 0, show that the graph 
of y = a sin ( bx + c) is always concave toward the 
x-axis and that its points of inflection are the points where 
it crosses the x-axis. 

31 Sketch the graphs of y = sin x and y = sin 2x together 
on a single set of axes. These curves have many points 
of intersection. Find the smallest positive x-coordinate of 
such a point, and calculate the acute angle at which the 
curves intersect at this point. Hint: See identity (17) in 
Section 9.1. 

32 The functions/(x) = sin x and g(x) = cos x have the fol¬ 
lowing properties: (a) f\x) = g(x); (b) g'(x)= -/(x); 

(c) /(0) = 0; (d) g(0) = 1. If F(x) and G(x) is any pair 
of functions with the same properties, prove that F{x) = 
sin x and G(x) = cos x. Hint: Show that 

[F(x) -/(x)] 2 + [G(x) - g(x)] 2 = a constant, 

and find the value of this constant. [This problem has a 
very remarkable meaning: The functions sin x and cos x 
are completely described by properties (a) to (d), and 
therefore the total nature of these functions—everything 
that is now known about them or ever will be known— 
is implicitly contained in these four simple properties.] 

In each of Problems 33^13, find the value of the indicated 


limit. 



33 

tan x 
lim 

*->o x 

34 

sin 3x 
lim 

-*->0 x 

35 

,. tan x 
lim . 
sin x 

36 

.. sin 3x 
lim . _ . 

*->• sin 5x 

37 

lim tan 3x esc 6x. 

X —>0 

38 

1 — cos X 
hm 9 

x->0 x 2 

39 

lim x sin —. 

x 

40 

lim 3x tan —. 

41 

2x 2 + 2x 
sin 2x 

42 

lim sin 3x cot 5x. 

X—>0 

43 

COS X 

lim 

x—>77/2 X — 7T/2 



44 

The graph of y = 

cos 6 has 

an obvious maximum at the 


point corresponding to 6 = 0, so the tangent is horizon¬ 
tal there. Show that this fact is equivalent to the limit (5). 
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9.3 

THE INTEGRALS OF 
THE SINE AND COSINE. 
THE NEEDLE PROBLEM 


Example 1 Evaluate f cos 5x dx. 

Solution Let u = 5x. Then du = 5 dx , dx = y du, and formula (2) gives 
cos 5x dx = J cos u ■ (y du) = y J cos u du 

= y sin u + c = y sin 5x + c. 

After a little practice, it will be easy for students to make this kind of substitu¬ 
tion mentally. In fact, we can dispense with the new variable u altogether, and 
compress this solution to the following simple steps: 

cos 5x dx = y cos 5x d(5x) = y sin 5x + c. 


Example 2 Evaluate f 7x sin (2 — 9x 2 ) dx. 

Solution Let u - 2 — 9a: 2 . Then du - — 1 8jc dx , xdx- — jg du , and 
J lx sin (2 — 9x 2 ) dx = j 7 sin u • ( — ^ dw) 

= —-jy J sin w dw 

= -jy cos u + c = -jy cos (2 - 9x 2 ) + c. 

Here the auxiliary variable u plays an important part in our work. It not only em¬ 
phasizes the need to apply formula (1), but also helps us keep track of the vari¬ 
ous coefficients and algebraic signs that appear in the calculation—and there¬ 
fore helps us avoid mistakes. 


The differential versions of formulas (7) and (6) in the previous section are 
d{ cos u) = — sin u du and d{ sin u) = cos u du. 

These immediately yield the integration formulas 

J sin u du = 


-cos u + c 


and 


J cos u du = sin u < 


( 1 ) 


( 2 ) 


Example 3 Compute the definite integral 


f^ /4 cos 2x dx 
Jtt/6 sin 3 2x 


Solution We begin by finding the indefinite integral. Since d (sin 2x) = 
2 cos 2x dx , we put u = sin 2x. This gives du = 2 cos 2x dx , so 


cos 2x dx 
sin 3 2x 



,. — 3 



1 


4 sin 2 2x' 







9.3 THE INTEGRALS OF THE SINE AND COSINE. THE NEEDLE PROBLEM 


307 


We remind students that the constant of integration can always be ignored in 
computing definite integrals, and for this reason we don’t bother to write it here. 
The Fundamental Theorem of Calculus now permits us to complete the solution 
by writing 



In our next example we discuss an application of these methods to a famous 
problem about probability that was invented by the French scientist Buffon in 
the early eighteenth century. 

Example 4 Buffon’s needle problem. A needle 2 in long is tossed at random 
onto a floor made of boards 2 in wide. What is the probability that the needle 
falls across one of the cracks? 

Solution We begin with a brief digression to explain our use of the word “prob¬ 
ability.” In mathematics this word means a numerical measure of the likelihood 
that a certain event will occur. As an example, consider the rectangle shown on 
the left in Fig. 9.14, in which a portion of the figure is shaded. If a point is cho¬ 
sen at random in this rectangle, for instance by making the rectangle into a tar¬ 
get and throwing a dart blindfolded, then the probability of choosing a shaded 
point is We assume here that each point is just as likely to be chosen as any 
other, and this number expresses the fact that the proportion of shaded points 
among all points in the rectangle is In the second rectangle the probability of 
choosing a shaded point is and in the third rectangle it is f. We take it as self- 
evident that the probability of choosing a shaded point equals the ratio of the 
shaded area to the total area. 

Let us now return to the needle problem. We describe the position in which 
the needle falls on the floor by the two variables x and 6 shown in Fig. 9.15; x 
is the distance OP from the midpoint of the needle to the nearest crack, and 6 is 
the smallest angle between OP and the needle. A toss of the needle amounts to 
a random choice of the variables x and 6 in the intervals 


0 ^ x < 1 and 0 < 6 < -y, 


(3) 


and this in turn amounts to a random choice of a point in the rectangle shown in 
Fig. 9.16. Furthermore, a close inspection of Fig. 9.15 shows that the event we 
are interested in, namely, that the needle falls across a crack, corresponds to the 
inequality 


x < cos 6. 


(4) 



Figure 9.14 
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1 X 

1 



1 1 

Jf- 

■V 


VI 

1 

1 

1 

1 

1 

1 

1 

1 

1 


cos 6 



Figure 9.15 Needle crosses crack 




Figure 9.16 


This inequality describes the shaded region in Fig. 9.16 under the graph of 
x = cos 6. We therefore conclude that the probability of the needle falling across 
a crack equals the following ratio of areas: 

[tt/2 

, I cos 6 d6 . ~ 

area under curve _ Jo_1 _ 2 

area of rectangle 7 t/2 7t/2 7 r’ 

which is slightly less than f. This calculation can be extended at once to the more 
general situation in which d is the distance between adjacent cracks and the length 
of the needle is L < d. The inequalities (3) are replaced by 


and (4) becomes 


0 < jc < — d 
2 


and 


0 < d 


7r 

r 


x <{L cos 6 . 


In this case the probability of the needle falling across a crack is easily seen to 
be 


jLcosQdQ 

area under curve _ Jo _ 2L 

area of rectangle {{d)(Till) 7 rd' 


( 6 ) 


(Students should draw their own sketch for this case similar to Fig. 9.16, and in 
particular should notice the reason for the restriction L < d.) 


Remark We obtained these conclusions about the probability of success in the 
needle experiment by pure reason alone, without any appeal to experience. How¬ 
ever, the “sequence of trials” approach to the concept of probability has some 
interesting implications for the needle problem. In the case of the 2-in needle 
and the 2-in floorboards, let us actually perform the experiment of tossing the 
needle onto the floor a large number of times, say n times, where n = 100 or 
n = 1000 depending on our ability to tolerate boredom. Let us also keep careful 
count of the number k of times the needle falls across a crack. Then the abstract 
probability that the needle falls across a crack on any one toss should be closely 
approximated by the ratio k/n , and this approximation should improve as n in¬ 
creases. Roughly speaking, this means that 
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lim — = 

n _>oo fl 


2_ 
7r’ 


so we should have 


k 


n 


2_ 
7r’ 


and solving this approximate equation for tt yields 

2 n 


for large values of n. In principle, therefore, this provides an experimental method 
of calculating 77 . In fact, however, this method is not capable of much accuracy 
because of the inherent errors that appear in all measurements. We will discuss 
practical methods for computing 7rto very great accuracy in Chapter 13. 


PROBLEMS 


Evaluate the indefinite integrals in Problems 1-20. 

1 / sin 5jc dx. 

2 / cos (2jc - 5) dx. 

3 / sin (1 — 9jc) dx. 

4 /(3 cos 2jc - 2 sin 3jc) dx. 

5 /2 sin x cos x dx. 

6 / cos 2 x sin x dx. 

7 / sin 3 2jc cos 2jc dx. 

8 / sin x cos x (sin jc + cos x) dx. 

9 / sin 7 \x cos \x dx. 


It 

/4jc sin jc 2 dx. 


11 

f sin Vx dx 

-7=-• 12 

C cos (In jc) dx 

J Vx 

J JC 

13 

f cos (sin 2jc) cos 2jc dx. 


14 

f cos jc dx 

J sin 2 * ' 

f sin [(2jc — l)/3] dx 

J cos 2 [( 2 x - l)/3] ' 

16 

f cos x dx 

( sin * dx 

. 17 



J sin jc 

j COS X 

18 

r cos 3jc dx 


^ Vsin 3jc 


19 

/(2jc + 1) cos (jc 2 + jc) dx. 


20 

/(jc + cos jc) 4 (1 - sin jc) dx. 



29 Find the volume generated by revolving about the jc-axis 
the region under y = sin x and between x = 0 and 
jc = 77. Hint: Remember the half-angle formula 2 sin 2 jc = 
1 - cos 2jc. 

30 Consider the region between y = sin jc and the jc-axis for 
0 < jc < 77/2. For what constant c does the line jc = c di¬ 
vide this region into two parts of equal areas? 

31 Anticipate the results of the next section by deriving the 
following differentiation formulas: 

d 2 

— tan jc = sec 2 jc; 

dx 

d 2 

— cot jc = -esc 2 jc; 

dx 

d 

—— sec jc = sec jc tan jc; 

dx 

d 

— CSC JC = —CSC JC cot JC. 

dx 

Hint: Express each function in terms of sin jc and cos jc. 

32 Obtain the following integration formulas from the dif¬ 
ferentiation formulas in Problem 31: 


Evaluate the definite integrals in Problems 21-24. 

77/5 (-277/3 

22 cos 3jc dx. 


21 

23 

25 

26 


27 

28 


i 


sin 5jc dx. 

f ^2 cos jc dx 
J77/4 sin 2 jc 


24 


1 : 

f 


- 77/6 
CV^T 


x cos jc 2 dx. 


Find the area under one arch of y = sin 3jc. 

In the first quadrant, the y-axis and the curves y = sin jc 
and y = cos jc bound a “triangle-shaped” region. Find its 
area. 

Find the area under one arch of y = 3 cos 2jc. 

Find the area under one arch of y = 6 sin \x and above 
the line y = 3. 


J sec 2 x dx = tan x + c; 

J esc 2 x dx = —cot jc -he; 

J sec jc tan jc dx = sec jc + c; 

J esc jc cot jc dx = — esc jc + c. 
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9.4 

THE DERIVATIVES OF 
THE OTHER FOUR 
FUNCTIONS 


The results of Problem 31 in Section 9.3 enable us to complete our list of for¬ 
mulas for differentiating the trigonometric functions: 


d 

■j du 

(i) 

dx 

tan u = sec z u —; 

dx 

d 

-> du 

( 2 ) 

dx 

cot u = -csc z u —; 

dx 

d^ 

du 

(3) 

dx 

sec u = sec u tan u 

dx 

d_ 

du 

(4) 


esc u — —esc u cot u —— 

dx 

dx 


These formulas are quite easy to remember if we notice that the derivative of 
each cofunction (cot, esc) can be obtained from the derivative of the corresponding 
function (tan, sec) by (a) inserting a minus sign, and (b) replacing each function 
by its cofunction. Thus, formula (2) is obtained from formula (1) by inserting a 
minus sign, replacing tan u by cot u, and replacing sec u by esc u . In view of 
this rule, it is only necessary to memorize formulas (1) and (3), because the rule 
immediately produces the other two. 

Example 1 Find dy/dx if y = tan 3 Ax. 

Solution Since y — tan 3 Ax = (tan Ax) 3 , the power rule gives 

-j- = 3(tan Ax) 2 ■ - 7 - tan Ax. 
dx dx 

By formula (1) with u = Ax, 

d 

— tan Ax = (sec^ 4;t)(4), 
dx 

and by putting the various pieces together we obtain 

= 12 tan 2 4* sec 2 Ax. 
dx 


Example 2 Find dy/dx if y = cot (1 — 3 jc). 

Solution By formula (2) with u = 1-3*, 

^ = -CSC 2 (1 - 3x) ■ (-3) = 3 CSC 2 (1 - 3.V). 
dx 

The differentiation formulas (1) to (4) immediately produce four new integra¬ 
tion formulas: 


j* sec 2 u du = tan u + c; 
j* esc 2 u du = —cot u + c; 
j* sec u tan u du = sec u + c; 


(5) 

( 6 ) 


( 7 ) 
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CSC u cot u du = —CSC u + c. (8) 

Example 3 Calculate f sec 3 jc tan 3 jc dx. 

Solution This reminds us of (7) with u = 3x, so we write 

sec 3x tan 3x dx = -j J sec 3x tan 3x d{3x) = y sec 

In this problem the structure of the integral is clear enough 
real need to make an explicit change of variable. 

Example 4 Evaluate J 3x sec 2 x 2 dx. 

Solution This reminds us of (5) with u = x 2 . Since du = 2x dx and x dx = \ du, 
we have 

j 3x sec 2 x 2 dx = 3 J sec 2 u • du^j = y J sec 2 u du 
= f tan u + c = f tan x 2 + c. 

Here we use the auxiliary variable u as insurance against error. After students 
have acquired a bit of experience with problems of this type, they will prefer to 
carry out the integration directly, by inspection. 


Example 5 Calculate J tan 2 2x dx. 

Solution This integral doesn’t resemble any of our types. However, the trigono¬ 
metric identity tan 2 2x + 1 = sec 2 2x connects our problem with formula (5). 
Once this fact is noticed, we easily write 

j* tan 2 2 x dx = J (sec 2 2x — 1) dx = J sec 2 2 x dx — J dx 

= jj sec 2 2x d(2x) - J dx = y tan 2x — x + c. 


3x + c. 

so that there is no 


PROBLEMS 


In each of Problems 1-12, 
1 y = tan 4x 2 . 

3 y = tan 2 (sin x). 

5 y = sec 2 x — tan 2 x. 
7 y = 4 esc ( — 6x). 

9 y = Vcsc 2x. 

11 y = e tan* 


calculate dy/dx. 

2 v = cot 4x. 

4 y = 3 cot (1 — x 3 ). 

6 y = 2 sec 3x. 

8 y = (cot x + esc x ) 2 
10 y = cot (cos x). 

12 y = In (esc x). 


Evaluate the integral in each of Problems 13-20. 


13 


i 


esc 2 6x dx. 



sec 2 2x dx. 


15 

17 

19 

21 

22 


! 

I 

I 


dx 

sin 2 2x 

tan 4 x sec 2 * dx. 
cot lx esc 1 x dx. 


16 

18 

20 


J sec 2 \x dx. 

sec 2x tan 2x dx. 
sec 7 x tan x dx. 


Ctt/6 

Jo 5 


Find the area bounded by the curve y = tan x sec 2 
the x-axis, and the line x = tt/4. 

Find the area in the first quadrant bounded by y = 
sec 2 y = 8 cos x, and the y-axis. 


14 
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23 Find the area in the first quadrant bounded by y = 
sec 2 x, y = 2 tan 2 x, and the y-axis. 

24 The region bounded by the curve y = tan x, the x-axis, 
and the line x = ttI?> is revolved about the x-axis. Find 
the volume of the solid of revolution generated in this 
way. 

25 Sketch the graph of the function y = tan x + cot x on 
the interval 0 < x < 7t/2 and find its minimum value. 

26 Solve Problem 25 without calculus, by using the iden¬ 
tity 

2 

tan x + cot x = ———. 

sin 2x 

*27 Sketch the graph of the function y = 8 esc x — 4 cot x 
on the interval 0 < x < 7t/2 and find its minimum value. 
Is there a point of inflection? 

*28 The classic corridor problem (Problem 29 in Section 
4.3) can be expressed as follows. If two corridors of 
widths a and b meet at right angles (Fig. 9.17), then the 



length of the longest thin rod that can be moved in a 
horizontal position around the comer is the length of 
the shortest line segment placed like the one in the fig¬ 
ure. Find this length by using the angle 0 as the inde¬ 
pendent variable. 

29 A revolving light 6 mi offshore from a straight shore¬ 
line makes 4 revolutions per minute. How fast is the 
spot of light moving along the shore at the instant when 
the beam makes an angle of 30° with the shoreline? 

*30 A rope with a ring at one end is looped over two pegs 
in a horizontal line. The free end is passed through the 
ring and has a weight suspended from it, so that the rope 
is held taut. If the rope slips freely through the ring and 
over the pegs, then the weight will descend as far as 
possible in order to minimize its potential energy. Find 
the angle formed at the bottom of the loop. 


31 (Arterial branching) In the flow of blood through a hu¬ 
man artery as discussed in Example 3 of Section 7.4, 
the physical resistance to the flow is called vascular re¬ 
sistance. This quantity is denoted by R and defined to 
be the ratio of the driving pressure P to the flow F: 



By rearranging Poiseuille’s law in the example referred 
to, we find that 



where L and r are the length and radius of the artery 
and k = 817/77 is a constant determined by the viscosity 
17 of the blood. Figure 9.18 shows an artery with radius 
r\ branching into a smaller artery with radius ^ 



Figure 9.18 


(a) Show that the total resistance along the path ABC 
is 

„ . a — b cot 6 b esc 6 

R= k -+ k -— 

r 1 >2 

(b) Show that the resistance along the path ABC is min¬ 
imized when 



(c) In part (b), find the value of 6 to the nearest degree 
if the radius of the branch artery is four-fifths the 
radius of the larger artery. 

tFor students who know a bit about electric circuits, we point out that 
this concept of vascular resistance, when rewritten in the form F = 
P/R , is precisely analogous to Ohm’s law / = E/R , which relates the 
current / in a circuit to the electromotive force E and the electrical 
resistance R. 
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Our attention in this section is focused on the two integration formulas 


i 


dx 

Vi - .i 2 


sin 1 x 


0 ) 


and 


r dx 

J IT 1? 


= tan 1 x. 


( 2 ) 


The unfamiliar functions on the right sides of these equations will be fully ex¬ 
plained below. They are called inverse trigonometric functions , and are created 
expressly to enable us to calculate the integrals on the left. These functions have 
other uses, but this is their primary purpose, the main justification for their ex¬ 
istence. 

Before we start at the beginning and give a careful and orderly description of 
these functions, we pause briefly to understand in a rough way how they origi¬ 
nate. The dif ficulty w ith the integral on the left of (1) is caused by the awkward 
expression vT - x 2 in the denominator. If we consider this obstacle for a mo¬ 
ment, the inside quantity 1 - x 2 might make us think of the trigonometric ex¬ 
pression 1 - sin 2 9 , which of course equals cos 2 9. Thus, if we write 

x = sin 6, (3) 


then we have Vl - x 2 = V 1 - sin 2 6 = Vcos 2 6 = cos 6 , and the square root 
sign disappears. But we also have dx = cos 9 d9, so we can unravel our trou¬ 
blesome integral as follows: 


/ 


dx 

Vl -x 2 


cos 9 d9 
cos 9 



(4) 


The process of solving (3) for 9 in terms of x is symbolized by writing 9 = 
sin -1 x, so (4) yields (1). A similar analysis can be applied to (2), but these re¬ 
marks are perhaps enough to make our point about the way the inverse trigono¬ 
metric functions arise—they are forced upon us by the need to calculate certain 
integrals. Now for the details that make these functions respectable. 


THE INVERSE SINE 

We know that sin 7t/6 = Thus, if we are asked to find an angle (in radian mea¬ 
sure) whose sine is y, we can answer at once that 7t/6 is such an angle. We are 
also aware that there are many other angles with this property. 

As we have just seen, it is necessary in calculus to have a symbol to denote 
an angle whose sine is a given number x. There are two such symbols in every¬ 
day use, 

sin -1 x and arcsin x. 

These notations are fully equivalent to each other and can be used interchange¬ 
ably, though we shall confine ourselves to the first. The first is read “the inverse 
sine of x,” and the second “the arc sine of x,” and both mean “an angle whose 
sine is x.” It is essential to understand that in the symbol sin -1 x, the — 1 is not 
an exponent, and therefore sin " 1 x never means l/(sin x). We discuss the reason 
for this seemingly strange notation in Remark 2. 
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These ideas can be summarized as follows: The formulas 
x = sin y and y = sin -1 x 
mean exactly the same thing, in the sense that 

x = 3 y and y = }jc 

mean exactly the same thing. In each case the equation is first written in a form 
solved for x, and then (the same equation!) in a form solved for y. 

In order to sketch the graph of y — sin - 1 jc, it suffices to sketch x = sin y with 
y treated as the independent variable—on the horizontal axis—and then to turn 
the picture over, returning the axes to their customary positions (Fig. 9.19). It is 
clear that y exists only when x lies in the interval — 1 < x < 1. However, for any 
such jc there are infinitely many corresponding y’s, and this situation cannot be 
allowed if y = sin -1 jc is to be considered a function. (Recall that a function is 
single-valued by the very definition of the concept.) We deal with this difficulty 
by means of a universally understood agreement: The only values of y = 
sin -1 jc we consider are those that lie in the interval — tt/2 < y < 7t/2, and this 
restriction is henceforth part of the meaning of the symbol y = sin” 1 jc. The graph 
of the function y = sin -1 jc (it is truly a function now, because of the restriction 
just described) is the heavy portion of the curve in Fig. 9.19. 


THE INVERSE TANGENT 

The function y = tan -1 jc (the other notation here is y = arctan jc) is defined in 
essentially the same way: 

-1 . 77 7T 

y = tan 1 jc means jc = tan y and —— < y < 

The symbol tan -1 jc is read “the inverse tangent of jc,” and it means “the angle 
(in the specified interval) whose tangent is jc.” The graph of the function y = 
tan -1 jc is the heavy curve in Fig. 9.20. 

We now calculate the derivative dy/dx of the function y = sin -1 jc by differ¬ 
entiating 


jc = sin y 


implicitly with respect to jc. The result is 


1 


dy 

cos y —, 
ax 


so 


dy = 1 1 = _1_ 

dx cosy Vl — sin 2 y Vl - x 2 ' 

We choose the positive square root here because y = sin -1 x is clearly an in- 
creasing function (see Fig. 9.19). This result can be written in the form 

d . _i 1 

— sin 1 jc = — f 


( 5 ) 
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where — 1 < jc < 1. In just the same way we find the derivative of y = tan 1 jc 
by differentiating 

jc = tan y 

implicitly with respect to x. This gives 

i 2 d y 
1 = sec 2 y —, 
ax 

so 


dy = 1 1 _1_ 

dx sec 2 y 1 + tan 2 y 1 + jc 2 ' 

We therefore have 


d . 

— tan 1 jc = 
dx 


_J_ 

1 + JC 2 


( 6 ) 


for all jc. 

Formulas (5) and ( 6 ) are the facts that lead to the main tools of this section. 
First, we have the chain rule extensions of these formulas, which greatly broaden 
their scope: 


— sin 1 u = 
dx 


1 du 
VI - u 2 dx 


(7) 


and 



1 du 
1 + a 2 dx' 


( 8 ) 


As usual, u is understood to be any differentiable function of jc. 


Example 1 Find dy/dx for each of the following functions: 

(a) y = sin -1 4jc; (b) y = sin -1 jc 3 ; (c) y = y tan -1 (3jc - 5). 

Solution For (a), we use (7) with u = 4jc, so 

dy 1 d 4 

d x V1 — (4jc) 2 dx V1 — 1 6jc 2 

For (b), we use (7) with u — jc 3 , so 

dL = 1 A U 3) = 3 ^ 2 . 

dx Vl - U 3 ) 2 ^ 

For (c), we use (8) with u — 3jc — 5, so 

dy_ = 1 1 _ d_ n cx _ 1 

dx 3 1 -h (3 jc — 5 ) 2 dx y X } 1+(3jc-5) 2 ' 


Even more important for our future work are the integration formulas equiv¬ 
alent to (7) and ( 8 ): 


/ 


du 

V I - u 2 


= sin 1 u + c 


(9) 
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and 


r du 

J 1 + u 2 ~ 


tan 1 u + c. 


( 10 ) 


These formulas are indispensable tools for integral calculus, and all by them¬ 
selves amply justify the study of trigonometry. 


Example 2 Calculate each of the following integrals: 


(a) / 


dx 


V1 - 9x 2 ' 


(b) /- 


dx 


+ 25a: 2 


(c) f 


5x 2 dx 
1 + 4jc 6 


Solution (a) Put u = 3jc. Then du = 3dx, and by (9) 


dx 


-i 


1 Vl -9a: 2 
(b) Put u = 5x. Then du = 5dx, and by (10) 

dx c 1 du 


y du j j 

— , = — sin - 1 m + c = — sin -1 3x + c. 

vT ~ 2 3 3 


_= [ 

J 1 + 25a 2 J 


— -r = — tan 1 u + c = — tan 1 5a + c. 

1 + 5 5 


(c) To get started here we must notice that 4x 6 = (2x 3 ) 2 . This suggests putting 
u = 2jc 3 . Then du = 6x 2 dx , and by (10) 


r 5 jc 2 <& _ r [ 

J 1 + 4jc 6 ~ 5 J 


6 ® 5 5 

--- = — tan -1 u + c = — tan 1 2 jc 3 + c. 

1 + u 2 6 6 


The crucial feature of this integral is clearly the presence of x 2 in the numerator, 
for without this factor the method we have used would be unworkable. 


Remark 1 As students doubtless suspect, four other inverse trigonometric func¬ 
tions can be defined if we wish to do so. However, these functions are not really 
needed for the purpose of integration. We can illustrate this point by observing 
that if u > 0 then 


du 


-i 


du 


mV u 2 — 1 mVm 2 ( 1 — Mu 2 ) m 2 V 1 — (1/m) 2 


‘i 


du 


Vl - (1/m) 2 u 

(If u < 0, the factor u 2 under the radical in the second step comes out of the rad- 
ical as — u.) This integral is a standard type that many writers integrate by using 
the inverse secant—which this calculation clearly shows to be superfluous. The 
fact of the matter is that the inverse sine and the inverse tangent suffice for all 
our purposes for calculating integrals, so for the sake of simplicity we ignore the 
other inverse trigonometric functions. (The notation cos -1 jc will occasionally be 
used, but only for convenience in designating the angle between 0 and it whose 
cosine is jc, where jc is a number between 1 and —1.) 



Remark 2 Suppose that a variable jc is a function of a variable y as shown on 
the left in Fig. 9.21. In this case, not only does each y (in a certain interval) de- 
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Figure 9.21 


termine a unique x, but also each x determines a unique y. Thus, y is also a func¬ 
tion of x. If the given function is written x = f(y), then the second function is 
often called the inverse function of the first and denoted by the symbol y — f~\x) 
[read “/inverse of x”]. The graph of y = / -1 (x) is simply the graph of x =f(y) 
turned over as shown on the right in Fig. 9.21, so that the axes are returned to 
their normal positions. The essence of this situation is that when two functions 
are related in this way, then each undoes what the other one does, in the sense 
that 

= y and f(J~\x)) = x. 

It is this reciprocal relation that is suggested by the word “inverse” and the sym¬ 
bol/ -1 . We have encountered inverse functions in Chapter 8 and also in this sec¬ 
tion, but we have no special need to develop the subject in detail. We do point 
out, however, that any increasing or decreasing function x = f(y) obviously has 
an inverse; and it can be proved that if either function has a nonzero derivative 
at a point, then so does the other and 

dy _ 1 

dx dxldy 

Here again—as in the case of the chain rule—we have a situation in which the 
Leibniz fractional notation for derivatives strongly suggests a true theorem in the 
guise of a simple manipulation of differentials. 

Remark 3 Formula (10) leads rather quickly (though unrigorously) to the fa¬ 
mous Leibniz formula 


77 

T 



(ii) 


which connects the number tt with the odd numbers 1, 3, 5, 7, .... To see this 
connection, we begin with the formula from elementary algebra for the sum of 
a geometric series, 

1 + r + r 2 + r 3 + • ■ ■ = —-—. (12) 

1 - r 


(The reader will perhaps recall from high school algebra that this formula is valid 
for \r\ < 1, but here we pay little attention to such cautionary details.) If r in (12) 
is replaced by - 1 2 and the resulting equation is reversed, we get 
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= 1 - t 2 + t 4 - t 6 + 


1 + t 2 

We now apply (10) to obtain 

tan -1 x = f — , = [ \ \ - t 2 + t 4 - t 6 + 
Jo l + r 2 Jo 


I dt 


(13) 


= *“T + T“T + 


which yields Leibniz’s formula (11) when x = 1. These ideas and the legitimacy 
of these procedures will be studied much more carefully in Chapters 13 and 14. 


PROBLEMS 


1 Given that 9 = sin -1 (-^), find cos 9, tan 0, cot 0, 
sec 0, esc 6. 

2 Given that 6 = tan -1 V3, find sin 6, cos 6 , cot 0, sec 6 , 
esc 6. 

3 Evaluate each of the following: 

(a) sin -1 1 - sin -1 (-1); (b) tan -1 1 - tan -1 (—1); 

(c) sin (sin -1 0.123); (d) cos (sin -1 0.6); 

(e) sin (2 sin -1 0.6); (f) tan -1 (tan 7t/7); 

(g) sin -1 (sin 57t/6); (h) tan -1 (tan [ —3 tt/4]). 


Find dy/dx in each of Problems 4-13. 

5 y = j tan -1 jx. 


9 

y = 

x sin - 

] X+V\ - X 2 . 

10 

y = 

x tan - 

1 jc - In Vl + x 2 . 

11 

y = 

x(sin - 

1 jc) 2 — 2jc + 2 V1 — jc 2 sin -1 jc. 

12 

y = 

i(sin - 

1 jc + jcV 1 — X 2 ). 

13 


ton 1 

4 sin x 

y = 

Idll 

3 + 5 cos x * 

14 

If a 

is a positive constant, show that 


4 y = sin 1 \x. 


6 y = y tan 1 x 2 . 


8 y = tan 


x + 


i 


du 

Va 2 - u 2 


. U 

sin 1 —he 
a 


and 


f du 
J a 2 + u 2 


1 u 

= — tan 1 —he. 
a a 


These simple generalizations of formulas (9) and (10) are 
often more convenient in applications. 


Evaluate the integrals in Problems 15-25. 

15 f 1/2 —*L— 16 (' _A_ 

Jo Vj _^2 J -1 1 + V 


17 

19 

21 

23 

25 

26 


dx 

,8 j 

r dx 

Vl - 4x 2 

1 1 +3V 

1/2 dx 

20 j 

f x dx 

) 1 + Ax 2 ' 

1 1 + 4x 4 ' 

dx 

22 j 

f dx 

V9 - 4x 2 

V16 - 9x 2 

dx 

24 j 

r 2 dx 

4 + 9x 2 ' 

^ - 1 


-Vi dx 


1-2 xV^\' 


A picture hangs on a wall with its base a feet above the 
level of an observer’s eye. If the picture is b feet high 
and the observer stands * feet from the wall, show that 
the angle 9 subtended by the picture is given by the for¬ 
mula 

- . a + b a 

9 = tan 1 -tan 1 —. 

x x 


What value of jc maximizes this angle? 

27 The points (1, 2) and (2, 1) in the first quadrant are joined 
by two segments to a variable point (0, y) on the y-axis, 
where y < 3. If 9 denotes the angle between these seg¬ 
ments, what is the largest value 9 can have? 

28 A balloon is released at eye level and rises at the rate of 
5 ft/s. An observer 50 ft away watches the balloon rise. 
How fast is the angle of elevation increasing 6 seconds 
after the moment of release? 

29 The top of a 15-ft ladder is sliding down a wall. When 
the base of the ladder is 9 ft from the wall, it is sliding 
away at the rate of 3 ft/s. (a) What is the angle between 
the wall and the ladder at that moment? (b) How fast is 
the angle increasing at that moment? 

30 Sketch the curve y = 1/(1 + x 2 ). Find the area of the re¬ 
gion under this curve between jc = 0 and x = b y where 
b is a positive constant. Find the limit of this area as 
b —> °o. 
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31 Comment on the legitimacy of the formula 


vT^T 

32 Sketch the curve y = 1 /V1 - x 2 on the interval 0 £ 
x < 1. Find the area under this curve between x = 0 


r 


and x = b where 0 < b < L Find the limit of this area 
as b —> 1. 

33 The circle x 2 + y 2 = a 2 is revolved about the line y = a. 
Find the area of the resulting surface of revolution. 


Most people understand that sound is vibration, and for this reason alone the 
study of vibrations is an important part of science. But vibrations—or oscilla¬ 
tions, or waves, or periodic phenomena generally—are much more pervasive 
than this. They appear in many contexts having little to do with sound, for in¬ 
stance in connection with radio waves, light waves, alternating electric currents, 
the vibration of atoms in crystals, etc. The study of vibrations in this broader 
sense is clearly one of the most fundamental themes of physical science, and in 
any such study sines and cosines play a central role. 

One of the simplest types of vibrations occurs when an object or point moves 
back and forth along a straight line (the jc-axis) in such a way that its accelera¬ 
tion is always proportional to its position and is directed in the opposite sense: 

d 2 x 

■TT = ~ kx ' k > °- 0) 


9.6 

SIMPLE HARMONIC 
MOTION. THE 
PENDULUM 


Motion of this kind is called simple harmonic motion. To emphasize that the con¬ 
stant k is positive, it is customary to write k = a 2 with a > 0. The differential 
equation (1) then takes the form 

d 2 x 9 

-IT +a 2 x = 0. (2) 


It is easy to see that any function of the form 

x = A sin (at + b), A ± 0, (3) 

satisfies equation (2).* We merely calculate 
dx d 2 x 

— — Aa cos (at + b) and ~jy = ~Aa 2 sin (at + b) = —a 2 x, 


and observe that 


fx 

dt 2 


+ a 2 x = 0. 


It is equally true, though not so easy to see, that every nontrivial solution of (2) 
can be written in the form (3). We will demonstrate this in Remarks 1 and 2, but 
meanwhile we take it for granted. 

Since the function sin (at + b) oscillates between —1 and 1, the function (3) 
oscillates between —\A\ and \A\. The number \A\ is called the amplitude of the 
motion (Fig. 9.22). Also, since the sine is periodic with period 277, sin (at + b ) 
is periodic with period 2ida, because this is the amount t must increase in order 
to increase at + b by 2tt. This number T = 2ida is called the period of the mo¬ 
tion, and is the time required for one complete cycle. If we measure t in seconds, 



*We add the condition A =£ 0 to avoid the trivial case in which x is identically zero and consequently 
there is no motion. 
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then the number/of cycles per second satisfies the equation fT = 1, and is there¬ 
fore the reciprocal of the period, 


This number is called the frequency of the motion. 

Another equivalent form of the general solution (3) that is often useful is 

x = A cos (at + b). (4) 


This is easily seen from the fact that b in (3) is an arbitrary constant, and can 
therefore be replaced by the equally arbitrary constant b + tt/2. This gives 

Jt = A sin [at + b + -^J = A cos (at + b), 



Figure 9.23 


since sin (0 + 7t/2) = cos 6. 

There are two main interpretations of simple harmonic motion, one geometric 
and the other physical. 

The geometric meaning can be understood by considering a point P that moves 
with constant angular velocity around a circle of radius A (Fig. 9.23). If this con¬ 
stant angular velocity is denoted by a, then 

d0 - , .. 

I = a and therefore 6 = at + b, 
dt 

where b is the value of 6 when t = 0. If Q is the projection of P on the x-axis, 
then its x-coordinate is 


x = A cos 6 = A cos (at + b). 

This shows that Q moves back and forth along the x-axis in simple harmonic mo¬ 
tion as P moves steadily around the circle in uniform circular motion, and any 
simple harmonic motion can be visualized in this way. 

The physical meaning appears when we think of equation (1) as describing the 
motion of a body of mass m rather then merely a point. Newton’s second law of 
motion says that F = ma, so equation (1) becomes 


— F = —kx or F = -kmx . 
m 


A force F of this kind is called a restoring force, because its magnitude is pro¬ 
portional to the displacement x and it always acts to pull the body back toward 
the equilibrium position x = 0. We discuss the idea more fully in our first two 
examples. 


F = -kx 


m 


0 , O 


Example 1 Consider a cart of mass m attached to a nearby wall by means of a 
spring (Fig. 9.24). The spring exerts no force when the cart is at its equilibrium 
position x = 0. If the cart is pulled aside to a position x, then the spring exerts a 
restoring force F = -kx, where k is a positive constant whose magnitude is a 
measure of the stiffness of the spring (see Example 1 in Section 7.7). Suppose 
that the cart is pulled out to the position jc = jco and released without any initial 
velocity at time t = 0. Discuss its subsequent motion if friction and air resistance 
are negligible. 


Figure 9.24 
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Solution We are assuming that the only force acting on the cart is the restoring 
force F = —kx, so by Newton’s second law of motion we have 


d 2 x 

= -kx 


or 


d 2 x k 

~yy H-.V = 0. 

dt 2 m 


It is convenient to write this equation as 

d 2 x 


dt 2 


+ a 2 x = 0, 


where a = Xk/m. The form of the general solution we prefer here is 

x - c i sin at + C 2 cos at , (5) 

which can be obtained by expanding either (3) or (4). The initial conditions 


X = Xq 


and 


dx 

v = — = 0 
dt 


when 


t = 0 


imply that C 2 — *o and c\ — 0, so (5) becomes 

x = xq cos at. 


It is clear from th is th at the cart moves in simple harmonic motion with period 
T = lirla — 2 ttX m/k and frequency 


1 = _L [± 

T 2i t \ m 


( 6 ) 


We see from (6) that the frequency of this vibration increases if the stiffness k 
of the spring is increased, and decreases if the mass m of the cart is increased, 
as our common sense would have led us to expect. 


Example 2 Suppose that a tunnel is bored straight through the center of the earth 
from one side to the other, and that a body of mass m is dropped into this tun¬ 
nel. Assuming as usual that the earth is a perfect sphere of uniform density and 
radius R of about 4000 mi, the effect of gravity is such that the body is attracted 
toward the center of the earth by a force F proportional to its distance x from the 
center (Fig. 9.25).* Show that the body traverses the tunnel from one end to the 
other and back again with simple harmonic motion, and calculate the period of 
this motion. 

Solution Clearly F — -kx for a suitable constant k. To find the value of this 
constant we use the fact that F — —mg at the surface of the earth, where x = R, 
so 


-mg = —kR or k = 

K 

Newton’s second law of motion therefore takes the form 



Figure 9.25 


d 2 x 

m ~dt 2 


mg 

R 


or 


d 2 x g 

—y + y~ x = 0. 
dt 2 R 


*The reason for this law of force will be explained in a later chapter, in connection with triple inte¬ 
grals in spherical coordinates. 
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No further discussion is need ed in order to conclude that this is simple harmonic 
motion with period 2i t\/R ig. A sequence of easy approximate calculations gives 


2i t ^ = 6.3 


/ 4000 • 5280 ft 
32 ft/s 2 




500 • 5280 


= 6.3 


500 • 5280 
' 4•3600 


min = 6.3 V200 


min 


= 3 “ ■ 14 min = 89 min. 


The period is of course the total time required for a round trip through the tun¬ 
nel to the other side of the earth and back again. A one-way trip requires only 
about 45 minutes, and the journey to the center of the earth only about 22 min¬ 
utes. 



Example 3 A pendulum consists of a bob (a weight) suspended at the end of a 
light string and allowed to swing back and forth under the action of gravity. As 
usual, we idealize the situation and consider a particle of mass m at the end of 
a weightless string of length L (Fig. 9.26). Find the period of this pendulum un¬ 
der the assumption that its oscillations are small. 


Solution The downward force of gravity on the bob is mg , and this has a com¬ 
ponent mg cos (f) = mg sin 6 tangent to the path. Since s = L6, the tangential ac¬ 
celeration of the bob is 

d 2 s = d 2 (L6) = d 2 ^ 

dt 2 dt 2 dt 2 ’ 


and Newton’s second law applied to the motion of the bob along its circular path 
is 


d 2 6 

mL = —mg sin 6 


d 2 6 
dt 2 



6 = 0 . 


(7) 


The presence of sin 0 makes this differential equation impossible to solve ex¬ 
actly, and the motion is not simple harmonic. However, for small oscillations we 
recall that sin 6 is approximately equal to 6 , so (7) becomes (approximately) 


d 2 6 
dt 2 


+ j 6 = 0. 


This equation tells us that the angular motion is approximately simple harmonic 
with period 27rVZ/g. When these ideas are analyzed in more detail, it turns out 
that the period of this oscillation actually depends on the amplitude of the mo¬ 
tion, and this is the source of the so-called circular error in pendulum clocks. 


Remark 1 We return to the matter of proving that (3) is indeed the general so¬ 
lution of (2). By Problem 19 in Section 9.2 we know that every nontrivial solu¬ 
tion of (2) has the form 


x = c\ sin at + c 2 cos at, 


( 8 ) 
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where the constants c\ a nd c 2 are not both zero. To write (8) in the form (3), we 
begin by setting A = Vci 2 + c 2 2 . Then the point ( c\/A , c^lA) is a point on the 
unit circle, and therefore there is an angle b such that 

, C\ , • , c 2 

cos b = — and sin b = —. 

A A 

These equations now enable us to write (8) as 

jc = A(sin at cos b + cos at sin b) 

= A sin ( at + b), 

by the addition formula for the sine. 

Remark 2 It is also possible to obtain (3) directly from (2), as follows. If we 
write 


d 2 x dv dv dx dv 

dt 2 dt dx dt dx ’ 


(9) 


then (2) becomes 

v + a 2 x = 0 or v dv + a 2 x dx = 0, 
dx 

and by integrating we get 

v 2 + a 2 x 2 = a constant or v 2 + a 2 x 2 = a 2 A 2 , 

where A is the positive value of jc at which v = 0. This yields 

-j- = v = ±aV A 2 - x 2 or ^ _ = ±a dt , 

dt Va 2 ^ 2 

where the choice of sign here depends on whether the velocity v is positive or 
negative at the moment. We suppose for definiteness that ^>0 and integrate 
again to obtain 

x x 

sin -1 — = at + b or — = sin (at + b ), 


so 

which is (3). 


x = A sin (at + b). 


PROBLEMS 


1 In each of the following motions calculate the amplitude 
and period by rewriting in the form x = A sin (at + b). 

(a) jc = 5 sin r — 5 cos t\ 

(b) jc = V3 cos 3 1 - sin 3r; 

(c) jc = sin r + cos t\ 

(d) jc = 2V3 sin 2r - 2 cos 2 1 . 


2 In any simple harmonic motion of the form (3), show 
that the velocity vis related to the position jc by the equa¬ 
tion 

v 2 = a 2 (A 2 — jc 2 ). 

Deduce that the speed is greatest when the body passes 
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through its equilibrium position, and is zero at the ends 
of the interval, where the body reverses the direction of 
its motion. 

3 In Example 1, suppose the spring is stretched 3 in by a 
force of 6 lb. If the cart weighs 12 lb, and if it is pulled 
out 4 in from its equilibrium position and struck a sud¬ 
den blow sending it back toward its equilibrium position 
at a velocity of 3 ft/s, find the amplitude and period of 
the resulting simple harmonic motion. Hint: Recall that 
mass is weight divided by g. 

4 A body in simple harmonic motion passes through its 
equilibrium position at t = 0, 1,2,.... Find a position 
function of the form (3) if v = dx/dt = — 3 when t = 0. 

5 Suppose that a straight tunnel is bored through the earth 
between any two points on the surface. If tracks are laid, 
then—neglecting friction — a train placed in the tunnel 


at one end will roll through the earth under its own 
weight, stop at the other end, and return. Show that the 
time required for a complete round trip is the same for 
all such tunnels, and estimate its value. 

*6 A spherical buoy of radius r floats half submerged in wa¬ 
ter. If it is depressed slightly, Archimedes’ principle tells 
us that a restoring force equal to the weight of the dis¬ 
placed water presses it upward; and if it is released, it 
will bob up and down. Show that if the friction of the 
water is negligible, then the motion will be simple har¬ 
monic, and find its period. 

7 People who manufacture grandfather clocks have a pro¬ 
fessional interest in pendulums that take I second for 
each swing and thus have a period of 2 seconds. Esti¬ 
mate the length of such a pendulum. 


9.7 

(OPTIONAL) 

HYPERBOLIC 

FUNCTIONS 


The hyperbolic functions are certain combinations of exponential functions that 
occur in various applications, with properties similar to those of the trigonomet¬ 
ric functions. The reason for the name will be made clear below. 

The two basic hyperbolic functions are the hyperbolic sine and hyperbolic co¬ 
sine, defined by 

sinh x = \(e x — e~ x ) and cosh x = j(e x + e~ x ). (1) 


The symbol “sinh” is pronounced “cinch,” rhyming with “pinch,” and “cosh” 
rhymes with “gosh.” Just as in trigonometry, inspection of the definitions shows 
at once that these functions have the simple properties 

sinh (— x) = —sinh x and cosh (-x) = cosh x. (2) 


On the other hand—in contrast to the sine and cosine—these functions are not 
periodic. The other four hyperbolic functions are less often used and are defined 
by identities analogous to those of trigonometry: 

, sinh x , 1 

tanh x =-:—, sech x =-;—, 

cosh x cosh x 


coth x = —;—, 
tanh x 


csch x = —rz —. 

sinh x 


There are numerous relations among these functions that are similar to the 
trigonometric identities. Also, their derivatives and integrals resemble those of 
the corresponding trigonometric functions. However, there is a different pattern 
of algebraic signs, and close attention is needed in order to avoid making mis¬ 
takes. Since these identities and formulas are so easily confused with those of 
trigonometry—which are much more important—we do not recommend that 
students make an effort to learn them. 


IDENTITIES 

Among the many identities, we mention only the ones most frequently used: 

cosh 2 x - sinh 2 x = 1, (3) 
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cosh 2 x + sinh 2 x = cosh 2x, (4) 

sinh (x + y) = sinh x cosh y + cosh x sinh y, (5) 

cosh (x + y) = cosh x cosh y + sinh x sinh y. (6) 

Each of these, and many others, can be proved directly from the definitions (1). 
Thus, in the case of (3), 

cosh 2 x - sinh 2 x = \(e x + e~ x ) 2 — }(e x - e~ x ) 2 

= }[e 2x + 2 + e" 2 * - (e 2x - 2 + e- 2 *)] 


This identity can also be established by multiplying together 

e x = cosh x + sinh x and e~ x = cosh x — sinh x, 
which are the results of adding and subtracting equations (1). 

WHY THE NAME? 

If t is any real number, then the point P = (cos t, sin t) lies on the unit circle 
x 2 + y 2 = 1, because 

cos 2 t + sin 2 t = 1. 

In fact (see Fig. 9.27), the definitions of the sine and cosine allow us to interpret 
t as the angle AOP from OA to OP, where the angle is understood to be mea¬ 
sured in radians. For this reason the sine and cosine are often called circular func¬ 
tions. 

Similarly, for any real number t the point P = (cosh t, sinh t) lies on the right 
branch of the curve x 2 — y 2 = 1 (see Fig. 9.28), because 

cosh 2 t - sinh 2 t = 1 


and cosh t > 0. We shall see later that this curve is called a hyperbola, and ac¬ 
cordingly cosh t and sinh t are called hyperbolic functions. This time the vari¬ 
able t is not the angle AOP. However, it turns out (see Problem 31) that t is twice 
the area of the shaded hyperbolic sector in the figure, just as in the trigonomet¬ 
ric case t is twice the area of the shaded circular sector in Fig. 9.27. 

DERIVATIVES AND INTEGRALS 

The derivatives are similar to those of the trigonometric functions, but not ex¬ 
actly the same: 




d . . , du 

—— sinh u = cosh u —, 
dx dx 

d , . , du 

— cosh u = sinh u —, 
dx dx 

- 7 - tanh u = sech 2 u 
dx dx 


(7) 

( 8 ) 
(9) 


The first, for example, comes from 

d 


d 1 

— sinh x = — 
dx 2 


dx 


(e* - e x ) = y(e r + e x ) = cosh x. 
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By turning these around we have the integration formulas 

j cosh u du = sinh u + c, 

j* sinh u du = cosh u + c, 

and so on. In the case of tanh u we have 

, j f sinh u du [ d(c osh w) , , , x , 

tanh u du — I---= ---= In (cosh u) + c. 

J J cosh u J cosh u 


( 10 ) 

( 11 ) 


( 12 ) 


GRAPHS 


We begin with y = cosh jc. First, the fact that cosh (— x) = cosh jc shows that the 
graph is symmetric about the y-axis, and it crosses the y-axis at the point (0, 1) 
because 

cosh 0 = \(e° + e~°) = 1. 


We see that 




— cosh x = sinh x = j(e x 
dx 


x ) = 0 


if and only if jc = 0, so the graph has a horizontal tangent at the point (0, 1) and 
at no other point. Since 


_d^_ 

dx 2 


cosh x = — sinh jc 
dx 


= cosh x = \{e x + e x ) > 0, 


the graph is concave up everywhere and therefore has the appearance shown in 
Fig. 9.29. This graph can also be obtained by geometrically adding the two curves 
y — e x and y = e ~ x —that is, by adding the two y’s for each x —and taking half 
of each resulting y-value. 

Now for the graph of y = sinh jc. This graph (Fig. 9.30) passes through the 
origin since 

sinh 0 = -j(e° ~ e~°) = 0, 

and the identity sinh (— jc) = — sinh jc shows that it is symmetric about the ori¬ 
gin. The graph is rising at every point because 

-7- sinh jc = cosh jc = j(e x + e~ x ) > 0. 
dx 

The fact that 


dx 2 


sinh x = — cosh jc = sinh jc 
dx 


= j(e x - e~ x ) is 


> 0 

for jc > 

< 0 

for jc < 


shows that the graph is concave up for jc > 0 and concave down for jc < 0. The 
point (0, 0) is the only point of inflection. Figure 9.30 shows a comparison of 
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the two graphs. The graph of y = cosh x lies above the graph of y = sinh x be¬ 
cause 

cosh x — sinh x = \{e x + e~ x ) - j(e x — e~ x ) = e~ x > 0, 
and their difference e~ x —> 0 as x —» 


INVERSE FUNCTIONS 


The inverse hyperbolic functions can be expressed in terms of logarithms, and 
therefore present nothing new. We illustrate with the hyperbolic sine. The graph 
is rising at all points (Fig. 9.30), so there exists an inverse 

y = sinh -1 x 


which is obtained by solving 

x = sinh y = \{e y — e~ y ) 
for y in terms of x. This can be written as 

2x—e y — e~ y or e 2y — 2xe y —1=0, 
which is a quadratic equation in e y . Solving this by the quadratic formula gives 

2 x ± V 4 x 2 + 4 
2 - 

= x + Vx 2 + 1, 

where the minus sign is discarded because it gives a negative value to e y . By 
solving for y we now obtain 

y = sinh “ 1 x = In (x + Vx 2 + 1). (13) 


The derivative can be found by differentiating this logarithm. A better way is to 
differentiate sinh y = x implicitly: 

, dy dy 1 1 1 

cosh y~r= 1 — = —r— = , — — = - - 

dx dx cosh y Vl + sinh 2 y Vl + x 2 

This gives the formulas 


— sinh 1 u = 
dx 


1 du 

vTT^ 2 ^ 


(14) 


and 


— = sinh 1 u + c. (15) 

J VTT^ 

We ask the student in the problems to provide a similar discussion of the inverse 
function y = tanh -1 x. 


THE CATENARY 

The most famous application of the hyperbolic functions is to the following clas¬ 
sical problem: Determine the exact shape of the curve assumed by a flexible chain 
or cable of uniform density which is suspended between two points and hangs 
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under its own weight. This curve is called a catenary\ from the Latin word for 
chain, catena . 

Let the y- axis pass through the lowest point of the chain (Fig. 9.31), let s be 
the arc length from this point to a variable point (x, y), and let wo be the linear 
density (weight per unit length) of the chain. We obtain the differential equation 
of the catenary from the fact that the part of the chain between the lowest point 
and (x, y ) is in static equilibrium under the action of three forces: the tension T 0 
at the lowest point; the variable tension T at (x, y), which acts in the direction of 
the tangent because of the flexibility of the chain; and a downward force 
equal to the weight of the chain between these two points. 

Equating the horizontal component of T to Tq and the vertical component of 
T to the weight of the chain gives 


T cos 0 = Tq and T sin 0 = w 0 s, 


and by dividing we eliminate T and get tan 6 = wqs/Tq or (since tan d = dy/dx) 

dy w ( ) 

-f- = as, where a = — 

dx T 0 

We next eliminate the variable s by differentiating with respect to x, 

d 2 y _ ds _ \4/.v 2 + dv 2 

dx 2 Q dx Q dx 


= aj 1 + 


dyV 
dx ' 


(16) 


This is the differential equation of the catenary. 

We now solve equation (16) by two successive integrations. This process is fa¬ 
cilitated by introducing the auxiliary variable m = dy/dx , so that (16) becomes 

dm r——j 

dx 


On separating variables and integrating we get 

dm 


( dm f 

Jw7=I =a J 


dx, 


V1 4- m 2 

and by (15) this yields 

sinh -1 m = ax + c t . 

Since m = 0 when x = 0 (why?), we see that c\ = 0, so sinh -1 m — ax or 

m = sinh ax. 

But m — dy/dx , so we have another differential equation to solve: 

dy 

— = sinh ax, 
dx 

dy = sinh ax dx, 

y — J sinh ax dx 
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If we now place the origin of the coordinate system at just the right level so that 
y = 1 la when x = 0, then = 0 and our equation for the catenary takes its fi¬ 
nal form, 


y = — cosh ax. 
a 


(17) 


Equation (17) reveals the precise mathematical nature of the catenary and is the 
basis for a number of practical applications to problems such as the sag of tele¬ 
phone lines and the design of suspension bridges like the Golden Gate Bridge in 
San Francisco. 


PROBLEMS 


1 Find the exact numerical value of 

(a) sinh (In 2); (b) cosh (-In 3); (c) tanh (2 In 3). 


Establish the identities in Problems 2-11. 

2 sinh (jc + y) = sinh x cosh y + cosh x sinh y. 

3 cosh (jc + y) = cosh x cosh y + sinh jc sinh y. 

4 sinh (jc — y) = sinh x cosh y — cosh x sinh y. 

5 cosh (jc — y) = cosh x cosh y — sinh x sinh y. 

6 sinh 2x = 2 sinh x cosh x. 

7 cosh 2x = cosh 2 x + sinh 2 x. 

8 2 cosh 2 x = cosh 2x + 1. 

9 2 sinh 2 jc = cosh 2jc - 1. 

10 tanh 2 jc + sech 2 jc = 1 . 

11 coth 2 x 1 — csch 2 jc. 

12 Show that (cosh jc 4- sinh jc)” = cosh nx + sinh nx for 
any positive integer n. 

In Problems 13-18, find dyldx. 

13 y = sinh jc 3 . 

14 y = cosh (5jc - 3). 

15 y = In (tanh 3jc). 

16 y = sinh 4 3jc. 

17 y = cosh 2 5 jc - sinh 2 5jc. 

18 y — tanh jc 2 . 


In Problems 19-24, find the integral. 
19 / sinh (5jc — 3) dx. 

sinh 3jc 


20 


+ cosh 3jc 


dx. 


21 /V1 + cosh jc dx. 
r e x - e~ x 

22 -— dx [compare with formula (12)]. 

J e x + e x 

23 / tanh 2 jc dx. 

24 /(sinh 2 jc + cosh 2 jc) dx. 

25 Sketch the graph of y = tanh jc by merely inspecting Fig. 
9.30. 


26 Use tanh y = x to express the inverse function y = 
tanh -1 jc in the form 


y = tanh 1 jc = 



1 + JC 

1 - jc' 


27 


Use Problem 26 to show in two ways that 

1 


_d_ 

dx 


so that 


dx 


28 

29 

30 


J 1 - jc 2 
Use formula (15) to show that 
f3 dx 


= tanh 1 jc. 


Vx 2 + 9 


= In (1 + V2). 


1 


Find the area under the catenary y = — cosh ax from jc = 
0 to jc = 1 la. a 

If an object moving on the jc-axis is repelled from the 
origin (instead of being attracted to it) with a force pro¬ 
portional to its position jc, then its motion satisfies the 
differential equation 

d 2 x ~ 

~TT = ax 
dt 2 


where a is a positive constant. Show that for any con¬ 
stants c\ and C 2 the function 

jc = ci sinh at + C 2 cosh at 

is a solution of this differential equation. 

(a) Find the solution of 



that satisfies the initial conditions jc(0) = 2 and jc'(0) = 

1. 

(b) Find the solution of 


4 


d 2 x 

dt 2 


= x 


that satisfies the initial conditions jc(0) = 1 and jc' (0) = 

2 . 
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31 Show that the area Aft) of the shaded hyperbolic sector 
in Fig. 9.28 is {t, so that t = 2Aft\ as stated. Hint: Be¬ 
gin by observing that 

, fcosh t f—2 - 

Aft) = j cosh t sinh t - V x 2 — 1 dx. 


Then show that A'ft) = { by using various identities and 
the rule for differentiating integrals given in Section 6.7. 
Finally, use the fact that Af 0) = 0. 


CHAPTER 9 REVIEW: DEFINITIONS, FORMULAS 


Think through and learn the following . 

1 Basic trigonometry (see front endpaper). 

2 The derivatives of the six trigonometric functions, and the 
corresponding integral formulas. 

3 The definitions of sin -1 x and tan -1 x. 

4 The derivatives of sin -1 x and tan -1 x, and the corre¬ 
sponding integral formulas. 


5 The differential equation + a 2 x = 0 of simple 

harmonic motion, and its solutions. 

6 The definitions of sinh x and cosh x. 


ADDITIONAL PROBLEMS FOR CHAPTER 9 


SECTION 9.2 

In each of Problems 1-18, find the derivative dy/dx of the 
given function. 


y = sin (1 - 9x). 
y = cos 2 x. 
y = cos 2 5x. 
y = cos 2 3x - sin 2 3x. 


9 y 


2 y = 7 cos fix - 13). 

4 y = cos x 2 . 

6 y = 5 sin (1 — 18x). 

8 y = cos 2 9x + sin 2 9x. 
sin x 

10 y =-. 


11 

13 

15 

17 

19 


y = x sin x + cos x. 
y = cos (cos x). 
y = cos (sin x). 
y = sin fe lnx ). 


12 

14 

16 

18 


y = Vl + sin 2x. 

y = e $in 2 x 

y = In fx sin x). 
y = In [sin (In jc)]. 


Consider the differential equation 
+ a 2 y = 0, 


a y 

dx 2 


where a is a positive constant. Use the following steps 
to prove that every solution of this equation has the 
form 


y = c\ sin ax + C 2 cos ax 

for a suitable choice of the constants c\ and C 2 . 

(a) If y = gfx) and y = hfx) are solutions, show that 
every linear combination y = c\ gfx) + C 2 hfx) is 
also a solution. 

(b) If y = ffx) is a solution, show that 

a 2 [ffx)] 2 + [ffx)] 2 = a constant. 

Deduce that if y = ffx) is a solution such that 
/(0) =/'( 0) = 0, then ffx) = 0 for all *. 

(c) If y = ffx) is any solution, show that 


ffx) = ci sin ax + C 2 cos ax 

for a suitable choice of the constants ci and C 2 - 
Hint: Apply part (b) to 

ffx) - —/(0) sin ax - ff 0) cos ax. 
a 

20 Use Problem 21 (b) in Section 9.1 to give another proof 
of the formula fd/dx) sin x = cos x. 

21 Give another proof of the limit (4) in Section 9.2 by 
the following steps: If 6 is a small positive angle (0 < 
6 < 7t/2) in the unit circle shown in Fig. 9.32, then 

(a) area A OPQ < area sector OPQ < area A OQR\ 

(b) { sin 8 < { tan 0; 
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*22 Figure 9.33 shows the familiar mechanism of a piston 
(which moves back and forth in a cylinder) attached 
at a point P to a connecting rod of length b which in 
turn is attached to a point Q on a crankshaft that ro¬ 
tates in a circle of radius a with center at O . 



Figure 9.33 


27 

lim 

X—>7T 

sin x 

7T — X 

29 

lim 

x->0 

x + tan x 

sin x 

31 

lim 

x—>0 

2x 

sin 3x' 

33 

lim 

x->0 

sin 2x 

3x 2 + x’ 

35 

lim 

x — >2 

COS 1 t/x 
X- 2 ' 


28 

30 

32 

34 

36 


lim 

>o 


lim 

*->o 


lim 

x->0 


lim 

*—>0 


lim 

X—>TT 


sin 2 x 
x 

tan 3* 

4x ' 

x cot 3x. 

x esc 2 V3x 

sin 2x 
7 T — x' 


(a) Find dx/dt, the velocity of the piston, in terms of 
dO/dt , the angular velocity of the crankshaft. Hint: 
Use the law of cosines. 

(b) If the angular velocity of the crankshaft is denoted 
by the customary symbol o>, show that the speed 
of the piston is co • OR, where R is the point in 
which the line PQ intersects the line through O 
perpendicular to OP. 

*23 A given fixed circle has radius a. A second circle has 
its center on the given circle, and the arc of the sec¬ 
ond circle that lies inside the given circle has length 
s. Show that s has its largest value when a suitable an¬ 
gle 0 satisfies the equation cot 6=6. 

24 A heavy block of weight W is to be dragged along a 
flat table by a force F whose line of action is inclined 
at an angle 0 to the line of motion, as shown in Fig. 
9.34. The motion is resisted by a frictional force fxN 
which is proportional to the normal force N = W — F • 
sin 6 with which the block presses perpendicularly 
against the surface of the table (/jl is a constant called 
the coefficient of friction). The block moves when the 
forward component of F equals the frictional resis¬ 
tance, i.e., when F cos 6 = /jl(W - F sin 0). Find the 
direction and magnitude of the smallest force F that 
will move the block. 



In each of Problems 25-36, find the value of the indicated 
limit. 


25 


* 2 


26 


lim 


sin x 
2x 


SECTION 9.3 

Evaluate the indefinite integrals in 

Problems 37-54. 

37 

/ cos 3x dx. 


38 

S sin (lx + 1) dx. 


39 

/ cos (1 — \x) dx. 


40 

S cos 2 lx sin lx dx. 


41 

/ sin 5 3^ cos 3x dx. 


42 

/ cos 2 jx sin jx dx. 


43 

f(2 — cos 2 3*) sin 3x dx. 


44 

S3 sin x sin 2x dx. 


45 

fx 2 cos x 3 dx. 


46 

S \Tx sin * 3/2 dx. 


47 

S sin (cos 2x) sin 2x dx. 


48 

S Vcos 2x sin 2x dx. 


49 

f cos 4x dx 


J sin 2 4x 


50 

f sin x dx 

J cos 5 X 


51 

f sin x dx 


J (3+2 cos x) 2 ‘ 


52 

f Vl + sin 2x cos 2x dx. 


53 

r cos 5x dx 

J V7 - sin 5* 


54 

/(I + 4 sin 8a:) 7 cos 8* dx. 


Evaluate the definite integrals in Problems 55-58. 

55 

f 71/14 

cos lx dx. 56 

Jo 

r 7 t/i 8 

J sin 6x dx. 

57 

rn/6 sin 2x dx 

Jo cos 2 2x ' 58 

fW 

10 x 4 sin x 5 dx. 
Jo 


59 Find the area bounded by y = sin x and y = cos x be¬ 
tween the first two positive values of x at which these 
curves intersect. 

60 Find the area bounded by y = 1 - cos 2x and y = 
cos x — 1 between * = 0 and x = 2i t. 

61 Find the area bounded by y = 4 - 3 sin 2x and y = 
2 cos 5x — 3 between x = 0 and x = 3 tt. 

62 Show that if m and n are positive integers, then 


sin nix sin nx dx = 


if m =£ n, 
if m = n, 


7 T 
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cos mx cos nx dx = 


0 

77 


sin mx cos nx dx = 0. 


if m i= n, 
if m = n. 


Hint: See Problem 20 in Section 9.1. (These facts are 
very important in the theory of Fourier series, which 
is one of the most useful parts of advanced mathe¬ 
matics from the point of view of applications to sci¬ 
ence.) 

*63 In this problem we ask students to establish the for¬ 
mula 


Cb 

sin x dx = cos a - cos b (*) 

Ja 

directly from the limit definition of the integral, with¬ 
out making any use of the Fundamental Theorem of 
Calculus. 

(a) Show that 

sin x 4- sin 2jc + • • • + sin nx 

cos yjc - cos (n + y)jc 

A I 

2 sin jx 

Hint: Write down the identity 2 sin 6 sin c/> = 
cos (6 - (f)) - cos (6 + </>) for the n cases in which 
the pair ( 6, 4>) is taken to be (jc, yjc), (2 jc, yjc), 
. . . , (nx, y*), and add. 

(b) For b > 0, the limit definition of the integral gives 

( h ■ j r V ( • kb \ b 

sin xdx = hm > sin — ■ — 

Jo \ n ) n 


b s-' . kb 
= lim — > sin —. 
n->” n £rj n 

Use part (a) with x = bln to show that the value 
of this limit is 1 - cos b. 

(c) Use simple area arguments to show that the result 
in part (b) is also valid for the cases b = 0 and b < 
0 . 

(d) Use parts (b) and (c) to establish (*). 

*64 Establish the formula 


fh 

cos x dx = sin b — sin a (**) 

Ja 

by a line of reasoning similar to that in Problem 63. 
65 Give another proof of formula (**) in Problem 64 by 
using the following reasoning: If the graph of y = 
cos jc is moved a distance 7t/2 to the right, it is trans¬ 
lated into the graph of y = sin jc; the integral in (**), 
which represents the area between the curve y = 
cos jc and the jc-axis from jc = a to jc = b, can there¬ 


fore be written as another integral representing the area 
between the curve y = sin jc and the jc-axis from jc = 
a + 77/2 to jc = b + 77/2. 


SECTION 9.4 

In each of Problems 66-79, calculate dy/dx. 


66 

3 

1 = cot (2 — 5jc). 

67 

y = 4 tan 3jc. 

68 

3 

; = j sec 4 jc. 

69 

y = Vcot 2jc. 

70 

3 

1 = esc (1 - 2jc). 

71 

y = sec 4 jc — tan 4 jc. 

72 

3 

; = 2jc + tan 2jc. 

73 

y = cot 2 5jc. 

74 

y = sec 3 x. 

75 

1 

y = jc tan —. 

JC 

76 

3 

1 = cot (In jc). 

77 

y = Vsec Vjc. 

78 

y = esc 3 jc + esc jc 3 . 

79 

y = tan (tan jc). 

Evaluate the integral in each of Problems 80-87. 

80 

1 

f dx 

81 

J CSC yJC cot jjc dx. 

J 

cos 2 5jc ' 

82 

J 

ftr/4 

esc 2 jc cot jc dx. 

7T/6 

83 

f esc 2 3jc dx. 

84 

/( 2 + 5 tan jc) 7 sec 2 jc dx. 

85 

f esc 4 jc cot jc dx. 

86 

J Vcot jc esc 2 jc dx. 

87 

f cot 3 jc esc 2 jc dx. 


88 The region under the curve y = sec jc between jc = 0 
and jc = 77/4 is revolved about the jc-axis. Find the vol¬ 
ume of the solid of revolution generated in this way. 

89 Solve Problem 88 for the curve y = sec 2 jc. 

90 Sketch the graph of the function y = y tan 2jc + 
cot 2 jc on the interval 0 < jc < 7r/4 and find its mini¬ 
mum value. 

91 A racing car is moving around a circular track at a 
constant speed of 100 km/h. There is a bright light at 
the center of the track and a straight fence tangent to 
the track at a point T. How fast is the shadow of the 
car moving along the fence when the car is y lap be¬ 
yond T1 

*92 In Problem 18 of Section 4.4 students were asked to 
show that the volume of the smallest cone that can be 
circumscribed about a given sphere of radius a is ex¬ 
actly twice the volume of the sphere. Solve this prob¬ 
lem by trigonometric methods, by taking the generat¬ 
ing angle of a circumscribed cone (half the vertex 
angle) as the independent variable. 


SECTION 9.5 

93 Evaluate each of the following: 

(a) tan -1 ( —V3); (b) sin -1 yV3; 

(c) 4 sin -1 (-yV2); (d) sin (sin 1 0.7); 

(e) sin -1 (sin 0.7); (f) tan -1 (tan [-1]); 

(g) sin -1 (cos 77/6). 

94 If the base b and area A of a triangle are fixed, use 
geometry alone to find the base angles if the angle op¬ 
posite the base has its largest value. 
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Find dy/dx in each of Problems 95-103. 

95 y = sin -1 yx. 

96 y = \ tan -1 yx. 

97 y = y tan -1 x 5 . 

98 y = Vx - t an" 1 V x. 

99 y = tan -1 Vx 2 - 1. 

100 y = -sin -1 —. 

x 

101 y = tan -1 x + In V1 + x 2 . 

102 y = a sin -1 — + V a 2 — x 2 . 

a 

103 y = Vx 2 — 1 - tan -1 Vx 2 — 1. 


Evaluate the integrals in Problems 104-112. 


104 J 

fV3 dx 

105 | 

(1/2)V3 dx 

0 1 + x 2 ' 

i 

to 

< 

1 

106 J 

r dx 

107 { 

dx 

< 

1 

ON 

*U 

1 + 5*2' 

108 J 

r> dx 

109 j 

dx 

uV ~ 3 *V4;c 2 - 1 

V25 - 4* 2 

110 J 

f dx 

M, / 

y?dx 

' 49 + 36JC 2 ' 

i + x 8 ' 

1,2 j 

f 15 x* dx 



O 

1 

> 



113 A billboard is perpendicular to 

a straight road, and its 


nearest edge is 18 ft from the road. The billboard is 
54 ft wide. As a motorist approaches the billboard 
along the road, at what point does he see the billboard 
in the widest angle? 

114 An airplane at an altitude of 7 mi and a speed of 500 
mi/h is flying directly away from an observer on the 
ground. What is the rate of change of the angle of el¬ 
evation when the airplane is over a point 4 mi away 
from the observer? 

115 A woman is walking along a sidewalk at the rate of 6 
ft/s. A police car spotlight 30 ft from the sidewalk fol¬ 
lows her as she walks. At what rate is the spotlight 
turning when the woman is 40 ft past the point on the 
sidewalk nearest the light? 


SECTION 9.6 

116 With reference to Example 1, recall the definitions of 
kinetic and potential energy given in Section 7.7. 

(a) Show that the potential energy V of the cart is y&x 2 , 
where it is understood that V = 0 when x = 0. 

(b) Show directly from Newton’s second law of mo¬ 
tion 


that the sum of the kinetic and potential energies 
of the cart is constant. Hint: Use equation (9) in 
Section 9.6. 

(c) Express the total energy E of the cart in terms of 
its initial position x 0 and initial velocity v$. 

(d) Express the total energy E of the cart in terms of 
the amplitude A and frequency/of the vibration. 

117 A block of wood 6 in on an edge and weighing 4 lb 
floats upright in water. If the block is depressed 
slightly and released, find its period of oscillation as¬ 
suming that the friction of the water is negligible. Hint: 
Use w = 62.5 lb/ft 3 for the density of water. 

118 A body in simple harmonic motion has amplitude A 
and period T. Find its maximum velocity. 

119 Find the amplitude and frequency of the simple har¬ 
monic motion x = 3 sin 2t + 4 cos 2 1. 

120 If the period of a simple harmonic motion is 27 t/ 3, find 
a position function of the form (3) that satisfies the 
conditions x = 1 and v — dx/dt = 3 when t = 0. 

*121 Let the pendulum in Example 3 be pulled to one side 
through an angle a and released. Use the principle of 
conservation of energy to show that the period T of 
oscillation is given by the formula 

T „ A JZ\' - , 

* 0 Vcos 6 - cos a 
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10.1 

INTRODUCTION. THE 
BASIC FORMULAS 


METHODS OF 
INTEGRATION 


If we start with the constants and the seven familiar functions Jt, e x , In jc, sin jc, 
cos x, sin -1 jc, and tan" 1 jc, and go on to build all possible finite combinations 
of these by applying the algebraic operations and the process of forming a func¬ 
tion of a function, then we generate the class of elementary functions. Thus, 

. [ tan' 1 (x 2 + 35a: 3 ) 

In - 

- e x + sin Vr+1 - 

is an elementary function. These functions are often said to have closed form , 
because they can be written down in explicit formulas involving only a finite 
number of familiar functions. 

It is clear that the problem of calculating the derivative of an elementary func¬ 
tion can always be solved by a systematic application of the rules developed in 
the preceding chapters, and this derivative is always an elementary function. How¬ 
ever, the inverse problem of integration—which in general is much more im¬ 
portant— is very different and has no such clear-cut solution. 

As we know, the problem of calculating the indefinite integral of a function 

/(*), 

J f(x) dx = F(x), (1) 

is equivalent to finding a function F(x) such that 

-£-F(x)=f(x). (2) 

It is true that we have succeeded in integrating a good many elementary func¬ 
tions by inverting differentiation formulas. But this doesn’t carry us very far, be¬ 
cause it amounts to little more than calculating the integral (1) by knowing the 
answer (2) in advance. 

The fact of the matter is this: There does not exist any systematic procedure 
that can always be applied to any elementary function and leads step by step to 
a guaranteed answer in closed form. Indeed, there may not even be an answer. 
For example, the function/( jc) = e~ xl looks simple enough, but its integral 

J e~ xl dx (3) 
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cannot be calculated within the class of elementary functions. This assertion is 
more than merely a report on the present inability of mathematicians to integrate 
(3); it is a statement of a deep theorem, to the effect that no elementary function 
exists whose derivative is e~ x2 .* 

If all this sounds discouraging, it shouldn’t be. There is much more that can 
be done in the way of integration than we have suggested so far, and it is very 
important for students to acquire a certain amount of technical skill in carrying 
out integrations whenever they are possible. The fact that integration must be 
considered as more of an art than a systematic process really makes it more in¬ 
teresting than differentiation. It is more like solving puzzles, because there is less 
certainty and more scope for individual ingenuity. Many students find this an 
agreeable change from the cut-and-dried routines that make some parts of math¬ 
ematics rather dull. 

Since integration is differentiation read backwards, our starting point must be 
a short table of standard types of integrals obtained by inverting differentiation 
formulas as we have done in the previous chapters. Much more extensive tables 
than the one given below are available in libraries, and with the aid of these ta¬ 
bles most of the problems in this chapter can be solved by merely looking them 
up. However, students should realize that if they follow such a course they will 
defeat the intended purpose of developing their own skills. For this reason we 
make no use of integral tables beyond the short list of 15 formulas given below. 
Instead, we urge students to concentrate their efforts on gaining a clear under¬ 
standing of the various methods of integration and learning how to apply them. 

In addition to the method of substitution, which is already familiar to us, there 
are three principal methods of integration to be studied in this chapter: reduction 
to trigonometric integrals, decomposition into partial fractions, and integration 
by parts. These methods enable us to transform a given integral in many ways. 
The object of these transformations is always to break up the given integral into 
a sum of simpler parts that can be integrated at once by means of familiar for¬ 
mulas. Students should therefore be certain that they have thoroughly memorized 
all the following basic formulas. These formulas should be so well learned that 
when one of them is needed it pops into the mind almost involuntarily, like the 
name of a friend. 

f u n+l 

1 u n du =-- + c (n =£ — 1). 

J n + 1 



2 


*Let there be no misunderstanding. The indefinite integral (3) does exist, because the function F(;c) 
defined by 



is a perfectly respectable function with the property that 


[See equations (12) and (13) in Section 6.7.] The difficulty is that it can be proved that there is no 
way of expressing F(x ) as an elementary function. Some of the facts in this interesting part of cal¬ 
culus are described in Appendix A.9. 
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3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 

i 


II 

+ c. 


cos u du = 

sin u 

+ c. 

sin u du = 

-cos 

u + c. 

sec 2 u du = 

: tan u 

+ c. 

esc 2 u du = 

: —cot 

: u + c. 

sec u tan u 

du = 

sec u + 

CSC u cot u 

du = 

—esc u 

du 


i w 

Va 2 - u 2 

= sin 

_1 — + , 


a 

du 

1 



— tan 

_l — + 

a 2 + u 2 

a 

a 


tan u du — —In (cos u) + c. 

cot u du = In (sin u) + c. 

sec u du = In (sec u + tan u) + c. 

esc u du = —In (esc u + cot u) + c. 


The last four formulas are new, and complete our list of the integrals of the six 
trigonometric functions. Formulas 12 and 13 can be found by a straightforward 
process: 

f , f sin u du f d( cos u) . . 

tan u du = -= — -= — In (cos u) + c 

j J COS U j cos u 

and 

f , f cos u du f d( sin u) . , . 

cot u du — I -:-= —:-= In (sin u) + c. 

J j sin u j sin u 


Many people find that the easiest way to remember these two formulas is to think 
of the process by which they are obtained. Formula 14 can be found by an in¬ 
genious trick: If we multiply the integrand by 1 = (sec u + tan w)/(sec u -F tan w), 
then we obtain 



/ 


sec u du — 


i 

i 


(sec u + tan u) sec u du _ f (sec 2 u + sec u tan u) du 
sec u + tan u J sec u -h tan u 


d( sec u + tan u) 
sec u + tan u 


= In (sec u + tan u) + c. 


A similar trick yields formula 15. 

We repeat: These 15 formulas constitute the foundation on which we build 
throughout this chapter, and they must be at our fingertips. Take 20 or 30 min¬ 
utes to memorize them. And then tomorrow, when they have been partially for¬ 
gotten, memorize them again. And so on. The effort will be well rewarded. 
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In the method of substitution we introduce the auxiliary variable u as a new sym¬ 
bol for part of the integrand in the hope that its differential du will account for 
some other part and thereby reduce the complete integral to an easily recogniz¬ 
able form. Success in the use of this method depends on choosing a fruitful sub¬ 
stitution, and this in turn depends on the ability to see at a glance that part of the 
integrand is the derivative of some other part. 

We give several examples to help students review the procedure and make cer¬ 
tain that they fully understand it. 


10.2 

THE METHOD OF 
SUBSTITUTION 


Example 1 Find fxe x2 dx. 

Solution If we put u = —x 2 , then du = —2x dx , x dx — du , and therefore 

j xe~ x2 dx = -y J e u du = —\e u = —\e~ xl + c. 

It will be noticed that we insert the constant of integration only in the last step. 
Strictly speaking, this is incorrect; but we willingly commit this minor error in 
order to avoid cluttering up the previous steps with repeated c' s. We also point 
out that this integral is easy to calculate even though the similar integral / e~ xl dx 
is impossible. The reason for this is clearly the presence of the factor jc, which 
is essentially (that is, up to a constant factor) the derivative of the exponent — x 2 . 


Example 2 Find 


cos x dx 
Vl + sin jc 


Solution Here we notice that cos jc dx is the differential of sin jc, and also of 
1 + sin x. Thus, if we put u = \ + sin x, then du = cos jc dx and 


f cos jc dx i 

r du f 

' V1 4- sin jc J 

< 
R 1 

1 


j. i/z _ 

= - = 2vu = 2V 1 + sin jc + c. 

1 

2 


Example 3 Find 


i 


dx 

jc In jc' 


Solution The fact that dxJx is the differential of In jc suggests the substitution 
u = In jc, so du — dx/: c and 

f —-j—— = f — = In u = In (In jc) + c. 

J x m jc J u 


Example 4 Find 


i 


dx 


V9 - 4jc 2 
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Solution Since 4x 2 = (2x ) 2 we put u = 2x, so that du — 2dx, dx — \ du , and 


V9 - 4jc 2 



Example 5 Find 


;c 


V9 - 4x 2 


Solution Here the fact that the jc in the numerator is essentially the derivative 
of the expression 9 — 4x 2 inside the radical suggests the substitution u — 9 — 
4x 2 . Then du — -&x dx, and 




In any particular integration problem the choice of the substitution is a matter 
of trial and error guided by experience. If our first substitution doesn’t work, we 
should feel no hesitation about discarding it and trying another. Example 5 is 
similar in appearance to Example 4 and it might be thought that the same sub¬ 
stitution will work again, but in fact—as we have seen—it requires an entirely 
different substitution. 

We remind students of the summary of the method of substitution given at the 
end of Section 5.3. Also, we repeat the justification of the method given there 
because we now wish to extend this method to cover the case of definite inte¬ 
grals as well. 

We start with a complicated integral of the form 



(1) 


If we put u = g(x\ then du = g'(x) dx and the integral takes the new form 



If we can integrate this, so that 



( 2 ) 


then since u = g(x) we ought to be able to integrate (1) by writing 


/ f[g(x)]g’(x) dx = F[g(x)] + c. 


(3) 


All that is needed to justify our procedure is to notice that (3) is a correct result, 
because 
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dx 


F[g(x)] = F'[g(x)]g\x) =f[g(x)]g'(x) 


by the chain rule. 

The method of substitution applies to definite integrals as well as indefinite 
integrals. The crucial requirement is that the limits of integration must be suit¬ 
ably changed when the substitution is made. This can be expressed as follows: 

l f[g(x)]g'(x) dx = £ f(u) du, 

where c = g(a) and d = g(b). The proof uses (2) and (3) and two applications 
of the Fundamental Theorem of Calculus, 

I" f[g{x)\g\x) dx = F[g(b)) ~ F[g(a)] 

Ja 

= F(d) ~ F(c) = />) du 

Thus, once the original integral is changed into a simpler integral in the variable 
u, the numerical evaluation can be carried out entirely in terms of u, provided 
the limits of integration are also correctly changed. 

Example 6 Compute 

f sin x dx 
Jo cos 2 x 



Solution We put u = cos x, so that du = — sin jc dx. Observe that u = 1 when 
x = 0 and u = \ when x = 7t/3. By changing both the variable of integration 
and the limits of integration we obtain 


f*r/3 sin x dx _ p/2 —du 
'o COS 2 X J 1 u 2 



1 = 1 . 


This technique removes the necessity of returning to the original variable in or¬ 
der to make the final numerical evaluation. 


PROBLEMS 


Find the following integrals. 


1 | V3 - 2x dx. 2 

f In x dx 
3 J *[1 + (In x) 2 ] ‘ 

5 | sin 2x dx. 6 

7 | cot (3 jc — 1) dx. 8 

9 j xS/x 2 + 1 dx. 10 


/ 

/ 

/ 

/ 

/ 


2x dx 

( 4x 2 - l) 2 ’ 
cos x e smx dx. 
x dx 

Vl6 — x 4 

sin x cos x dx. 
dx 

x + 2‘ 


11 

| e 5x dx. 

12 

J x cos x 2 dx. 

13 

j esc 2 (3x + 2) dx. 

14 

f dx 

J x 2 + 16* 

15 

p dx 

•’“ 3 V3 - 2x 

16 

| (x 3 + l) 2 dx. 

17 

f sin x dx 

18 

f (2x + 1) dx 

V 1 — COS X 

J x 2 + x + 2' 

19 

r x 

20 

j sin ^ dx. 

1 \T X 
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21 J 

f sec 5jc tan 5jc dx. 

22 j 

[ — 

xVTr 

23 J 

f In jc dx 

24 J 

f sin x dx 

' ^ • 

' COS 2 JC 

25 J 

f tt/2 cos JC dx 

o 1 -h sin jc' 

26 J 

[ cos 3jc dx. 

27 J 

f e x dx 

28 J 

f dx 

'a 

1 

> 

' cos 2x 

29 J 

f sin 2 jc cos jc dx. 

30 j 

^ tan 2 yjc sec 2 yjc dx. 

3. j 

f e x dx 

1 + e x ' 

32 J 

f cos (In jc) dx 

X 

33 J 

f tan 3jc dx. 

34 j 

f sec 2 jc dx 

Vl + tan jc 

35 j 

f 4jc dx 

36 J 

f e^~ x dx 

1 + i ‘ 

Vx ' 

37 j 

f e x dx 

38 J 

f sin -1 x dx 

1 1 +e 2 *' 

Vl - j: 2 

39 

[ (e x + 1 fe x dx. 

40 j 

\ 6x 2 e~ x ’ dx. 

41 

f sec 2 5,v dx. 

42 J 

^ cot dx dx. 

43 

f esc 2x cot 2x dx. 

44 J 

P 2x dx 

2 x 2 — 3’ 


Compute each of the following definite integrals by making 
a suitable substitution and changing the limits of integration. 


45 

f 2 (2x + 1) dx 

1 Vx 2 + x+2 

46 J 

r 77-/4 

tan 2 jc sec 2 jc dx. 

0 

47 

( e Vln x dx 

J 1 JC 

48 J 

r tt'3 

1 sec J jc tan jc dx. 

49 

Each of the following integrals 

is easy to compute for a 


particular value of n. Find this value and carry out the 


integration. For example, Jjc” 

sin jc 2 dx is easily com- 


puted for n = 1: 




x sin x 2 dx = —y cos x 2 + c. 

(a) fx n e* 4 dx. (b) Jx n cos jc 3 dx. 

(c) Jx n In x dx. (d) fx n sec 2 Vjc dx. 

50 The derivation given in the text for formula 14 is some¬ 
what tainted by rabbit-out-of-the-hat trickery. Derive this 
formula in a more reasonable way by using 

du _ f cos u du _ f cos u du 
cos u. J cos 2 u J 1 — sin 2 u 

to write the given integral as an integral of the form 
Jdu/(\ — u 2 ), and then use 

_L_ = I/_^ + _L_ 

1 — U 2 2 \1 + M 1 — u 

51 Give a similar derivation for formula 15. 


J sec u du = J 


10.3 

CERTAIN 

TRIGONOMETRIC 

INTEGRALS 


In the next two sections we discuss several methods for reducing a given inte¬ 
gral to one involving trigonometric functions. It will therefore be useful to in¬ 
crease our ability to calculate such trigonometric integrals. 

A power of a trigonometric function multiplied by its differential is easy to in¬ 
tegrate. Thus, 

sin 3 x cos xdx = sin 3 jc d( sin jc) = \ sin 4 jc + c 


and 


J tan 2 jc sec 2 jc dx = j* tan 2 jc d( tan jc) = y tan 3 jc + c. 


Other trigonometric integrals can often be reduced to problems of this type by 
using appropriate trigonometric identities. 

We begin by considering integrals of the form 

sin m jc cos” jc dx, 


0 ) 
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where one of the exponents is an odd positive integer. If n is odd, we factor out 
cos jc dx, which is d {sin jc); and since an even power of cos jc remains, we can 
use the identity cos 2 x = \ — sin 2 x to express the remaining part of the inte¬ 
grand entirely in terms of sin jc. And if m is odd, we factor out sin jc dx, which 
is —d {cos x ), and use the identity sin 2 jc = 1 — cos 2 jc in a similar way. The fol¬ 
lowing two examples illustrate the procedure. 


Example 1 


J sin 2 jc cos 3 x dx = J sin 2 jc cos 2 jc cos jc dx 

= J sin 2 ;t(l - sin 2 x) d (sin x) 
= J (sin 2 jc - sin 4 jc) d(sin jc) 


= j sin 3 jc — j sin 5 jc + c. 


Example 2 


| sin 3 jc dx = J sin 2 jc sin jc dx 


= -J (1 - cos 2 jc) d(c os jc) 

= —COS JC + y COS 3 JC -he. 


If one of the exponents in (1) is an odd positive integer that is quite large, it 
may be necessary to use the binomial theorem, and in such a case an explicit use 
of the method of substitution may be desirable for the sake of clarity. For in¬ 
stance, every odd positive power of cos jc, whether large or small, has the form 

cos 2 " +1 jc = cos 2 " jc cos jc = (cos 2 jc)" cos jc = (1 — sin 2 jc)" cos jc, 

where n is a nonnegative integer. If we put u = sin jc and du = cos jc dx, then 

J cos 2n+1 jc dx - J (1 — sin 2 jc)" cos jc dx 
= J (1 — u 2 ) n du. 

If necessary, the expression (1 - u 2 ) n can now be expanded by applying the bi¬ 
nomial theorem, and the resulting polynomial in u is easy to integrate term by 
term. 

If both exponents in (1) are nonnegative even integers, then it is necessary to 
change the form of the integrand by using the half-angle formulas 

cos 2 6 = j(l + cos 26) and sin 2 6 = {(1 — cos 26). (2) 

We hope students have thoroughly memorized these important formulas, but if 
they are forgotten they can easily be recovered by adding and subtracting the 
identities 
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cos 2 0 + sin 2 0=1, 
cos 2 0 - sin 2 0 = cos 20. 

The uses of (2) are shown in the following examples. 

Example 3 The half-angle formula for the cosine enables us to write 
J cos 2 x dx = j J (1 + cos 2x) dx = jJdx + jJ cos 2x dx 

= l x + i J cos 2x d(2x) — \x + \ sin 2x + c. 

If we wish to express this result in terms of the variable x (instead of 2x), we use 
the double-angle formula sin 2x = 2 sin x cos x and write 

cos 2 x dx = {x -1- { sin x cos x -1- c. 


Example 4 Two successive applications of the half-angle formula for the cosine 
give 

cos 4 x = (cos 2 x) 1 = 1(1 -1- cos 2x) 2 =|(1 + 2 cos 2* + cos 2 2x) 

= j[l -I- 2 cos 2x -I- {(1 -I- cos 4x)] 

= f + 7 cos 2x + j cos 4x, 


so 


As these examples show, the value of the half-angle formulas (2) for this work 
lies in the fact that they allow us to reduce the exponent by a factor of \ at the 
expense of multiplying the angle by 2, which is a considerable advantage pur¬ 
chased at very low cost. 


Example 5 By using both of the half-angle formulas we get 

1 - cos 2x 1 + cos 2x 


J sin 2 x cos 2 x dx = J - 


■ dx 


= j f (1 - cos 2 2x)dx = }J[l-±(l + cos 4x)] dx 

= j J dx - } J cos 4x dx = }x - sin 4x + c. 

We can also find this integral by combining the results of Examples 3 and 4: 
J sin 2 x cos 2 x dx = J (l - cos 2 x) cos 2 x dx 

= J cos 2 x dx - J cos 4 x dx 


2 A Bpi 4 am 8 a 4 

= j x ~ jf sin 4x + c. 
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We next consider integrals of the form 

J tan™ x sec" x dx, 

where n is an even positive integer or m is an odd positive integer. Our work is 
based on the fact that d( tan x) = sec 2 x dx and d( sec x) = sec x tan x dx, and we 
exploit the identity tan 2 x + 1 = sec 2 An example illustrating each case will 
be enough to show the general method. 

Example 6 

J tan 4 x sec 6 xdx = J tan 4 x sec 4 x sec 2 * dx 

= j tan 4 * (tan 2 x + l) 2 d( tan x) 

= J tan 4 * (tan 4 x + 2 tan 2 x + 1) d( tan x) 

= J (tan 8 x + 2 tan 6 x + tan 4 x) d( tan x) 

= j tan 9 x + y tan 7 x + j tan 5 x + c. 


Example 7 


tan 3 x sec 5 x dx = J tan 2 x sec 4 * sec x tan x dx 

= J (sec 2 * - 1) sec 4 * d( sec x) 

= J (sec 6 * - sec 4 x) d( sec x) 

= j sec 7 x — j sec 5 x + c. 


In essentially the same way we can handle integrals of the form 

cot™ x esc" x dx , 

where n is an even positive integer or m is an odd positive integer. Our tools in 
these cases are the formulas d( cot x) = -esc 2 x dx and d( esc x) = — esc x cot x • 
dx, and when necessary we use the identity 1 + cot 2 x = esc 2 x. 

Another approach to trigonometric integrals that is sometimes useful is to ex¬ 
press each function occurring in the integral in terms of sines and cosines alone. 

Example 8 We already know from our work with derivatives that 

sec x tan * dx = sec x 4- c. 

However, this formula can also be obtained directly, by writing 

. f 1 sin a: , f sin * dx 

sec * tan * dx = - dx = 

J cos x cos x 


COS 2 X 






344 


METHODS OF INTEGRATION 


If we now put u — cos jc and du = — sin jc dx, then we get 


sec x tan x dx = 


-i 


-i 


sin x dx 
cos 2 JC 

-du 




COS JC 


= sec jc + c. 


PROBLEMS 


Find each of the following integrals. 


1 

J sin 2 jc dx. 

2 

J sin 4 jc dx. 

3 

f cos 6 jc dx. 

4 

f cos 2 3jc dx. 

5 

J sin 3 jc cos 2 jc dx. 

6 

J sin 2 jc cos 5 jc dx. 

7 

J cos 3 jc dx. 

8 

(IT 12 

sin 3 jc cos 3 jc dx. 
Jo 

9 

J Vsin jc cos 3 jc dx. 

10 

J sin 3 5jc cos 5jc dx. 

11 

J sin 2 3jc cos 2 3jc dx. 

12 

f dx 

J sin jc cos jc' 

13 

f tt/4 

sec 4 jc dx. 

Jo 

14 

f dx 

J COS 2 JC' 

15 

f tan 5 jc sec 3 jc dx. 

16 

/ esc 4 jc dx. 

17 

f cot 2 jc dx. 

18 

f cot 3 jc dx. 

19 

f dx 

J sin 2 4jc ’ 

20 

J cot 2 5jc esc 4 5jc dx. 

21 

f 1 + cos 2jc , 

•20 

J sin z 2 jc 

22 

j tan 2 jc cos jc dx. 


23 / sin 3jc cot 3jc dx. 

24 Find J tan jc dx (which we already know) by the method 
of Example 7. 


25 Use the identity tan 2 jc = sec 2 jc — 1 to find 

(a) J tan 2 x dx, J tan 4 jc dx, J tan 6 jc dx, 

(b) J tan 3 jc dx, J tan 5 jc dx, J tan 7 jc dx. 

26 If n is any positive integer > 2, show that 

f „ . tan” -1 jc f _2 j 

tan' 2 jc dx =---tan” z x dx. 

J n — 1 J 

This is called a reduction formula, because it reduces the 
problem of integrating tan” jc to the problem of integrat¬ 
ing tan” -2 jc. 

27 Find the volume of the solid of revolution generated 
when the indicated region under each of the following 
curves is revolved about the jc-axis: 

(a) y = sin jc, 0 < jc < 77; 

(b) y = sec jc, 0 < jc < 77/4; 

(c) y = tan 2 jc, 0 < jc < 77/8; 

(d) y = cos 2 jc, 77/2 < jc < 77. 

28 Find the length of the curve y = In (cos jc) between 
jc = 0 and jc = 77 / 4 . 

29 Find / sec 3 jc dx by exploiting the observation that sec 3 jc 
will clearly appear in the derivative of sec jc tan jc. 

30 Find J esc 3 jc dx by adapting the idea suggested for Prob¬ 
lem 29. 


10.4 

TRIGONOMETRIC 

SUBSTITUTIONS 


An integral involving one of the radical expressions Va 2 — jc 2 , Va 2 + jc 2 , 
Vjc 2 — a 2 (where a is a positive constant) can often be transformed into a fa¬ 
miliar trigonometric integral by using a suitable trigonometric substitution or 
change of variable. 

There are three cases, which depend on the trigonometric identities 


1 — sin 2 

6 = cos 2 6, 

(i) 

1 + tan 2 

6 = sec 2 6, 

(2) 

sec 2 6 — 

1 = tan 2 6. 

(3) 


If the given integral involves Va 2 — jc 2 , then changing the variable from jc to 6 
by writing 

jc = a sin 0 replaces V<z 2 - jc 2 by 


a cos 6, 


(4) 
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because a 2 — x 2 = a 2 — a 2 sin 2 0 = cz 2 ( 1 — sin 2 0) = a 2 cos 2 0. Similarly, if the 
given integral involves Vcz 2 + x 2 , then by identity (2) we see that the substitu¬ 
tion 

jc = cz tan 0 


replaces 


VcZ 2 -I- JC 2 


by 


a sec 0, 


(5) 


because a 2 + x 2 = a 2 + a 2 tan 2 0 = cz 2 ( 1 + tan 2 0) = a 2 sec 2 0; and if it in¬ 
volves Vjc 2 — a 2 , then by identity (3) the substitution 


jc = cz sec 0 replaces Vjc 2 — cz 2 by cz tan 0, 


( 6 ) 


because jc 2 — a 


2 - „2 


these procedures as follows. 
Example 1 Find 


V^Vc 2 


c£c. 



sec 2 6 - a 2 = cz 2 (sec 2 0 - 1) = a 2 tan 2 0. We illustrate 


Solution This integral is of the first type, so we write 
jc = a sin 0, 


dx = a cos 6 d6 , 


Then 


Vc7 2 — JC 2 


c£c 


-/ 


(2 COS 0 
a sin 0 

f 1 - sin 2 0 


cos 6 d6 = a J 
d6 


V< 2 2 — x 2 = a cos 0. 

cos 2 0 


•c/0 


sin 0 

= C71 (esc 0 - sin 0) c/0 


J sin 0 
= — C7 In (esc 0 + cot 0) + C7 cos 0. 


(7) 


This completes the integration, and we now must write the answer in terms of 
the original variable jc. We do this quickly and easily by drawing a right triangle 
(Fig. 10.1) whose sides are labeled in the simplest way that is consistent with the 
equation jc = a sin 0 or sin 0 = x/a. This figure tells us at once that 


esc 0 = —, cot 0 = 

JC 




and 


cos 0 = 


Va 2 


so from (7) we have 

t 


va 2 —. 


■ dx ■ 


Vo 2 "" 


0 { a + Vcz 2 — jc 2 \ 

jc 2 - a In - + c. 


Example 2 Find 


dx 


Vcz 2 + X 2 

Solution Here we have an integral of the second type, so we write 


= a tan 0, dx = a sec 2 0 c/0, Vcz 2 4- x 2 = cz sec 0. 


This yields 


c£c 


Vcz 2 + JC 2 


f cz sec 2 0 c/0_ r 

J a see 6 J 


sec 0 dO 



Figure 10.1 


= In (sec 0 + tan 0). 


( 8 ) 
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Figure 10.2 


The substitution equation x = a tan 6 or tan 6 = x/a is pictured in Fig. 10.2, and 
from this figure we obtain 


V<2 2 + X 2 x 

sec 6 =- and tan 6 = —. 

a a 

We therefore continue the calculation in (8) by writing 
f dx 


, , Va 2 + X 2 + x\ , 

= In - + c 


1 Vfl 2 + X 2 v a 

= In (Va 2 + * 2 + *) + c. 

Students will notice that since 


/ Vfl 2 + * 2 + Jt\ . f—Z -T 

In (-1 = In (V a 1 + x 2 + x) - In a , 


(9) 

( 10 ) 


the constant -In a has been grouped together with the constant of integration c\ 
and the quantity —In a + c' is then rewritten as c. Usually we don’t bother to 
make notational distinctions between one constant of integration and another, be¬ 
cause all are completely arbitrary; but we do so here in the hope of clarifying 
the transition from (9) to (10). 


Example 3 Find 


V ?-^ 2 


dx. 



\/x 2 - a 2 


Solution This integral is of the third type, so we write 

x = a sec 6, dx = a sec 6 tan 6 d6 , V* 2 - a 2 = a tan 6. 

Then 

f V* 2 - a 2 , f a tan 6 


dx 


-/ 


a sec 6 tan 6 dO 


a sec 6 

= a J tan 2 6 dO = a J (sec 2 6 — 1) d6 
= a tan 6 - a6. 

In this case our substitution equation sec 0 = x/a is portrayed in Fig. 10.3, which 
tells us that 


Vjc 2 - a 2 V * 2 - a 2 

tan 6 = - — and 6 = tan 1 — — 


The desired integral can therefore be written as 


V * 2 - 


. Vx 2 - a 2 

■ dx = v v— - a tan 1 -1- c. 


There is one feature of these calculations that we have not taken into account. 
In (4) we tacitly wrote 
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Vl - sin 2 6 = cos 0 

without checking the correctness of the algebraic sign. This was careless, because 
cos 6 is sometimes negative and sometimes positive. However, the variable 6 , 
which in this case is sin -1 x/a, is restricted to the interval -7t/2 < 0 < 7t/2, and 
on this interval cos 6 is nonnegative, as we assumed. Similar comments apply to 
the substitutions (5) and (6). 


Example 4 Asa concrete illustration of the use of these methods, we determine 
the equation of the tractrix. This famous curve can be defined as follows: It is 
the path of an object dragged along a horizontal plane by a string of constant 
length when the other end of the string moves along a straight line in the plane. 
(The word “tractrix” comes from the Latin tractere , meaning “to drag”) 

Suppose the plane is the xy-plane and the object starts at the point (a, 0) with 
the other end of the string at the origin. If this end moves up the y-axis as shown 
on the left in Fig. 10.4, then the string is always tangent to the curve, and the 
length of the tangent between the y-axis and the point of contact is always equal 
to a. The slope of the tangent is therefore given by the formula 

dy _ Va 2 - x 2 

cLx x 


and by separating the variables and using the result of Example 1, we have 


y 



dx = a In 



— Va 2 - x 2 + c. 


Since y = 0 when x = a, we see that c = 0, so 


y/a 2 - x 2 


Figure 10. 


y 


-a In 



— x 


2 


is the equation of the tractrix, or at least of the part shown in the figure. 

If the end of the string moves down the y-axis, then another part of the curve 
is generated; and if these two parts are revolved about the y-axis, the resulting 
“double-trumpet” surface shown on the right in Fig. 10.4 is called a pseudo¬ 
sphere. In the branch of mathematics concerned with the geometry of curved sur¬ 
faces, the pseudosphere is a model for Lobachevsky’s version of non-Euclidean 
geometry. It is a surface of constant negative curvature, and the sum of the an¬ 
gles of any triangle on the surface is less than 180°. 


Another famous curve whose equation can be determined by these methods of 
integration is the catenary, which is the curve assumed by a flexible chain or ca¬ 
ble hanging between two fixed points. The details are a bit complicated, so we 
give a derivation in Appendix 1 at the end of this chapter for students who have 
chosen to omit the optional Section 9.7. 

The substitution procedures described in this section can be given a general 
justification or proof similar to that provided in Section 10.2. Students who are 
interested in such matters will find the details in Appendix A. 10. 
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PROBLEMS 


Find each of the following integrals. 


| V^7 2 


dx. 


/ 


XT 

dx 


(a 2 + x 2 ) 2 ' 
x 3 dx 


V9-x 2 

dx 


I 


xVa - 2 + x 2 

dx 

Vjc 2 - a 2 


11 f Va 2 +x 2 dx: 

J a 2 - x 2 ' 

Va 2 T7 . 


13 

15 

17 

19 


i 
i 

J jc 2 V<z 2 — x 2 dx. 


Vx 2 ^ 2 

x 2 


10 

12 

14 

16 

18 

20 


x 2 dx 

V4-x 2 ' 

dx 

JC 2 Vfl 2 + x 2 
dx 

x Va 2 — x 2 
dx 

x + JC 3 ‘ 
dx 

xWx 2 ~a 2 ' 

x 3 dx 
a 2 + x 2 ' 
dx 

(< a 2 - x 2 ) 312 ' 
jc 3 Va 2 + x 2 dx. 
dx 

(.x 2 - a 2 ) 3/2 ‘ 

(1 - 4jc 2 ) 3/2 dx. 


The following integrals would normally be found in a differ¬ 
ent way, but this time work them out by using trigonometric 
substitutions. 


» j 


23 


h. 


x dx 

\/4 - x 2 
dx 


22 


24 


x dx 


(a 2 ~ jc 2 ) 3/2 ' 
x dx 


a 2 + x 2 ’ 

*Hint: See Problem 29 in Section 10.3. 


4 + x- 


2 ‘ 


J jcV9 - jc 2 dx. 

26 j 

r dx 

Va 2 - jc 2 

( x dx 

28 j 

f jc dx 

' V9 + x 2 

1 

rr 

> 


29 Use integration to show that the area of a circle of radius 
a israz 2 . 

30 In a circle of radius a, a chord b units from the center 
cuts off a chunk of the circle called a segment. Find a 
formula for the area of this segment. 

31 If the circle (jc — b ) 2 + y 2 = a 2 (0 < a < b ) is revolved 
about the y-axis, the resulting solid of revolution is called 
a torus (see Problem 11 in Section 7.3). Use the shell 
method to find the volume of this torus. 

32 Find the length of the parabola y = x 2 between x = 0 
and x = 1. Hint: Use the result of Problem 29 in Section 
10.3. 

33 Find the length of the curve y — \nx between * = 1 and 

x = V8. 

34 The given region under each of the following curves is 
revolved about the Jt-axis. Find the volume of the solid 
of revolution. 

x 3 ' 2 

(a) y = — -i=^= between x = 0 and x = 4. 

Vjc 2 + 4 

(b) y = 9 \ f between x = 0 and x = L 

x l + 1 

(c) y = V4 — x 2 between x = 1 and x — 2. 

35 The curve \x 2 + y 2 = 1 is an ellipse. Sketch the graph 
and show that its complete length equals the length of 
one cycle of y = sin x. (This integral is a so-called 
elliptic integral, and is known to be impossible to eval¬ 
uate in terms of elementary f unctions. For more details 
see Appendix A.9.) 


10.5 

COMPLETING THE 
SQUARE 


In Section 10.4 we used trigonometric substitutions to calculate integrals con¬ 
taining Va 2 — jc 2 , V a 2 + x 2 and Vjc 2 — a 2 . By the algebraic device of com¬ 
pleting the square, we can extend these methods to integrals involving general 
quadratic polynomials and their square roots, that is, expressions of the form 
ax 2 + bx + c and Va* 2 + bx + c. We remind students that the process of com¬ 
pleting the square is based on the simple fact that 

(jc -I- A) 2 = jc 2 + 2Ax + A 2 ; 


this tells us that the right side is a perfect square (the square of jc + A) because 
its constant term is in the square of half the coefficient of jc. 
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Example 1 Find 


(x + 2) dx 

V3 + 2*-* 2 ' 


Solution Since the coefficient of the term x 2 under the radical is negative, we 
place the terms containing jc in parentheses preceded by a minus sign, leaving 
space for completing the square, 

3 + 2x - x 2 = 3 - Ot 2 - 2x + ) = 4 - (x 2 - 2x + 1) 

= 4 - (x - l) 2 = a 2 - u 2 , 

where u — x — 1 and a — 2. Since x — u + 1, we have dx — du and x + 2 = 
u + 3, and therefore 

f (jc + 2) dx _ f (m + 3) du _ f m + 3 [ 

V3 + 2jt “ JC 2 V<3 2 — w 2 Va 2 — w 2 Va 2 - w 2 

= — V<3 2 — w 2 + 3 sin -1 — 
a 

= — V 3 + 2* — x 2 + 3 sin -1 M + c. 


Example 2 Find 


Jjc 


jc 2 + 2jc + 10’ 


Solution We complete the square on the terms containing jc, and write 
x 2 + 2x + 10 = (;c 2 + 2x + ) + 10 = (;t 2 + 2x + 1) + 9 

= (x + l) 2 + 9 = u 2 + a 2 , 
where u = x + 1 and a = 3. We now have dw = dx or dx — du , so 


d;c 


;t 2 + 2;c + 10 


■h 


du 


+ a 2 


= — tan 1 


■ — tan 1 


x + 1 


+ c. 


Example 3 Find 


1 


x dx 

Vjc 2 - 2x + 5 


Solution We write 

x 2 - 2jc + 5 = (;t 2 - 2a: + ) + 5 = (x 2 - 2x + 1) + 4 

= (x — l) 2 + 4 = u 2 + a 2 , 

where u = x — 1 and a — 2. Then jc — u + 1, dx — du, and we have 


[ xdx - 1 

f (u + \)du _ 

f | , 

f du 


— 

I + 

[ 

1 V? — 2* + 5 1 

V« 2 + a 2 J 

Vm 2 + a 2 J 

Vm 2 + a 2 
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The second integral here is the one considered in Example 2 in Section 10.4, so 
we have 


i 


du 

Vm 2 + a 2 


= In (« + Vn 2 + a 2 ), 


and therefore 


— *- = Vw 2 + a 2 + In (u + Vw 2 + a 2 ) 

V* 2 - 2x + 5 

= Vjc 2 - 2x + 5 + In (jc- — 1 + Vjc 2 - 2x + 5) + c. 


Example 4 Find 


djc 


Vjc 2 - 4x - 5 


Solution Here we have 

jc 2 - 4x - 5 = (x 2 - 4x + ) - 5 = U 2 - 4x + 4) - 9 

= (jc — 2) 2 — 9 = w 2 — a 2 , 

where u = x - 2 and a = 3. By using the result of Problem 9 in Section 10.4 
(or by quickly working out the necessary formula again by putting u = * sec 0) 
we complete the calculation as follows: 

_ dx _ 

Vx 2 - 4* — 5 

= In (x - 2 + Vx 2 -Ax - 5) + c. 


du 


V u 2 — a 2 


= In (u + Vw 2 — a 2 ) 


If an integral involves the square root of a third-, fourth-, or higher-degree 
polynomial, then it can be proved that there does not exist any general method 
for carrying out the integration. A few integrals of this kind are discussed in Ap¬ 
pendix A. 9. 


PROBLEMS 


Calculate the following integrals. 

i f * 


Vzjc-jc 2 

dx 

x 2 + 4jc + 5 ’ 
(jc + 1) dx 
V2x - x 2 ' 
x 2 dx 

V6x-x 2 ' 


dx 


V5 + 4jc - x 2 
dx 

x 2 - x + r 
(x + 3) dx 
V5 + 4jc - x 2 ’ 
(;t — 1) dx 


9 

11 

13 


(jc + 7) 
x 2 + 2x + 5 ‘ 

__ 

Vx 2 — 2x — 8 ' 

dx 

V4x 2 + 4x + 17 

_dx_ 

(;t 2 — 2jc — 3) 3/2 ’ 


10 


12 


14 


16 


dr. 


Vx 2 + 2x - 3 
x + 1 

_ dx _ 

V5 + 3x - lx 2 ' 

(4x + 3) dx 
(x 2 — 2x + 2) 3/2 ‘ 

_ dx _ 

(jc + 2)Vjc 2 + 4x + 3 


7 


Vjc 2 + 4a: + 5 
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We recall that a rational function is a quotient of two polynomials. By taking the 
denominator of such a quotient to be 1, we see that the polynomials themselves 
are included among the rational functions. As we know, the simple rational func¬ 
tions 


2x + 1, 


x 

x 2 + r 


and 


_L_ 

* 2 + 1 


10.6 

THE METHOD OF 
PARTIAL FRACTIONS 


have the following integrals: 

x 2 + , In jc, 

jc 


\ In (x 2 + 1), and tan 1 x. 


Our purpose in this section is to describe a systematic procedure for computing 
the integral of any rational function, and we shall find that this integral can al¬ 
ways be expressed in terms of polynomials, rational functions, logarithms, and 
inverse tangents. The basic idea is to break up a given rational function into a 
sum of simpler fractions (called partial fractions ) which can be integrated by 
methods discussed earlier. 

A rational function is called proper if the degree of the numerator is less than 
the degree of the denominator. Otherwise, it is said to be improper. For example, 

_ x _ x 2 + 2 

(x - 1 ){x + 2) 2 an x(x 2 - 9) 


are proper, while 



and 


2jc 3 - 3jc 2 + 2jc - 4 
jc 2 + 4 


are improper. If we have to integrate an improper rational function, it is essen¬ 
tial to begin by performing long division until we reach a remainder whose de¬ 
gree is less than that of the denominator. We illustrate with the second improper 
rational function just mentioned. Long division yields 


2 jc — 3 

jc 2 + 4| 2jc 3 — 3x 2 + 2jc — 4 
2jc 3 _ + 8jc 

— 3jc 2 — 6jc — 4 

- 3jc 2 - 12 

- 6jc + 8 


This means that the rational function in question can be written in the form 


2x 3 ~ 3jc 2 + 2x - 4 -6x + 8 

-- = 2x-3+ — 


0 ) 


By applying this process, any improper rational function P(x)/Q(x) can be ex¬ 
pressed as the sum of a polynomial and a proper rational function, 


PM 

QW 


= polynomial + 


RM 

GW’ 


( 2 ) 


where the degree of R(x) is less than the degree of Q(x). In the particular case of 
(1), this decomposition by means of long division enables us to carry out the in¬ 
tegration quite easily, by writing 
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I 


2x^ - 3^ 2 + lx - 4 

x 2 +4 


dx = x 1 — 3x — 6 f y ^ a + 8 
J x 2 + 4 


/ 


dx 

x 2 + 4 


= jc 2 - 3jc - 3 In (jc 2 + 4) + 4 tan“ l y + c. 

In the general case (2), these remarks tell us that we can restrict our attention to 
proper rational functions, since the integration of polynomials is always easy. 
This restriction is not only convenient, but also necessary, because it is only to 
proper rational functions that the following discussions apply. 

In elementary algebra we learned how to combine fractions over a common 
denominator. We must now learn how to reverse this process and split a given 
fraction into a sum of fractions having simpler denominators. This procedure is 
called decomposition into partial fractions. 


Example 1 It is clear that 

3 2 _ 3(jc + 3) + 2(jc - 1) __ 5jc+7 

jc-1 jc + 3 (jc-1)(jc + 3) (jc- 1)(jc + 3) ’ ( j 

In the reverse process we start with the right side of (3) as our given rational 
function and seek constants A and B such that 

5jc + 7 A B 

(jc-1)(jc + 3) jc - 1 jc + 3' y ) 

(For the sake of understanding the method, let us pretend for a moment that we 
don’t know that A = 3 and B = 2 will work.) If we clear fractions in (4) by mul¬ 
tiplying through by (jc - 1 )(jc + 3), we get 

5* + 7 = A{x + 3) + B{x - 1) (5) 


or 


5jc + 7 = (A + B)x + OA - B). (6) 

Since (6) is to be an identity in jc, we can find A and B by equating coefficients 
of like powers of jc. This gives a system of two equations in the two unknowns 
A and B, 

whose solution is A = 3, B = 2. 

There is another convenient way to find A and B, by using (5) directly. Since (5) 
must hold for all jc, it must hold in particular for x = 1 (which removes B) and 
for jc = —3 (which removes A). Briefly, 

jc = 1: 5 + 7 = A(\ + 3) + 0, 4A = 12, A = 

x= —3: —15 + 7 = 0 + #(—3 — 1), -4B = -8, B = 2. 

This method is faster than it looks, and can be carried out by inspection. 
Whichever method we use to find A and B , (4) becomes 

5jc + 7 _ 3 2 

(x — 1 )(jc + 3) jc — 1 jc + 3 ’ 

and this is the partial fractions decomposition of the rational function on the left. 
Of course, the purpose of this decomposition is to enable us to integrate the given 
function, 


l A + B = 5 
[3A - B = 1 
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[ 5 * +? dx 

J (x- 1)U+3) 



dx 


= 3 In (* * — 1) + 2 In (* + 3) + c. 


The type of expansion used in (4) works in just the same way under more gen¬ 
eral circumstances, as follows: Let P(x)/Q(x) be a proper rational function whose 
denominator QM is an rcth-degree polynomial. If QM can be factored completely 
into distinct linear factors x - r\,x — r 2 , . . . , x - r m then there exist n constants 
A |, A 2 , ..., A n such that 

PM = A t + A 2 + . . . + A„ 

QM x- r { x - r 2 x - r n ‘ 

The constants in the numerators can be determined by either of the methods sug¬ 
gested in Example 1; and when this is done, the partial fractions decomposition 
(7) provides an easy way to integrate the given rational function. 


Example 2 Find 


1 


6x 2 + 14* - 20 
x 3 - 4x 


dx. 


Solution We factor the denominator by writing * 3 — 4* = *(* 2 — 4) = 
*(* + 2)(jc — 2). Accordingly, we have a decomposition of the form 

6x 2 + 14* - 20 = 6x 2 + \4x - 20 = B C 

x 3 - 4x ~ x(x + 2)(x -2)~x + x + 2 + x-2 () 

for certain constants A, B, C. To find these constants we clear fractions in (8), 
which yields 

6x 2 + 14* - 20 = A(x + 2)(* - 2) + Bx(x — 2) + Cx{x + 2). 


By setting * = 0, — 2, 2 (this is the second method in Example 1), we easily see 
that A = 5, B = — 3, C = 4, so (8) becomes 

6x 2 + 14* - 20 _ 5 3 4 

* 3 -4* * * + 2 x - 2 


We therefore have 


/ 


6* 2 + 14* - 20 
* 3 - 4* 


dx = 5 In * — 3 In (* + 2) + 4 In (* — 2) + c. 


In theory, every polynomial QM with real coefficients can be factored com¬ 
pletely into real linear and quadratic factors, some of which may be repeated.* 
In practice, this factorization is hard to carry out for polynomials of degree 3 or 
more, except in special cases. Nevertheless, let us assume this has been done, 
and let us see how the decomposition (7) must be altered to take account of the 
most general circumstances that can arise. 


*This statement is a consequence of the Fundamental Theorem of Algebra, which is discussed in Sec¬ 
tion 14.8. 
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If a linear factor x — r occurs with multiplicity m, then the corresponding term 
A!(x - r ) in the decomposition (7) must be replaced by a sum of the form 



A quadratic factor x 2 + bx + c of multiplicity 1 gives rise to a single term 

Ax + B 
X 2 + bx + c * 

and if this quadratic factor occurs with multiplicity m, then it gives rise to a sum 
of the form 


A\x + B i + A 2 x + B 2 + . . . + AmX + 
x 2 + bx + c ( x 2 + bx + c) 2 (a : 2 + bx + c) m * 

This is the whole story, and the theory guarantees that every proper rational func¬ 
tion can be expanded into a sum of partial fractions in the manner described 
above.* 


Example 3 Find 


3* 3 - 4x 2 - 3* + 2 

r 4 _ r 2 


dx. 


Solution We have 


3;c 3 — 4x 2 — 3x + 2 _ 3x 3 — 4x 2 — 3x + 2 
x 4 -x 2 ” x 2 (x + l )(x - 1) 


A B C , D 
— + “T +-r +-r- 

X X 1 X + 1 X —\ 


Clearing fractions gives the identity 

3x 3 - 4x 2 - 3x + 2 = Ax(x + 1)(jc - 1) + B(x + \)(x - 1) + Cx 2 (x - 1) + Dx 2 (x + 1). 
Now put 


X = 0: 

2 = -B, 

B= - 

x=\: 

-2 = 2D, 

D= - 

x = -1: 

-2 = — 2C, 

C = 1 . 

Equating coefficients of jc 3 

gives 


3 = 

A + C + D, 

so 


Our partial fractions decomposition is therefore 

3* 3 - 4x 2 - 3x + 2 3 2 1 1 

A Si ' . 


SO 


3a: 3 - 4a: 2 - 3a: + 2 


dx = 3 In x + —hln(A:+ 1) 
a: 


In (x - 1) + c. 


"This statement is called the Partial Fractions Theorem ; it is proved in Appendix A. 11. Students will 
notice that the above description of the partial fractions decomposition assumes that the highest power 
of * in Q(x ) has coefficient I; this can always be arranged by a minor algebraic adjustment. 
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Example 4 Find 


2x 3 + * 2 + 2x -1 


dx. 


Solution We have 

2x 3 + x 2 + 2* -1 _ 2x 3 + x 2 + 2x - 1 

x 4 - 1 “ (x + 1)(jc - l)(x 2 + 1) 

= A B Cx + D 

~ x + 1 * - 1 x 2 + 1 * 

so 

2x 3 + x 2 + 2x - 1 = A(jc - l)(x 2 + 1) + £(x + 1)U 2 + 1) + Cx(x 2 - 1) + D(x 2 - 1). 
Now put 

* = 1: 4 = 4 B, B= 1; 

x = -1: -4 = —4A, A = 1; 

;c = 0: -1 = -A + B-D, D= 1. 

Equating coefficients of x 3 gives 

2 = A + 5 + C, so C = 0. 

Our partial fractions decomposition is therefore 

2x 3 + x 2 + 2x - 1 _ 1 1 _1_ 

x 4 - \ _ x+ix-ix 2 +r 

so 

f 2x 3 + x 2 + 2x — 1 

- 4 _ -= In + 1) + In (x - 1) + tan -1 x + c. 


As a final comment, we point out that all the partial fractions that can possi¬ 
bly arise have the form 

A Ax + B 

{x — r) n ° r (x 2 + bx + c) n ’ 

Functions of the first type can be integrated by using the substitution w = x — r, 
and it is clear that the results are always logarithms or rational functions. A func¬ 
tion of the second type in which the quadratic polynomial x 2 + bx + c has no 
real linear factors, that is, in which the roots of x 2 + bx + c = 0 are imaginary, 
can be integrated by completing the square and making a suitable substitution. 
When this is done, we get integrals of the form 

f u du f du 

J (u 2 + k 2 ) n ' J (u 2 + k 2 ) n * 

The first of these is { In ( u 2 + k 2 ) if n— 1, and (w 2 + k 2 ) ] ~ n / 2(1 - n) if 
n > 1. When n— 1, the second integral is calculated by the formula 


f du 

J u 2 + k 2 


1 

— tan 
k 


-i a 


k' 
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The case n > 1 can be reduced to the case #2=1 by repeated application of the 
reduction formula 

f _ du _ = _ 1 _ U _ 2n - 3 f_ du _ 

J (u 2 + k 2 ) n 2k 2 (n - 1) (u 2 + k 2 )“~' 2k 2 (n -\) I (u 2 + k 2 ) n ~ 1 ' ^ ’ 

We state this complicated formula for the sole purpose of showing that the only 
functions that arise from the indicated reduction procedure are rational functions 
and inverse tangents. The formula itself can either be verified by differentiation 
or obtained from scratch by the methods of the next section. 

This discussion shows that the integral of every rational function can be ex¬ 
pressed in terms of polynomials, rational functions, logarithms, and inverse tan¬ 
gents. The detailed work can be very laborious, but at least the path that must be 
followed is clearly visible. 


PROBLEMS 


1 Express each of the following improper rational func¬ 
tions as the sum of a polynomial and a proper rational 
function, and integrate: 

jc 3 


(a) 

X* 

(b) 

jc - r 

(d) 

jc + 3 
jc + 2’ 

(e) 


3jc + 2 ’ 

x 2 - 1 
jc 2 + r 


(c) 


jc 2 + r 


Find each of the following integrals. 
f \2x - 17 


(x- IX* 
10 - 2x 


2 ) 


dx. 


dx. 


dx. 


dx. 


10 


12 


14 


15 


jc 2 + 5x 

9x 2 - 24jc 4 - 6 
x 3 - 5x 2 + 6x 

\6x 2 + 3jc - 7 
jc 3 — JC 

6jc 2 - 9jc + 9 
jc 3 - 3jc 2 

4jc 2 + 2jc + 4 
jc 3 + 4jc 

IFTa dx - 

jc 4 + 3jc 2 - 4jc + 5 
(jc - 1) 2 (jc 2 + 1) ' 


dx. 


dx. 


13 


dx. 


16 

f X 2 + 2x 

1 U + 1)!' * 

17 

18 

l x+l dx. 

J JC- 1 

19 


14jc — 12 
2jc 2 — 2jc — ! 

2 jc + 21 


dx. 


■lx 


dx. 


jc 2 + 46jc - 48 
jc 3 + 5jc 2 — 24jc 

4jc 2 + 1 Ijc — 117 
jc 3 + IOjc 2 - 39jc 

—4jc 2 — 5jc — 3 
jc 3 + 2jc 2 + . 


dx. 


dx. 


dx. 


3jc 2 - jc + 4 
r 3 + 2 jc 2 + 2 jc 


dx. 


jc + 2 

jc 2 4- 1 
jc -I- 2 


dx. 


dx. 


20 

21 

22 

23 


/ 

/ 

/ 

i 


x 3 -3x 2 + 2x- 3 


X 2 + 1 

u 

cos 8 


sin 2 0 + 3 sin 0 

- 4 

16 sec 2 6 

r dO. 


tan 3 8 — 4 tan 2 8 



I 


1 

1 + £>* 


dx. 


25 Use partial fractions to obtain the formula 
f dx 1 . a + jc 

2 2 — o 11 

J r-jc z 2 a a — jc 


Also calculate this integral by trigonometric substitution, 
and verify that the two answers agree. 

26 Find 

f 3 sin OdO f 5e { dt 

(a) J cos 2 6 — cos 0 — 2 ’ (b) J e 2t + e* — 6' 

27 In Problem 14 of Section 8.5 it is stated that the differ¬ 
ential equation 


-y~ = kab(A - x){B - jc), 
dt 


B, 


has 


= pkab(A —B)t 

A(B - JC) 

as a solution for which jc = 0 when t = 0. Derive this so¬ 
lution by using partial fractions. 

28 Verify the reduction formula (9) by differentiating the 
first term on the right. 

29 Suppose that a given population can be divided into two 
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groups: those who have a certain infectious disease, and those 
who do not have it but can catch it by having contact with an 
infected person. If x and y are the proportions of infected and 
uninfected people, then x + y = 1. Assume (1) that the dis¬ 
ease spreads by the contacts just mentioned between sick peo¬ 
ple and well people, (2) that the rate of spread dx/dt is pro¬ 
portional to the number of such contacts, and (3) that the two 


such contacts, and (3) that the two groups mingle freely with 
each other, so that the number of contacts is jointly propor¬ 
tional to x and y.lf x = xq when t = 0, find rasa function of 
t, sketch the graph, and use this function to show that ulti¬ 
mately the disease will spread through the entire population. 
When the formula for the derivative of a product (the 


When the formula for the derivative of a product (the product rule) is written in 
the notation of differentials, it is 

d(uv) = u dv + v du or u dv = d{uv) — v du , 

and by integrating we obtain 

j u dv = uv — J v du. (1) 

This formula provides a method of finding $u dv if the second integral fvdu is 
easier to calculate than the first. The method is called integration by parts , and 
it often works when all other methods fail. 

Example 1 Find fx cos x dx. 

Solution If we put 

u = x, dv = cos jc dx. 


10.7 

INTEGRATION 
BY PARTS 


then 


du = dx, v~ sin x, 


and ( 1 ) gives 

j x cos x dx = x sin x — J sin x dx. 

This is good luck, because the integral on the right is easy. We therefore have 


i 


x cos x dx = x sin x + cos x + c. 


It is worth noticing that in this example we could have chosen u and dv dif¬ 
ferently. If we put 

u = cos x, dv = x dx, 

then 


du = —sin x dx, 


v = 9 x 


2 


and ( 1 ) gives 


x cos x dx = jx 2 cos x + 


x 2 sin x dx. 


This equation is true, but it is completely worthless as a means of solving our 
problem, because the second integral is harder than the first. We urge students to 
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learn from experience, and to use trial and error as intelligently as possible in 
choosing u and dv. Also, students should feel free to abandon a choice that 
doesn’t seem to work, and quickly go on to another choice that offers more hope 
of success. 

The method of integration by parts applies particularly well to products of dif¬ 
ferent types of functions, like x cos x in Example 1, which is a product of a poly¬ 
nomial and a trigonometric function. In using the method, the given differential 
must be thought of as a product u • dv. The part called dv must be something we 
can integrate, and the part called u should usually be something that is simpli¬ 
fied by differentiation, as in our next example. 

Example 2 Find J In x dx. 

Solution Here our only choice is 

u = In jc, dv = dx, 



and we have 


\n x dx = x \n x — I x — = x In x — x + c. 

J x 


In some cases it is necessary to carry out two or more integrations by parts in 
succession. 


Example 3 Find / x 2 e x dx. 


Solution If we put 

then 

and (1) gives 


u = x 2 , dv = e x dx , 


du = 2x dx, v— e x . 


J x 2 e x dx = x 2 e x — 2 J xe x dx. 


( 2 ) 


Here the second integral is easier than the first, so we are encouraged to con¬ 
tinue in the same way. When the second integral is integrated by parts with 

u = x, dv = e x dx, 


so that 


du = dx, v— e x . 


then we get 


J xe x dx = xe x - J e x 


dx 


= xe 
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When this is inserted in (2), our final result is 

j x 2 e x dx = x 2 e x - 2xe x + 2e x + c. 


It sometimes happens that the integral we start with appears a second time dur¬ 
ing the integration by parts, in which case it is often possible to solve for this in¬ 
tegral by elementary algebra. 


Example 4 Find fe x cos x dx. 


Solution For convenience we denote this integral by J. If we put 

u = e x , dv = cos x dx, 


then 


du = e x dx, v= sin x, 


and (1) yields 


J = e x sin x 


i 


e x sin x dx. 


(3) 


Now we come to the interesting part of this problem. Even though the new in¬ 
tegral is no easier than the old, it turns out to be fruitful to apply the same method 
again to the new integral. Thus, we put 

u = e x , dv — sin x dx, 

so that 


du = e x dx, v = —cos x, 


and obtain 


J e x sin x dx = - e x cos x + J e x cos x dx. (4) 

The integral on the right is J again, so (4) can be written 

J e x sin x dx = —e x cos x + J. (5) 

In spite of appearances, we are not going in a circle, because substituting (5) in 
(3) gives 

J = e x sin x -1- e x cos x - J. 

It is now easy to solve for J by writing 

2 J = e x sin x + e x cos x or J = j(e x sin x + e x cos x), 
and all that remains is to insert the constant of integration: 

e x cos x dx = ye*(sin x + cos x) + c. 


The method of this example is often used to make an integral depend on a sim¬ 
pler integral of the same type, and thus to obtain a convenient reduction formula, 
by repeated use of which the given integral can easily be calculated. 
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Example 5 Find a reduction formula for J n = / sin" x dx. 


Solution We integrate by parts with 


u = sin" l x , 


dv = sin x dx, 


so that 



du = (n - 1) sin" 2 Jt cos x dx, v = —cos jc, 

and therefore 

J n = —sin" -1 x cos x + (n — 1) J sin" -2 Jt cos 2 x dx 

= —sin" -1 jc cos x + {n — 1) J sin" -2 jc( 1 — sin 2 x) dx 

= —sin" -1 jc cos jc + (« — 1) J sin" -2 x dx — (n — 1) j* sin" x dx 

= -sin" -1 jc cos x + (n — 1) J n ~ 2 ~ (n - 1 ) J n . 

We now transpose the term involving J n and obtain 


so that 


or equivalently, 


nJ n = —sin" 1 x cos Jt + {n — \)J n -i 


_ 1 ._, « — It 

J n = - Sin" 1 Jt COS JC + - Jn- 2, 

n n 


J sin" 


, 1 . „ _ 1 n — 1 

x dx = -sin" 1 x cos x + 

n 


—- [ sin"' 
n J 


'■ x dx. 


( 6 ) 


The reduction formula (6) allows us to reduce the exponent on sin x by 2. By 
repeated application of this formula we can therefore ultimately reduce J n to Jo 
or J 1 , according as n is even or odd But both of these are easy: 

Jo = J sin 0 x dx = J dx = x and J\ = j sin x dx = -cos jc. 

For example, with n = 4we get 


and with n — 2, 


j sin 4 x dx = — \ sin 3 x cos x + f j* sir 


J sin 2 x dx — — j sin jc cos x + \ J dx 


— 2 sin jc cos jc + 2 X - 


sin 4 x dx = —\ sin 3 Jt cos Jt + f (—\ sin jc cos jc + \x) 
= — \ sin 3 jc cos x — f sin jc cos * + jx + c. 


Therefore, 
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The same result can be achieved by earlier techniques depending on repeated use 
of the half-angle formulas, but our present methods are more efficient for large 
exponents. In our next example we illustrate another way in which the reduction 
formula (6) can be used. 

Example 6 Calculate 

f^ 2 • 8 ^ 

sin 8 x dx. 

Jo 


Solution For convenience we write 


Ctt! 2 

/„ = J o sin" x dx. 


By formula (6) we have 


/„ = —- sin” 1 x cos x 
n 


77/2 n 1 [tt/2 . 


~~ 1 [tt/2 . 

- sin” 

n 


' x dx , 


so 


/„ = 


L- 2- 


We apply this formula with n = 8, then repeat with n = 6, n = 4, n = 2: 

j — It — 1 It — 1 1 It — 1 1 1 1 / 

*8 — 8*6 8 * 6*4 “ 8 * 6 * 4*2 — 8 ' 6 * 4 2 * 0 - 

Therefore 


(* 77/2 

Jo 


sin 8 x dx = — ■ 


_5 

6 


A I [ nl2 

4 ’ 2 Jo 



5 3_1 

6 ‘ 4 ’ 2 ‘ 2 


3577 

256 


Remark 1 The reduction formula (6) can also be used to establish one of the 
most fascinating formulas of mathematics, Wallis's infinite product for 7t/2: 

= 2 2 4 4 6^ 6^ 

2 1*3*3*55 *7***’ 

For the details of the proof, see Appendix 2 at the end of the chapter. 

Remark 2 In Section 9.5 we stated Leibniz's formula for 7t/4, 



For students who are interested in little-known comers of the history of mathe¬ 
matics, we describe in Appendix 3 at the end of the chapter how Leibniz him¬ 
self discovered his formula by a very ingenious application of integration by parts. 

At this point we have described all the standard methods of integration that the 
student is expected to be acquainted with. A few additional techniques of minor 
importance remain, and two of these are briefly sketched in the problems at the 
end of Appendix A.9; but for most practical purposes we have reached the end 
of this particular road. 
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PROBLEMS 

Find each of the following integrals by the method of inte¬ 
gration by parts. 


1 

/jc In jc dx. 

2 

/ tan -1 jc dx. 

3 

/jc tan -1 jc dx. 

4 

fxe ax dx. 

5 

Je x sin jc dx. 

6 

fe ax cos bx dx. 

7 

/V1 - x 2 dx. 

8 

/ sin -1 jc dx. 

9 

J x sin -1 jc dx. 

10 

Ctt/2 

jc sin jc dx. 

Jo 

11 

J x cos (3jc — 2) dx. 

12 

f tan -1 jc , 

J x 2 dx. 

13 

/jc sec 2 jc dx. 

14 

/ sin (In jc) dx. 

15 

/ In (a 2 + jc 2 ) dx. 

16 

/jc 2 In (jc + 1) dx. 

17 

f^dx. 

j JC 

18 

j (In x) 2 dx. 


19 The region under the curve y = cos jc between jc = 0 and 
x = 7t/2 is revolved about the y-axis. Find the volume of 
the resulting solid. 

20 Find /(sin -1 x ) 2 dx. Hint: Make the substitution y = 
sin -1 jc. 

21 If P(x) is a polynomial, show that 

I P(x)e x dx = {P - P' + P" - P'" + • • -) e \ 

In the next two problems, derive the given reduction formula 

and apply it to the indicated special case(s). 


22 (a) f cos' 1 x dx = — sin x cos' 1 1 x 

J n 

+ —-- f cos' 1 " 2 x dx. 

n J 

C 7r/2 

(b) cos 7 jc dx. 

f77/2 

(c) cos 8 jc dx. 

23 (a) /(In jc)' 1 dx = jc(ln jc)' 1 - w/(ln jc)' 1 " 1 dx. 

(b) /(In jc) 5 dx. 

24 The region under the curve y = sin * between x = 0 and 
jc = 77 is revolved about the y-axis. Find the volume of 
the resulting solid (a) by the shell method; and (b) by the 
washer method. 

25 The curve in Problem 24 is revolved about the jc-axis. 
Find the area of the resulting surface of revolution. 

26 (The volcanic ash problem) When a volcano erupts, the 
cloud of ejected ash gradually settles onto the surface of 
the nearby land. The depth of the deposited layer of ash 
decreases with distance from the volcano. Assume that 
the depth of the ash r feet from the volcano is ae~ br feet, 
where a and b are positive constants. 

(a) Find the total volume of ash that falls within a dis¬ 
tance c of the volcano. Hint: What is the element of 
volume dVoi ash that falls on a narrow ring of width 
dr and inner radius r centered on the volcano? 

(b) What is the limit of this volume as 


10.8 

A MIXED BAG. 
STRATEGY FOR 
DEALING WITH 
INTEGRALS OF 
MISCELLANEOUS TYPES 


As the student understands by now, differentiation is straightforward but inte¬ 
gration is not. In finding the derivative of a function it is obvious which formula 
must be applied. But it may not be at all obvious which method should be used 
to integrate a given function. 

Since the problems in each section of this chapter have focused on the meth¬ 
ods of that section, it has usually been clear what method to use on a given in¬ 
tegral. Generally speaking, the methods at our disposal now are direct substitu¬ 
tion, trigonometric substitution, partial fractions, and parts. But what if an integral 
is met out of context, with no obvious clue as to how to work it out? In this sec¬ 
tion we try to suggest a strategy for this common situation. 

An essential prerequisite is a knowledge of the basic integration formulas. For 
the sake of emphasis, we repeat the list given in Section 10.1, together with three 
additional formulas arising from our work in this chapter. As we pointed out ear¬ 
lier, the first 15 formulas should be memorized, and we hope students will take 
our advice seriously this time. It is useful to know them all, but the last three 
(marked with an asterisk) need not be memorized since they are easy to derive, 
as follows. Formulas 16 and 17 are immediate from the simple partial fractions 
decompositions 
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1 


and 


(x + a)(x ~ a) 

1 

(a + x)(a — x) 


2 a 

J_ 
2 a 


x + a 


a + x 


These decompositions can easily be understood by mentally recombining the 
terms in brackets with the aid of a common denominator; we then see directly 
what the constant factor outside the brackets must be. Formula 18 is almost im¬ 
mediate from the trigonometric substitutions x = a tan 6 and x = a sec 6, re¬ 
spectively. In this list of formulas we use x instead of u as the variable of inte¬ 
gration— since the usefulness of the w-notation is now thoroughly familiar to us 
—and for the sake of simplicity we omit the constant of integration. 


f r^ 1 

1 x n dx = -- (n ^ - 1). 

J n T- 1 

if-- 

i 

J cos x dx = sin x. 

J sin x dx = —cos x. 

6 | sec 2 x dx = tan x. 


In x. 
x 

e x dx = e x . 


7 

8 
9 

10 


/ 


esc 2 x dx = —cot x. 
sec x tan x dx = sec x. 

esc x cot x dx = - esc x. 

dx . x 

— , — = sin 1 —. 

a 


dx 


+ Xr 


1 , X 

= — tan 1 —. 


" h 

12 | tan * dx = -In (cos x). 

13 | cot x dx = In (sin x ). 

J sec x dx = In (sec x + tan x). 

J esc x dx — —In (esc x + cot x ). 

/ 


14 

15 
*16 

*17 

*18 


dx 


x 2 -a 2 

2 a 

dx 

_ J_ 

a 2 — x 2 

2 a 

dx 

- = ] 


Vjc 2 ± a 2 


x - a 
x + a 

a + x 
a — x) 

= In (x + Vjc 2 ± a 2 ). 
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These formulas constitute our arsenal of weapons for attacking integrals, and 
it is up to us to decide which to use in a particular case. If we do not see what 
to do immediately, the following strategy may be helpful. 


STRATEGY FOR INTEGRATION 


1 Simplify the integrand. The use of algebraic or trigonometric identities will 
sometimes simplify the integrand and make a method of integration obvious. 
For example: 


| V*(Vx + ^/x) dx = J (x + x 5/6 ) dx; 

J (sin x + cos x) 2 dx = J (sin 2 x + 2 sin x cos jc + cos 2 x) dx 
= j* (1 + 2 sin x cos x) dx; 

= J (1 — tan 2 x) cos 2 x dx 


i 


1 — tan 2 . 


dx 


■/(■- 


cos 2 x dx 


= J (cos 2 x — sin 2 x) dx = J 


cos 2x dx. 


In the second problem, if we fail to notice that sin 2 jc + cos 2 x = 1, and in¬ 
stead integrate sin 2 x and cos 2 x separately, then we can still solve the prob¬ 
lem, but we have missed an opportunity to do things the easy way. A simi¬ 
lar remark applies to the third problem, with its use of the double-angle 
formula for the cosine. 

2 Look for an obvious substitution. Try to find some function u = g(;c) in the 
integrand whose differential du = g'(x) dx is also present, apart from a con¬ 
stant factor. For example, in 

r x dx 

J T ^ 2 

we notice that if u = 4 — x 2 , then du = — 2x dx and x dx = —\ du. It is there¬ 
fore much simpler to use this substitution than to use partial fractions or the 
trigonometric substitution x = 2 sin d , each of which also works but takes 
longer to carry out. 

3 Classify the integrand. This is the heart of the matter. If Steps 1 and 2 have 
not helped, then we turn to a more careful examination of the form of the in¬ 
tegrand f{x). 

(a) If f{x) is (or can be written as) a product of powers of sin x and cos x , 
or tan x and sec x , or cot x and esc jc, then the methods of Section 10.3 
can be used.* 


*A special method for integrating any rational function of sin jc and cos x is described in Appendix 
A.9 at the end of the book. This method will not be needed for any of the review problems at the 
end of this section. 
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(b) If /(jc) involves V<z 2 ± x 2 or Vjc 2 ± a 2 , or powers of these expressions, 
use the trigonometric substitutions of Section 10.4. 

(c) If /(jc ) is a rational function, use partial fractions as explained in Sec¬ 
tion 10.6—unless there is a better way for a particular integral. 

(d) If/(jc) is a product of functions of different types, try integration by 
parts. As we have seen in Section 10.7, this method also works for many 
individual inverse functions like In x, sin~ l x, and tan -1 x. 

(e) Be observant, thoughtful, flexible and persistent—all of which are of 
course easier said than done. If a method doesn’t work, be ready to try 
another. Sometimes several methods work. Keep your options open and 
do things the easy way—if any. And remember that doing a problem 
more than one way is a good learning experience. 


Our purpose in the following examples is to try to suggest possible lines of at¬ 
tack by “thinking out loud.” We are interested mainly in brainstorming these in¬ 
tegrals, and in most cases we will not work out all the details to the final answer. 


Example 1 



dx. 


Comments Since the integrand is a rational function, partial fractions will work. 
This requires factoring x 6 — 1 into (jc 3 + 1)(jc 3 — 1) = (jc + 1)(jc 2 — x + 1) * 
(jc — 1)(jc 2 + x + 1) and then finding constants A, B, C, D, E, F such that 

x 2 A Bx + C D Ex + F 

— - - =-— H-x-1-— H-x-. 

X 6 - \ X + \ X 2 — X 1 x — l x 2 + x+ l 


We can do this if we must, but actually carrying out this work is not an attrac¬ 
tive prospect. 

Let us probe in a different direction. A much more promising method is to no¬ 
tice that jc 6 is the square of jc 3 and that the numerator of the integrand is almost 
the derivative of jc 3 . Accordingly, if we put u = jc 3 , then du = 3jc 2 dx, x 2 dx = 
y du, and the integral becomes 


If du _ 1 / w — 1 

3 J a 2 - 1 “ 6 




by formula 16. 


Example 2 


dx. 


Comments The trigonometric substitution jc = tan 6 will work. Partial fractions 
will also work, but since the integrand is an improper rational function, we must 
begin with long division. However, an easier way to accomplish the result with¬ 
out actually carrying out the long division is simply to add and subtract 1 in the 
numerator, 


i 


+ JC 2 



+ 1-1 

1 + JC 2 



1 - 



dx 


= x — tan 1 jc. 
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Example 3 


e 2x dx 

e x - r 


Comments We begin by noticing that e ^ dr = e x (e x dx) = e x d(e x ). This sug¬ 
gests that we put u = e x , so that e 2x dx = u du and the integral can be written 


u du 
u ~ 1 


u — 1+1 
u - 1 






= w + In (w — 1) = e* + In (e x — 1). 


By subtracting and adding 1 here we employ a slight variation of the idea used 
in Example 2. 


Example 4 


4x + 1 

1 + X 2 


dx . 


Comments The numerator is nearly (but not quite) the derivative of the de¬ 
nominator. This suggests that we break the integrand into a sum and rearrange 
the constants to achieve this desirable condition: 


4x + 1 
1 + x 2 


dx = 


-/(’• 

-*j 


lx 


1 + X 2 

2x dx 


1 + x 2 


1 + X 2 

dx 

. + x 2 


dx 


= 2 In (1 + x 2 ) + tan" 


x. 


f 2x + 6 
J x 2 + 7x+ 10 dx ' 

In Example 4 we arranged part of the numerator to be the deriva- 
denominator. A similar purpose here suggests that we write 

f 2x + 6 _ _ f (2x + 7) - 1 

J x 2 + lx + 10 J x 2 + lx + 10 

f (2x + 7) d: _ f dx 
J x 2 + 7x+ 10 J x 2 + 7x+ 10' 

The first of these integrals has been arranged to be In (x 2 + lx + 10), and we 
can easily work out the second by factoring the denominator into (x + 2)(x + 5) 
and using partial fractions. 


Example 5 

Comments 
tive of the 


f x 5 dx 

Example 6 J ( , + ^ 

Comments The trigonometric substitution x = tan 6 will work. Partial fractions 
will also work, but if we try this there will be eight unknown constants to find. 
We hope for something better. 

Let us try the substitution u = 1 + x 2 . Our only reason for this is that it sim¬ 
plifies the denominator to w 4 . Then du = 2x dx, and we have 
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f x 5 dx _ 1 

r (x 2 ) 2 (x dx) __ 1 I 

f (u - l) 2 du 

1 

1 (1 + x 2 ) 4 - j 

(1 + x 2 ) 4 2-1 



= \ [ —— 2 4 " + 1 du=\-\ (u~ 2 ~ 2 u~ 3 + u~ 4 ) du, 
2 J if 2 J 

which is easy. 


[ dx 

E ” n ""' 7 

Comments We notice at once that the differential of In * is dx/x. We therefore 
put u = \n x, so that du = dx/x and 

f _ f du_ _ J. _ 1 

J *(ln *) 2 J w 2 u In*’ 


Example 8 


* dx 

VTTT 


Comments This requires a so-calle d ratio nalizing substitution , that is, one that 
eliminates the radical. We put « = ^/x + 1, so that w 3 = * + 1, 3w 2 = dx, and 

x = u 3 — 1. We can now write 


f x dx 

J ^TT 


\ ~ ~ ~u~^~ dU = f <3u 4 - 3u) du. 


which is easy. 


Example 9 


Comments The rationalizing substitution 


' 1 + s 
1 -* 


will work here, but the result is a messy ratio nal fun ction. A better idea is to mul¬ 
tiply both numerator and denominator by Vl + x, which gives 




-i 


+ * VTT7 


dx 


~ x vT+7 

dx f x dx 


-I 


1 + * 

vr^? 


dx 


V I - X 2 


Vl -X 2 


- Vl - X 2 . 


Example 10 -- dx. 

j 1 + cos * 

Comments This time we multiply both numerator and denominator by 1 — cos * 
to obtain a somewhat different application of the idea in Example 9: 
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f-—! - dx= I 

j 1 + COS X J 

-I 


1 


_ 1 - cos x 

1 + cos x 1 - cos x 


dx 


-/ 


1 - COS X 


dx 


1 - COS X 


dx = esc 2 x dx 


[ CQS 

J sir 


: dx 


— —cot X + 


1 


sin x 


Example 11 J e^ x dx. 

Comments It is natural to try the substitution u — Vx, even though we have no 
idea what is likely to happen. Then u 2 = x , 2 u du = dx, and we have 

J e^~ x dx = J 2 ue u du. 

This integral is now an obvious candidate for integration by parts. 


The following list of problems contains integrals of all the types we have en¬ 
countered, arranged in random order so that students can test their diagnostic 
powers. 


PROBLEMS 


Find the following integrals. 
f x dx 
j \/i_2 


1 


Vi - .v 1 

3 J sin 2 x cos 5 x dx. 


5 /VTfS*. 
J x In x 

7 J sin Vr dx. 

9 J cos x tan x dx. 
11 x sin 2 x dx. 


J x si 

13 I- 
15 h 


+ e x 
x 2 dx 


dx. 


tan -1 Vx 


17 I VI 

I 

V4-x 2 


dx. 


19 | ^±4-dx. 


21 I 


dx. 


23 fx 5 e x * dx. 


2 \ x 4 In x dx. 


dx 


J x 3 + 4x ' 

6 J (e 3v ) 4 e x dx. 

8 h^r dx - 

10 l - C0SX 2 dx. 

12 j lnX+ x V ~ X dx. 

14 j dx 

16 / sin x cos (cos x) dx. 

18 | sec 4 x dx. 

20 J(1 + \/x)*dx. 

22 f_*_ 

J e 2x + 5 e .f 
24 JV + l ) 2 dx. 


25 

27 

29 

31 

33 

35 

37 

39 

40 

41 

43 

45 


I (x + 3f- dx - 

26 

I x 4 + 2x 2 + 10 dx ' 

28 

f x In x 

f - dx. 

J v^T 

30 

fx 2 sin x 3 dx. 

32 

f —--dx. 

J (x 2 + 1)(JC 2 + 4) 

34 

(• a: 2 dx 

J (x- l) 3 ' 

36 

J tan 3 x sec 4 x dx. 

38 

f . - i .... 

dx. 

1 - x 2 + V 1 - X 2 

fx 3 e~ 2x dx. 


J sin 2 x cos 4 x dx. 

42 

rvv 

J x + 3 

44 

J In (x 2 + 3) dx. 

46 


J x 'Vx 4- 5 dx. 

j x 2 Vx 3 - 4 dx. 
J sin 2x 


dx. 


V4 — co 
fx sec x tan x dx. 
f dx 


xV2x- 16 
x 3 In x dx. 


/ ( e * ~ T 7 ) 2 dx - 


x dx 


Vl - 4x 2 

r3 


16 + X 8 


dx. 


1 + X‘ 


dx. 
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47 

49 

51 

53 

55 

57 

59 

61 

63 

65 

67 

69 

71 

73 

75 

77 

79 

81 

83 

85 

87 


e 5x cos 3a dx. 


x dx 


- 2x 2 - 3' 


x 4 + 1 


x 5 + 5x + 3 
dx 


dx. 


x + 7 + 5Va + 1 ‘ 
sin x dx 


1 + 3 cos 2 a‘ 
v-3 


dx. 


(x + l ) 8 
tan 6 x dx. 

jc 2 + 5jc H- 6 


dx. 


j In V2x - 


1 dx. 


1 + * 


dx. 


J sin 2 5x cos 2 5x dx. 
J cot x In (sin a) dx. 
J cot 3 2x esc 3 2x dx. 
J In (2a + x 2 ) dx. 

J Va( 1 — Va) dx. 

J In (1 + x 2 ) dx. 

J x tan 2 x dx. 

J sec 7 x tan x dx. 

J x sin -1 x dx. 

r v3 


—;—o dx. 

1 + x 8 

sec 2 x dx 
Vsec 2 x — 1 


48 

50 

52 

54 

56 

58 

60 

62 

64 

66 

68 

70 

72 

74 

76 

78 

80 

82 

84 

86 

88 


/ 

/ 

/ 

i 

! 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 


A + 1 

X 2 - 2x + 2 


dx. 


e x+ey dx. 


x sin 1 (a 2 ) dx. 
_ dx_ _ 

x 3 + x 2 + x + r 

2x + 3 
— 5 ——T dx. 


sin (In x) dx. 

sin x + cos x 
sin x — cos x 
dx 


dx. 


Vl — 4x 2 

4 dx 

x 2 + 4x + 20' 

X 


dx. 


dx. 


Vl6 -X 4 
dx 

x 2 + 5x - 6' 
dx 

e 3x _ e x' 

e x + 1 
e*- 1 
dx 

sin Ax’ 
e x dx 

e 2x - r 

x 4 

(x 5 + l ) 3 
tan -1 2x 
1 +4x 2 
dx _ 

x 2 + 5x + 6 * 
1 + cos 2 x 


dx. 

dx. 


1 - cos 2 x 

1 v- 


dx. 


tan 


■ dx. 


dx 


1 + 2e x — e 


89 

. r 

f x 

Iv;*- 

90 

91 

j X 

92 

93 

f v ^ 2 * 

J x + 2 

94 

95 

f dx 
j sin 2 *’ 

96 

97 

J V(1 + 3a)( 1 — 3a) dx. 

98 

J sin 3 a cos 2 a dx. 


99 

f 2X + 5 ! 

J x 2 + 5a + 6 X 

100 

101 

/ A 4 - 2a 2 - 3 dx - 


102 

f \/1 + A/1 + Va dx. 

103 


104 

105 

f sin 3 a , 

5 dx • 

j COS D X 

106 

107 

J sin a cos 2a dx. 

108 

109 

f x dx 

Va - 1 

110 

111 

| In (ax + b) dx. 

112 

113 

J e x cos ( e x ) dx. 

114 

115 

f Va In a dx. 


116 

f COS A 

dx. 

J sin 2 a — 2 sin a + 3 

117 

/(tan a + cot a) 2 dx. 

118 

119 

f 32a + 80 
i (a - 1)(a+ 3) 2 

120 

121 

J In (l - Va) dx. 

122 

123 

J x tan “ 1 (a — l) dx. 

124 

125 

1 — *■ 

1 Vl - A 2 



1 

1 

1 

1 

1 


V^+9 , 

-— dx. 

x 

dx 

cos 5x’ 


Vx + Va- + 1 

dx 

V9 * 2 + 12 * - 5 


sin 2 x cos 3 x dx. 


J x sec 2 x dx. 

J sin 2x cos x dx. 
i 4 


v4 _ 


dx. 


dx 


e x + 5 + 4e~ 


[ 

J sir 


- dx. 


_ dx _ 

3* 2 - 13* + 4* 


f In (;cVx) dx. 
f dx 
J a 2 + b 2 x 2 ’ 
f x 2 dx 
J (x 2 + l) 3 ’ 

f _ dx 

J 2x 2 -2x + 


From the point of view of the theorist, the main value of calculus is intellectual; 
it helps us comprehend the underlying connections among natural phenomena. 
However, anyone who uses calculus as a practical tool in science or engineering 
must occasionally face the question of how the theory can be applied to yield 
useful methods for performing actual numerical calculations. 

In this section we consider the problem of computing the numerical value of 
a definite integral 


10.9 

NUMERICAL 
INTEGRATION. 
SIMPSON’S RULE 
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fix) dx (1) 

Ja 

in decimal form to any desired degree of accuracy. In order to find the value of 
(1) by using the formula 

[ h f(x)dx = F(b)-F(a), (2) 

Ja 

we must be able to find an indefinite integral F(x) and we must be able to eval¬ 
uate it at both x = a and x = b. When this is not possible, formula (2) is useless. 
This approach fails even for such simple-looking integrals as 

rV^Txdx and f 5 ^dx, 

Jo J 1 X 

because there are no elementary functions whose derivatives are Vsin x and eVx 
(see Appendix A.9). 

Our purpose here is to describe two methods of computing the numerical value 
of (1) as accurately as we wish by simple procedures that can be applied re¬ 
gardless of whether an indefinite integral can be found or not. The formulas we 
develop use only simple arithmetic and the values of f(x) at a finite number of 
points in the interval [a, b], In comparison with the use of the approximating 
sums that are used in defining the integral (see Section 6.4), the formulas of this 
section are more efficient in the sense that they give much better accuracy for 
the same amount of computational labor. 



THE TRAPEZOIDAL RULE 


Let the interval [a, b ] be divided into n equal parts by points x 0 , x\, ... ,x n from 
xq = a to x n = b. Let yo, y \,..., y n be the corresponding values of y = f(x). We 
then approximate the area between y = f(x) and the x-axis, for x 1 < x < by 
the trapezoid whose upper edge is the segment joining the points (**-i, y*- 1 ) 
and (xk, yk) [see Fig. 10.5]. The area of this trapezoid is clearly 

T(w -1 + yk)(x k - x k -i). (3) 

If we write 


&x = x k - X k -i 



(4) 


then adding the expressions (3) for k = 1,2gives the approximation for¬ 
mula 


[ fix) dx = (jyo + y\ + yi + • • • + y n -i + jy n ) fa, 

Ja 

because each of the y's except the first and the last occurs twice. This formula 
is called the trapezoidal rule. 


Example 1 Use the trapezoidal rule with n = 4 to calculate an approximate value 
for the integral 


f Vl - x 3 dx. 
Jo 
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Here y = /(x) = Vl — x 3 and xo = 0, x\ = {, xj = x-$ = f, X4 = 1. We can 

compute the y’s easily by using a calculator: 

To = 1. 

>'1 = vl = Vo.984 = 0.992, 
y 2 = ij = V0.875 = 0.935, 

>’3 = vl = VO. 578 = 0.760, 

y 4 = 0. 

By the trapezoidal rule, we therefore have 

£ Vl -x 3 dx = {(0.500 + 0.992 + 0.935 + 0.760 + 0.000) = 0.797. 


SIMPSON’S RULE* 


Our second method is based on a more ingenious device than approximating each 
small piece of the curve by a line segment; this time we approximate each piece 
by a portion of a parabola that “fits” the curve in a manner to be described. 

Again we divide the interval [a, b ] into n equal parts, but now we require that 
n be an even integer. Consider the first three points xo, x\, X 2 and the corre¬ 
sponding points on the curve y =/(*), as shown in Fig. 10.6. If these points are 
not collinear, then there is a unique parabola that has vertical axis and that passes 
through all three points. To see this, recall that the equation of any parabola with 
vertical axis has the form y = P(x) where P(x) is a quadratic polynomial, and ob¬ 
serve that this polynomial can always be written in the form 

P(x) = a + b(x — x\) + c(x ~ X[) 2 . (5) 

We choose the constants a , b, c to make the parabola pass through the three points 
under consideration, as indicated in the figure. Three conditions are necessary 
for this: 


xq, a + b(x o — *i) + c(x o ~ x\) 2 - yo> 


at x = 




a = y i; 


a + b(x 2 - xi) + c{x 2 - *i) 2 = y 2 - 


( 6 ) 

(7) 



Equations (6) and (7) can be solved for the constants b and c. However, it is more 
convenient to use the definition (4) of A* and the fact that a = y\ to write these 
equations in the form 

-b Ax + c Ax 2 = y 0 - yj, 
b Ax + c Ax 2 = y 2 - yi, 

from which we obtain 

2c kx 2 = y 0 - 2 y 1 + y 2 . (8) 


Thomas Simpson (1710-1761), an English mathematics teacher whose name is wrongly attached 
to the rule that bears his name, was in his earlier years a professional astrologer and confidence man 
(one of his escapades forced him to flee to another town). His eventual success as a writer of ele¬ 
mentary mathematics textbooks was greatly helped by accusations of plagiarism. This success en¬ 
abled him to escape from poverty and leave his shady past behind him. 
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We now think of the parabola (5) as a close approximation to the curve 3 ; = 
f(x) on the interval [jcq, X 2 L and we compute this part of the integral ( 1 ) accord¬ 
ingly, 

[ /(x) dx = \ \a + b(x — x 1) + c(x — X\) 2 ] dx 

= ax + 7 b(x — x \) 2 + }c(x ~ xi) 3 • 

lx a 

When this is evaluated in terms of A*, we obtain 


2a Ajc + f c Ax 3 . 



By using ( 8 ) and the fact that a = y 1 , we can write this in the form 
2 y\ Ax + {(y 0 “ 2y\ + y 2 ) Ax = y(y 0 + 4yi + y 2 ) Ax. 


The same procedure can be applied to each of the intervals [x 2 , x 4 ], [x 4 , x 6 ], 
When the results are all added together, we get the approximation formula 



l(yo + 4y\ + 2^ + • ■ ■ + 4+ y n ) Ax, 


which is called Simpson's rule. We specifically point out the structure of the ex¬ 
pression in parentheses: y 0 and y n occur with coefficient 1 , the remaining y’s with 
even subscripts occur with coefficient 2 , and the y 's with odd subscripts occur 
with coefficient 4. 


Example 2 Use Simpson’s rule with n = 4 to calculate an approximate value 
for the integral 

f 2 dx 

Jo 1 +x 4 ' 


This 
x 4 = 


time we have y = fix) = 1/(1 + a: 4 ) and *o = 0, x\ = 7, xz = 1 , *3 
2. A simple table helps to keep the computations in order: 


ro = 1 

>i = if = 0.941 
>>2 = 1= 0.500 
> 3 = |f = 0.165 
>4 = IT = 0.059 


yo = 1.000 
4y! = 3.764 
2y 2 = 1.000 
4>>3 = 0.660 

3/4 = 0059 
sum = 6.483 


3 

2’ 


Simpson’s rule now yields 


Sometimes data is obtained from a scientific experiment with equally spaced 
observations. If this data represents isolated values of a function whose analytic 
expression is not known, then it may be wished to obtain an approximation to 
the integral of this function over the range of observation. Simpson’s rule can be 
used in such a situation. 
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Example 3 If the experimental data is 


X 

0 

0.5 

1 

1.5 

2 

y 

1.0000 

1.6487 

2.7183 

4.4817 

7.3891 


then 


f Q 2 ydx = i[ 1 + 4(1.6487) + 2(2.7183) + 4(4.4817) + 7.3891] 

= 6.3912. 

As a matter of fact, y = e x was the function used to generate this table of val¬ 
ues, so the value of the integral is e 2 — 1 = 6.3890560989 to 10 decimal places. 


Any serious study of a method of approximate calculation must include a de¬ 
tailed estimate of the magnitude of the error committed so that definite knowl¬ 
edge is available of the level of accuracy attained. We do not pursue this matter 
very far here, but merely state that the error in Simpson’s rule is known to be at 
most 


M{b - a) 
180 


A* 4 , 


(9) 


where M is the maximum value of/ (4) (x) on [a, b\ Derivations of this bound for 
the error can be found in books on numerical analysis. The power of Ax that ap¬ 
pears in (9) tells us that if we reduce the width Ax by a factor of 10 (using 10 
times as many subintervals), then we expect the maximum error to shrink by a 
factor of 10 4 = 10,000. If we replace Ax in (9) by (b - a)/n, the bound (9) takes 
the form 


M(b ~ a) 5 
180 n 4 ' 


( 10 ) 


This formula enables us to impose a previously determined bound on the error 
by specifying a suitable value for n. 


Example 3 ( continued ) We see that the actual error in the above calculation is 
about 0.0021 when n = 4. What value of n will guarantee that the error will be 
at most 0.0001? 

In this case, assuming/(x) = e x really was the function underlying our data, 
then/ (4) (x) = e x and M = e 2 . By (10) we therefore have 


e 2 -2 5 
180fz 4 


0 . 0001 , 


so 

e 1 • 2 5 

ft 4 = 10 4 or n = 10.7. 

180 

Any integer n > 11 will therefore provide this level of accuracy. 


Students who own calculators and enjoy working with them should note that 
the methods and problems of this section—and also of others to come, espe- 
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dally Section 14.5—provide plenty of raw material for these calculator enthu¬ 
siasts. 


PROBLEMS 


HHl 1 Clearly, 

Vjc dx = f = 0.666 .... 

Calculate the value of this integral approximately with 
n = 4 by using 

(a) the trapezoidal rule (recall that V2 = 1.414... and 
V3 = 1.732...); 

(b) Simpson’s rule. 

Since the two rules are almost equally easy to apply, and 
Simpson’s rule is usually more accurate, the trapezoidal 
— rule is rarely used in practical computations. 

HH 2 Clearly, 


sin x dx = 2. 

Jo 

Calculate the value of this integral approximately by us- 
ing Simpson’s rule with n = 4. 

HHl 3 The exact value of 


I Vsin x dx 
Jo 

is not known. Find its approximate value by using Simp- 
r^| son’s rule with n - 4. 

HHl 4 The exact value of 


[ 5 — dx 
J i x 


is not known. Use Simpson’s rule when n — 4 to find its 
— approximate value. 

•Ill 5 The exact value of 


£ 


e * 2 dx 


is not known, but to 10 decimal places it is 0.8820813908. 
Calculate this integral approximately by using Simpson’s 
— rule with n — 4. 

•Ill6 Find an approximate value for In 2 by using the fact that 


In 2 



and applying Simpson’s rule with n — 4. (To 10 decimal 
„ places, In 2 = 0.6931471806.) 

•Ill 7 Use the formula 


J7T f 1 dx 

4 Jo 1 + X 2 

to find an approximate value for tt by using Simp¬ 
son’s rule with n = 4. (To 10 decimal places, tt = 
3.1415926536.) 



Figure 10.7 A dogleg fairway on a golf course. 
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8 In Example 3, what positive integers n will guarantee that 
the error is at most 0.000001? 

9 The width, in feet, at equally spaced points along the fair¬ 
way of a hole on a golf course is given in Fig. 10.7. The 
management wishes to estimate the number of square 
yards of the fairway as a basis for deciding how long it 
should take a groundskeeper to mow it. Use Simpson’s 
rule to provide such an estimate. 

10 Suppose that the three points on the curve in the deriva¬ 
tion of Simpson’s rule are collinear. Use (8) to show that 


in this case c = 0, and conclude that under this as¬ 
sumption the curve through the points is a straight line 
instead of a parabola. 

11 Simpson’s rule is designed to be exactly correct if f(x) 
is a quadratic polynomial. It is a remarkable fact that it 
also gives an exact result for cubic polynomials. Prove 
this. Hint: Notice that it suffices to establish the state¬ 
ment for n = 2; then prove it in this case for the func¬ 
tion f[x) = x 3 ; then extend it to any cubic polynomial. 

12 Use formula (9) to prove the statement in Problem 11. 


CHAPTER 10 REVIEW: FORMULAS, METHODS 


Think through and learn the following. 

1 The 15 basic formulas (write them down from memory). 

2 Method of substitution. 

3 Integrals of the form 

J sin™ x cos' 1 x dx , J tan™ x sec" x dx , 

j cot™ x esc" x dx. 


4 The trigonometric substitutions x = a sin 6, x = a tan 6 , 
x = a sec 6. 

5 Completing the square: (x + A) 2 = x 2 + 2 Ax + A 2 . 

6 Method of partial fractions. 

7 Integration by parts. 

8 Simpson’s rule. 
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SECTION 10.2 

Find each of the following integrals. 


j V3j + 5 dx. 
f 6x dx 

J 1 + 3x 2 ' 

5 J cos ( 1 - 5x) dx. 

„ f sec Vx tan Vx dx 

' Vx ' 

f 2 xdx 

J 1 +* 4 ’ 

11 | cot 4x dx. 
dx 


13 


15 


17 | sin 


;c(ln x) 2 ’ 

sec 2 x dx 
tan x 

/3x - 5 


\ 2 


dx. 


(In x) 6 dx 


e i/x dx 


6 | sin x sin (cos x) dx. 
x 3 dx 


8 


10 


vT-7 


x 2 + 5 
x 2 + 4 


dx. 


12 [ - 7 ^—. 

J sin 2x 

» ll^7- 

16 | 10xV v5 dx. 

18 J esc 2 (2 — x) dx. 


19 

21 

23 

25 

27 

29 

31 

33 

35 

37 


| 6x 2 cot x 3 esc x 3 dx. 
dx 


x[l + (In x) 2 ] ’ 
dx 

Ox + 5) 2 ’ 
dx 


f _ dx 

J 3-2x‘ 

J x 2 cos (1 + x 3 ) dx. 
J x esc 2 (x 2 + 1) dx. 
cos x dx 


l + sin 2 jC 
dx 




tan 2 jc' 
tan -1 x dx 


+ 


dx 


20 


sec 2 x dx 
V1 - tan 2 x 


22 cot ttx dx. 


24 tan x sec 4 x dx. 


2x+Y 


26 

28 

30 

32 

34 

36 

38 


( e x + 2x) dx 
e x + x 2 — 2 ' 


i 

J sin (2 — x) dx. 

f dx 
' V3 - 4x 2 ' 
f dx 

J 1 + 4jc 2 ‘ 

J (esc x — \) 2 dx. 

J \^3x — 2 dx. 

f (e x — e~ x ) dx 
J e x + e~ x 
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71 
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j 

/ 


e x!1> dx. 

sec 2 (sin x) dx 
sec x 


40 


dx 

sec 2x 


79 J (1 + cos x) 4 sin * dx. 80 
81 J cos (tan x) sec 2 x dx. 82 


/ 

/ 


(1 + cos x) dx 
x + sin x 

esc 2 (In x) dx 
x 


J (esc x — cot x) CSC 

x dx. 


f dx 

44 j 

r dx 

‘ Vl - 25*2 

' 16 + 25x 2 ' 

f sec x tan x dx 

J 1 + sec 2 x 

46 J 

[ (1 T sec x) 2 dx. 

f (In x) 2 dx 

J X 

48 J 

f cos xdx 
sin 2 x 

f sin x dx 

50 J 

f 6 esc 2 x dx 

J 1 + cos x' 

1 - 3 cot x ’ 

f dx 

J 

52 j 

f e x cos e x dx. 

f sin (In x) dx 

J X 

54 j 

f esc 2 Vx dx 

Vx ' 

f esc 1/x cot 1/x dx 

56 J 

f 4 dx 

J X 2 

3 + 4^' 

r e 2x dx 

J I + e 4 *' 

58 J 

f **,. 

sin x z 

J x 3 V2 + x 4 dx. 

60 J 

f x dx 

' V2-x 2 ' 

f (1 + e x ) dx 

J e x + x ‘ 

62 j 

f xe xl dx. 

f 2 dx 

J V?' 

64 J 

f x sin ( 1 - x 2 ) dx. 

r dx 

66 J 

f dx 

J sin 2 x ’ 

V4 - 9a 2 

J x tan x 2 dx. 

68 j 

f sec 2 x dx 

Vtan x 

f x dx 

70 J 

f 2 e 2x dx. 

J l+x 2 ' 

J xe 3 * ! ~ 2 dx. 

72 J 

f 3x 2 sin x 3 dx. 

J sec x (sec x + tan x) dx. 


f x 2 dx 

J 9 +x 6 ' 



J x 2/3 V 1 + x 5/3 dx. 

76 J 

f 4x 3 dx 

1 + A 4 ' 

J sec 2 x e tan x dx. 

78 J 

^ x sec 2 x 2 dx. 


Compute each of the following definite integrals by making 
a suitable substitution and changing the limits of integration. 


83 


f 

Jo 


v "2/2 2x dx 


Vi -a 4 


fir/4 

85 cot 2x esc 2 2x dx. 86 

Jtt/S 


87 f 4 2 xVa 2 + 9 d*. 
Jo 


84 


88 


fVTT 

I x sin x 2 dx. 

cos x dy 
1 + sin 2 x‘ 

x dx 

Vx 2 + 16 


£ 

l’ 


SECTION 10.3 

Calculate each of the following integrals. 

89 J sin 2 5x dx. 90 J cos 4 3x dx. 


91 J cos 2 lx dx. 

93 J sin 5 x cos 2 x dx. 

95 j cos 3 4x dx. 
cos 3 x dx 


97 


108 


/■ 


sim x 

99 J sin 375 x cos x dx. 
101 J sec 6 x dx. 

103 J tan 3 x sec 7 x dx. 
105 J cot 5 x dx. 


92 J sin 6 x dx. 

94 J sin 5 x dx. 

96 J cos 3 2x sin 2x dx. 

qg f sin 5 x dx 
Vcos X 

100 J sin 2 x cos 4 x dx. 
dx 


102 


104 


106 


107 J (sec 3x + esc 3x) 2 dx. 

t ■ 


J cot 4 


x dx. 


dx 


sin 4 3x‘ 


dx 


sec x tan x 


SECTION 10.4 

Find each of the following integrals. 


109 

j V3 - x 2 dx. 

110 j 

f dx 

\ (fl 2 + ^2)3/2 

111 

f x 2 dx 

M2 j 

f V4 - 9a 2 

J a 2 + x 2 ' 

1 A ‘ 

113 

J x 3 Va 2 - x 2 dx. 

114 J 

f x 3 dx 

Vfl 2 + X 2 
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115 

117 

119 

121 

123 

125 

127 

129 

131 


i 

i 

i 

i 

i 

i 

i 

i 

i 


Va 2 + i- 2 , 

-r- dx. 

x z 

dx 

x 4 V a 2 - x 2 
x 2 dx 

(<a 2 + x 2 ) 2 ' 
dx 

x 2 Vx 2 - 9 ’ 
dx 

(1 - 9x 2 ) 3/2 ‘ 
dx 

xV9 4- 4x 2 
x 2 dx 

(a 2 x 2 )3/ 2 

X 2 

(fl 2 + x 2 ) 3/2 ' 
x 2 <ix 

Vx 2 — a 2 


116 

r dx 

' x 2 Va 2 - x 2 


f dx 

118 

J x 2 4~ x 4 

120 

J x\a 2 - x 2 ) 3/2 dx. 

122 

J Vx 2 - 1 dx. 


f x 2 dx 

124 



Va 2 4- x 2 

126 

C dx 

•* V9 - (jc - l) 2 

128 

r dx 

x 4 Va 2 4- x 2 


f Vfl 2 — X 2 

130 

J- ,4 * 


f x 3 dx 

132 

J (x 2 - a 2 ) 2 ' 2 ' 


SECTION 10.5 

Calculate each of the following integrals. 


133 J 

f dx 

134 

r dx 

V65 - 8x - x 2 

Vl + 4x - x 2 

135 J 

f dx 

136 

(• (3x - 5) dx 

' 5x 2 + l(k + 15' 

J x 2 + 2x + 2' 

137 J 

r dx 

138 

r (l-x)dx 

V2 + 2x- 3x 2 

' V8 + 2x - x 2 

139 J 

f a: 2 dx 

140 

f x dx 

V2x - x 2 

Vx 2 - 4x 4- 5 

141 J 

r dx 

142 

f (3x + 4) dx 

3x 2 — 6x 4- 15 * 

V2x + x 2 

143 J 

r dx 



(x - iWx 2 -2x- 

3 


144 J 

f (2x - 5) dx 

V4x - x 2 



145 J 

f (3x + 7) dx 

146 

J Vx 2 + 2x + 2 dx. 

Vx 2 + 4x + 8 

147 J 

f (2x - 3) dx 

148 

J Vx 2 - 2x dx. 

' (x 2 + 2x - 3) m ' 


SECTION 10.6 

Find each of the following integrals. 
16x 4- 69 


149 

151 

153 

154 

155 

156 

157 

158 

159 

160 


12 


—8x — 16 
4x 2 - 1 

3x 2 - lQx - 60 
x 3 + x 2 - 12x 

8 x 2 + 55x - 25 
x 3 - 25x 

—2x 2 - 18x + 18 
x 3 - 9x 

4x 2 - 2x - 108 


dx. 150 

dx. 152 

dx. 

dx. 

dx. 


3x - 56 
x 2 4- 3x - 28 

12x - 63 


dx. 


y2 _ 


3x 


dx. 


dx. 


x 3 4- 5x 2 - 36x 
—3x 3 + x 2 + 2x + 3 

X 4 + X 3 

9x 2 - 35x + 28 


dx. 


f 9x z - 
J x 3 - 


4x 2 + 4x 
5x — 8 


dx. 


x 3 4- 4x 2 4- 8 x 
' 3x 2 — 5x 4- 4 


■ x 2 4- x • 


dx. 

— dx. 


SECTION 10.7 

Calculate the integrals in Problems 161-176 by the method 
of integration by parts. 


161 

163 


b 


x 2 tan -1 x dx. 

J cos (In x) dx. 

165 J x 3 cos x dx. 

In x dx 


162 


t 


x 2 cos x dx. 


164 J x sin 2 x dx. 


167 


169 


(x+ 1) 2 ‘ 
x 3 dx 

Vl + x 2 ‘ 


166 

168 

170 


4- x 2 dx. 
xe x dx 


f (x + l) 2 ' 

J x(x 4- 3) 10 dx. 


171 J e** sin bx dx. 172 J x n In x dx (n ^ — 1). 


173 

175 

176 


f x dx 
J e x ■ 


174 J x 2 sin x dx. 


x 3 e 2x dx. 


J In (x 4- Vx 2 4- a 2 ) dx. 

1 77 Find the area under the curve y = sin Vx from x = 

0 tO X = 7T 2 . ^ *3 

r 

dx by using the 


178 Calculate the integral J 
identity 


vT 


4- x 2 
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—•* 3 - = + * 2 ~ ]) = a/7T75-£— 

VTT7* VTT7' 

Make sure your answer agrees with the result of Prob¬ 
lem 169. 

179 Calculate the integral i 2 Va — x dx (a) by using the 

substitution u = Va — x\ (b) by parts. 

180 Use integration by parts to show that 

[ \/a 2 — x 2 dx = jcV a 2 — x 2 + f — X _ dx. 

J J Va^7 2 

Write x 2 = -(-jc 2 ) = — {a 2 — x 2 — a 2 ) in the nu¬ 
merator of the second integral and thereby obtain the 
formula 

f V<3 2 — jc 2 dx = jjcVa 2 — jc 2 + \a 2 f — ™^ X -= : 

J J V^x 2 

= jxVfl 2 — x 2 + \a 2 sin -1 — + c. 

a 

181 Use the method of Problem 1 80 to obtain the formula 

J ( a 2 — x 2 ) n dx 


x{a 2 — x 2 ) n + 
2 n + 1 


2a 2 n 
In + 1 


J (<2 2 — JC 2 )" 1 c/jC. 


*182 Use the idea of Problem 181 to obtain formula (9) in 
Section 10.6, 

r dx _ 1 x 

J ( a 2 + x 2 ) n 2a\n — 1) (a 2 + x 2 ) n ~ l 

2/1-3 _ f dx 

2a 2 (n — 1) j (a 2 + ^ 2 ) n_ 1 

In the next three problems, derive the given reduction formula 
and apply it to the indicated special case. 

c x m+] (In x~) n 

183 (a) jc«(ln jc)" dx = - - 

J m+ 1 

--— [ x w (ln x)”" 1 dx. 

m + 1 J 

(b) fx 5 (In x) 3 dx. 

184 (a) [ x n e ax dx = — — — [ x n ~ x e ax dx 

J a a J 

(b) J x 3 e~ 2x dx. 

185 (a) [ sec" x dx = —sec" -2 x tan x 

j n - 1 

n — 2 f 

H-— sec" 1 x dx. 

n - 1 J 

(b) f sec 3 x dx (see Problem 29 in Section 10.3). 


APPENDIX 1: THE 
CATENARY, OR CURVE 
OF A HANGING CHAIN 



As a specific example of the use of the methods of integration discussed in Section 10.4, 
we solve the classical problem of determining the exact shape of the curve assumed by a 
flexible chain (or cable, or rope) of uniform density which is suspended between two 
points and hangs under its own weight. This curve is called a catenary ; from the Latin 
word for chain, catena * 

Let the y-axis pass through the lowest point of the chain (Fig. 10.8), let s be the arc 
length from this point to a variable point (jc, y\ and let wo be the linear density (weight 
per unit length) of the chain. We obtain the differential equation of the catenary from the 
fact that the part of the chain between the lowest point and (jc, y) is in static equilibrium 
under the action of three forces: the tension 7o at the lowest point; the variable tension T 
at (jc, y), which acts in the direction of the tangent because of the flexibility of the chain; 
and a downward force vvos equal to the weight of the chain between these two points. 

Equating the horizontal component of T to 7b and the vertical component of T to the 
weight of the chain gives 

T cos 9 = Tq and T sin 0 = w 0 s, 

and by dividing we eliminate T and get tan 0 = wqs/Tq or 

dy w 0 

—— = as, where a = —. 
dx T 0 


Figure 10.8 


*The catenary problem is also solved in the optional Section 9.7 by using methods depending on hy¬ 
perbolic functions. The solution given here does not depend on these methods and can therefore be 
understood by students who have omitted that optional section. 
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We next eliminate the variable 5 by differentiating with respect lox, 


d 2 y _ ds_ 
dx 2 a dx 



(1) 


This is the differential equation of the catenary. 

We now solve equation (1) by two successive integrations. This process is facilitated 
by introducing the auxiliary variable p = dy/dx , so that (1) becomes 


= a V1 + p 2 . 

dx 


On separating variables and integrating, we get 

I 7^= =fadx. (2) 

1 Vl +p 2 1 

To calculate the integral on the left, we make the trigonometric substitution p = tan </>, so 
that dp = sec 2 4> d4> and Vl -f - p 2 = sec (J). Then 


f 4? f sec 2 4>d4) [ L ,, 

7 = -= sec 4> d4> 

J Vl + p 2 j sec 6 J 


sec 4) 


= In (sec 4) + tan </>) = In (Vl 4- p 2 + p), 

so (2) becomes 

In (Vl + p 2 + p) = ax + cu 
Since p = 0 when x = 0, we see that c\ = 0, so 


In (V1 + p 1 + p) = ax. 

It is easy to solve this equation for p, which yields 

dy 1 

i=p = 1 ( e -- e -i 


and by integrating we obtain 

y = 4~ (c“ + c _ “) + c 2 . 

2a 

If we now place the origin of the coordinate system in Fig. 10.8 at just the right level so 
that y = 1/a when ^ = 0, then C 2 = 0 and our equation takes its final form, 

y = 2^i.e ax +«-«). (3) 

Equation (3) reveals the precise mathematical nature of the catenary and can be used as 
the basis for further investigations of its properties.* 

The problem of finding the true shape of the catenary was proposed by James Bernoulli 
in 1690. Galileo had speculated long before that the curve was a parabola, but Huygens 
had shown in 1646 (at the age of 17), largely by physical reasoning, that this is not cor¬ 
rect, without, however, shedding any light on what the shape might be. Bernoulli’s chal¬ 
lenge produced quick results, for in 1691 Leibniz, Huygens (now aged 62), and James’s 
brother John all published independent solutions of the problem. John Bernoulli was ex- 


*The hyperbolic cosine defined in Section 9.7 enables us to write the function (3) in the form 

y = — cosh ax. 
a 
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ceedingly pleased that he had been successful in solving the problem, while his brother 
James, who proposed it, had failed. The taste of victory was still sweet 27 years later, as 
we see from this passage in a letter John wrote in 1718: 

The efforts of my brother were without success. For my part, I was more fortunate, for 
I found the skill (I say it without boasting; why should I conceal the truth?) to solve it 
in full. ... It is true that it cost me study that robbed me of rest for an entire night. It 
was a great achievement for those days and for the slight age and experience I then had. 
The next morning, filled with joy, I ran to my brother, who was struggling miserably 
with this Gordian knot without getting anywhere, always thinking like Galileo that the 
catenary was a parabola. Stop! Stop! I say to him, don’t torture yourself any more try¬ 
ing to prove the identity of the catenary with the parabola, since it is entirely false. 

However, James evened the score by proving in the same year of 1691 that of all possi¬ 
ble shapes a chain hanging between two fixed points might have, the catenary has the 
lowest center of gravity, and therefore the smallest potential energy. This was a very sig¬ 
nificant discovery, because it was the first hint of the profound idea that in some myste¬ 
rious way the actual configurations of nature are those that minimize potential energy. 


APPENDIX 2: 
WALLIS’S PRODUCT 
7t_2 2 4 466 

‘i'T’i’IT' 


This expression was discovered by the English mathematician John Wallis in 1656 and is 
called Wallis's product. Apart from its intrinsic interest, formula (2) underlies other im¬ 
portant developments in both pure and applied mathematics, so we prove it here. 

If we define /„ by 

Ctt/2 

l n = J o sin ' 7 x dx , 

then ( 1 ) tells us that 

In = ~— ln-2- O) 

n 

It is clear that 

f 77-/2 77 7 t /2 

In = dx = — and I\ = sin x dx — 1 . 
u Jo 2 Jo 

We now distinguish the cases of even and odd subscripts, and use (3) to calculate fa and 
/ 2 / 7 + 1 , as follows: 

r 2/7-1 r 2/7—12/7 — 3, 

hn ~ 2n ~ 2 ~ 2 /t 2 /t" - —~ 2 ~ hn ~ A 

2/7 — 1 2/7 — 3 2/7 — 5 1 i 

2/7 2/7 - 2 2/7 — 4 2 

_ 1 3 5 2/7 — 1 7T 

” TT6 


As an application of integration by parts in Section 10.7, we obtained the following re¬ 
duction formula: 


f sin 77 x dx = —— sin 77 1 x cos x + —-— f sin 77 2 x dx. 

J n n j 


( 1 ) 


This formula leads in an elementary but ingenious way to a very remarkable expression 
for the number 7t/2 as an infinite product, 


_tt = 2 2 4- A_ 6_ 6_ 
2 13355*7 


2/7 


2/7 


2 / 7—1 2/7 + 1 


( 2 ) 


2/7 2 ’ 


( 4 ) 
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and 


hn+\ 


2n _ In In — 2 

2 n + 1 2n-1 “ 2n + 1 2n - 1 /2n_3 


2n 2w-2 2n-4 2_ 

2n + 1'2n — 1 2n — 3 ' ' ' 3 7| 


2 4 6 2n 

3 5 7 2n + 1' 


(5) 


As the next link in the chain of this reasoning, we need the fact that the ratio of these two 
quantities approaches 1 as n —> <*>, 


hn 

hn+\ 


1 . 


( 6 ) 


To establish this, we begin by noticing that on the interval 0 < x < 7t/2 we have 0 < 
sin x < 1, and therefore 


0 < sin 2774 * 2 x < sin 2 " +1 x < sin 2 " x. 


This implies that 


0 < 


r 

Jo 


sin 


2n + 2 


x dx '■ 


Jo 


sin 


2n+\ 


x dx : 


r 

Jo 


sin 2 " x dx , 


or equivalently, 


o < hn + 2 — hn+\ — hn- 


If we divide through by I 2n and use the fact that by (3) we have 


then (7) yields 


This implies that 


hn+2 _ 2/i + 1 

hn 2/1 + 2 ’ 


+ 1 < hn+i < j 

2/2 + 2 I 2n 


hn +1 

hn 


as /i —> , 


(7) 


and this is equivalent to (6). 

The final steps of the argument are as follows. On dividing (5) by (4), we obtain 

hn+ 1 _ 2 2 4 4 6 6 2/i _ 2/i 2 

hn _ TT3*5 *5*7 * * * 2/i - 1*2/1 + 1* tt’ 

so 

j7r = 2 2 4 4 6 6 . . . 2 n 2n ( I 2n \ 

2 1 3 3 5 5*7 “ * 2/i- 1*2/1 + 1 \hn+\) 

On forming the limit as n —> 00 and using (6), we obtain 

2 t _ 111 2/i _ 2/i 

2 “n™ 13 *3 5 5 7 * 2/i - 1* 2/i + 1’ 


and this is what (2) means. 
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We also remark that Wallis’s product (2) is equivalent to the formula 



This is easy to see if we write each number in parentheses on the left in factored form. 
This gives 



111 111 =2l 

2 2 4 4 6 6 t r’ 

which is clearly equivalent to (2). Formula (8) will reappear in Appendix 1 at the end of 
Chapter 13 as a special case of another even more wonderful formula.* 


APPENDIX 3 HOW 
LEIBNIZ DISCOVERED 
HIS FORMULA 


The area of the quarter-circle of radius 1 shown in Fig. 10.9 is obviously 7r/4. We follow 
Liebniz and calculate this area in a different way. The part that we actually calculate is 
the area A of the circular segment cut off by the chord OT, because the remainder of the 
quarter-circle is clearly an isosceles right triangle of area 

We obtain the stated area A by integrating the sliverlike elements of area OPQ , where 
the arc PQ is considered to be so small that it is virtually straight. We think of OPQ as a 
triangle whose base is the segment PQ of length ds and whose height is the perpendicu¬ 
lar distance OR from the vertex O to the base PQ extended. The two similar right trian¬ 
gles in the figure tell us that 


i /OC 

^T = — or OR ds = OS dx , 

dx OR 

so the area dA of OPQ is 

dA = \OR ds = \OS dx = jy dx, 


"Wallis was Savilian Professor of Geometry at Oxford for 54 years, from 1649 until his death in 1703 
at the age of 87, and played an important part in forming the climate of thought in which Newton 
flourished. He introduced negative and fractional exponents as well as the now-standard symbol «> 
for infinity, and was the first to treat conic sections as plane curves of the second degree. His infi¬ 
nite product stimulated his friend Lord Brouncker (first president of the Royal Society) to discover 
the astonishing formula 


4_ 

7 T 


2 + • 


2 +- 


2 +- 


2 + 


9 2 


2 + ■ 


from which the theory of continued fractions later arose. [No one knows how Brouncker made this 
discovery, but a proof based on the work of Euler in the next century is given in the chapter on 
Brouncker in J. L. Coolidge’s The Mathematics of Great Amateurs (Oxford University Press, 1949).] 
Among the activities of Wallis’s later years was a lively quarrel with the famous philosopher Hobbes, 
who was under the impression that he had succeeded in squaring the circle and published his erro¬ 
neous proof. Wallis promptly refuted it, but Hobbes was both arrogant and too ignorant to under¬ 
stand the refutation, and defended himself with a barrage of additional errors, as if a question about 
the validity of a mathematical proof could be settled by rhetoric and invective. 
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where y denotes the length of the segment OS. The element of area dA sweeps across the 
circular segment in question as x increases from 0 to 1, so 




and integrating by parts in order to reverse the roles of x and y gives 
, 1 

where the limits on the two integrals are understood to be y = 0 and y = 1. To continue 
the calculation, we observe that since 

y = tan y</> and x = 1 — cos <f> = 2 sin 2 y</>, 


Hf**-HL* 


( 1 ) 


the trigonometric identity 



tan 2 y</> 


sin 2 y<£ 
cos 2 \<j) 


= sin 2 y</> sec 2 y</> = sin 2 y</>(l + tan 2 y</>) 


yields 


2 1 + y 2 ’ 

The version of the geometric series given in formula (13) in Section 9.5 enables us to 
write this as 


y = y 2 (l — y 2 + y 4 - y 6 + • • ■) = y 2 - y 4 + y 6 - y 8 + • ■ ■, 

so (1) becomes 


A = j - f 0 ‘ (y 2 -y A + y 6 - y 8 + ■ ■ ■) dy 



2 3 + 5 7 + 9 ' “ ' 

When y is added to this to account for the area of the isosceles right triangle, and the re¬ 
sult is equated to the known area 7 t/ 4 of the quarter-circle, we have Leibniz’s formula 



Is it any wonder that he took great pleasure and pride in this discovery for the rest of his 
life? 
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FURTHER 
APPLICATIONS OF 
INTEGRATION 


11.1 

THE CENTER OF MASS 
OF A DISCRETE 
SYSTEM 


V—d 2 



Figure 11.1 


Most of the ideas in this chapter are based on the simple physical concept of cen¬ 
ter of gravity. As we shall see, this concept has implications for geometry, and 
it turns out to be possible to use it to arrive at a reasonable notion of what ought 
to be meant by the “center” of a general geometric figure. In this introductory 
section we confine ourselves to describing the concepts, and make no use of in¬ 
tegration. 

We begin by considering two children of weights w\ and w 2 sitting at distances 
d\ and d 2 from the fulcrum of a seesaw (Fig. 11.1). As we know, each child can 
increase the tendency of his or her weight to turn one end down by moving far¬ 
ther out from the fulcrum, and the seesaw balances when 

W\d\ = W 2 d 2 . ( 1 ) 

This principle was discovered by Archimedes, and is known as the law of the 
lever. If we establish a horizontal x-axis with its origin at the fulcrum and the 
positive direction to the right, then (1) can be written in the form 

2 

w\X\ + w 2 x 2 = 0 or V w k x k = 0, 
k= 1 


w 4 w 2 Wj W 3 

0 X 2 ^ x x x^ 

Figure 11.2 


where x\ = d\ and x 2 = ~d 2 . 

We now extend this discussion by considering the x-axis as a weightless hor¬ 
izontal rod that pivots at the point p, as shown in Fig. 11.2, and we assume that 
n weights w* are placed at points x^, where k = 1, 2, . . . , n. By Archimedes’ 
law, this system of weights will exactly balance, or be in equilibrium about p, if 

X - P) = 0 . 


More generally, whether this system is in equilibrium or not, the sum - p) 

measures the tendency of the system to turn in a clockwise direction about the 
pivot point p. This sum is called the moment of the system about p, and the sys¬ 
tem is in equilibrium if this moment is zero. If the weights w* and their positions 
Xk are given in some arbitrary manner, and if we are free to move the pivot point 
p, then it is easy to find a point p = ~x at which the system will balance, that is, 
with the property that the moment of the system about x is zero. The required 
condition is 


384 


X Wk(*k - x) = 0. 
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This is equivalent to 


X W k x k ~ X WkX = 0 0r X X Wk = X WkXk ’ 


SO 


_ _ Zw k x k 
X • 


( 2 ) 


This balancing point x is called the center of gravity of the given system of 
weights. 

We now recall that the weight of a body at the surface of the earth is simply 
the force exerted on the body by the gravitational attraction of the earth, and is 
therefore given by Newton’s formula F = mg, where m is the mass of the body 
and g is the acceleration due to gravity (approximately 32 feet per second per 
second or 9.80 meters per second per second). In the above discussion this means 
that w k = m k g, where m k is the mass of the kth body. Formula (2) can therefore 
be written as 


_ = 'Zm k gx k _ 'Zm k x k 
X 2 m k g 2m* 


(3) 


With the influence of gravity removed from the discussion in this way, and the 
weights w k in (2) replaced by the masses m k in (3), it is customary to call the 
point x the center of mass of the system. 

It is easy to extend these ideas to a two-dimensional system of masses m k lo¬ 
cated at points (x k , y k ) in a horizontal jcy-plane (Fig. 11.3). We define the moment 
of this system about the y-axis by 


My = X mkX b 


(4) 


which is the sum of each of the masses multiplied by its signed distance from 
the y-axis. If we think of the xy-plane as a weightless horizontal tray, as sug¬ 
gested by the figure, then in physical language the condition M y = 0 means that 
this tray—with the given distribution of masses—will balance if it rests on a 
knife-edge along the y-axis. Similarly, the moment of the system about the jc-axis 
is defined by 


A** = X m & k - 


(5) 


Students should carefully observe the interchange of jc’s and y’s in formulas (4) 
and (5); to compute M y we use the jc*’s, and to compute M x we use the y*’s. If 
we denote the total mass of all the particles in the system by m, so that 




y 


Figure 11.3 
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11.2 

CENTROIDS 


then the center of mass of the system is defined to be the point (x, y\ where 


_ _ _ M, 

'Im/t m 


(6) 


and 


_ _ X m k y k _ 

y m ' 


(7) 


The center of mass of our system can be interpreted in two ways. First, if equa¬ 
tions (6) and (7) are written in the form 

nix = M y and my = M x , 

then we see that (3c, y) is the point at which the entire mass m of the system can 
be concentrated without changing the total moment about either axis. The sec¬ 
ond interpretation depends on writing (6) and (7) in the form 

X "**(** “ x) = o and X “ y) = 0. 

If we think of our system in the way described, as a distribution of masses on a 
weightless horizontal tray, then these equations tell us that the tray will balance 
if it rests on a knife-edge along the line x = 3c parallel to the y-axis, and also 
along the line y = y parallel to the x-axis. These conditions imply that the tray 
will balance if it rests on a knife-edge along any line through (3c, y). It will there¬ 
fore also balance if supported by a sharp nail precisely at the point (3c, y). 

In the preceding discussion, the xy-coordinate system in Fig. 11.3 provides a 
frame of reference that is useful for developing the ideas. However, it is clear 
from the physical meaning of the center of mass that the location of this point is 
determined by the masses themselves and their individual positions, and does not 
depend on the particular coordinate system that is used to describe these posi¬ 
tions. As a practical consequence, this fact tells us that in any specific situation 
we are free to choose any coordinate system that seems convenient under the cir¬ 
cumstances. 


Remark The “center of population” of the United States has been described as 
the point at which a life-sized flat map of the whole country would balance on 
a pin if all Americans weighed the same. The location of this point has been cal¬ 
culated from the data in each census. In 1790 it was a few miles east of Balti¬ 
more. It has been moving westward ever since; in 1980 it crossed the Mississippi 
River; and in 1990 it was about 25 mi southwest of St. Louis (see Fig. 11.4). It 
is interesting to speculate on what changes in the position of this point would be 
produced by “weighing” Americans according to age, or wealth, or education, 
instead of treating them as interchangeable units. 


The ideas discussed in Section 11.1 apply to discrete systems of particles located 
at a finite number of points in a plane. We now consider how integration can be 
used to generalize these ideas to a continuous distribution of mass throughout a 
region R in the xy-plane, as shown in Fig. 11.5. 

We shall think of R as a thin sheet of homogeneous material—say, a uniform 
metal plate—whose density 8 (= mass per unit area) is constant. To define the 




11.2 CENTROIDS 


387 



Figure 11.4 Center of population, 1790 to 1990. {Courtesy U.S. Bureau of the Census .) 


moment of this plate about the y-axis, we consider a thin vertical strip of height 
f(x) and width dx , whose position in the region is specified by the variable x (Fig. 
11.5, left). The area of this strip is f(x) dx and its mass is 8f(x) dx ; and since all 
of its mass is essentially at the same distance x from the y-axis, its moment about 
this axis is x8f(x) dx. The total moment of the plate about the y-axis is therefore 
obtained by allowing the strip to sweep across the region, and by integrating— 
or adding together—all these small contributions to the moment as x increases 
from a to b. 


M y = [ x8f(x) dx. (1) 

This formula can be derived by laboriously constructing approximating sums and 
then forming the limit of these sums, which leads to (1) by means of the defin¬ 
ition of the integral. However, we prefer to continue in the spirit of Chapter 7, 
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Figure 11.5 
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and the preceding discussion provides yet another illustration of the power of the 
Leibnizian approach to integration as we described it in Section 7.1. 

Similarly, the moment of the plate about the x-axis is obtained by considering 
a thin horizontal strip of length g(y) and width dy (Fig. 11.5, right), and is given 
by the formula 

M x = | d ySg(y ) dy. 

The total mass of the plate can evidently be expressed in two ways, 
m = \ a <5/U) dx = | 8g{y) dy. 

The center of mass (x, y) of the plate is now defined by 

_ [xSf(x) d.x 

[ Sf(x) dx m 

Ja 


and 


[ ySg(y) dy 
l &g(y) dy 


m ' 


The reader should observe particularly that these formulas generalize (6) and (7) 
in the preceding section from the discrete case to the continuous case. Also, from 
the point of view of geometry they have the following remarkable feature. Since 
the density 8 is assumed to be constant, it can be factored out of the integrals 
and removed by cancellation, and the formulas for x and y become 


x = 



\ yg(y) dy 

and y = - 

l g(y) dy 


( 2 ) 


Each denominator here is clearly the total area of the region, and the numerators 
can be thought of as the moments of this area about the y-axis and x-axis, re¬ 
spectively. The center of mass is therefore determined solely by the geometric 
configuration of the region R and does not depend on the density of any mass 
that this region may contain, as long as this density is constant. For this reason 
the point (x, y) is called the centroid of the region, meaning “point like a cen¬ 
ter.” The examples and problems that follow will make it clear that this termi¬ 
nology is well suited to the geometric concept it is meant to describe. 

It will be convenient for our work in the next section if we simplify formulas 
(2) even further. In the case of x, the area of the thin vertical strip is an element 
of area in the sense of Sections 7.1 and 7.2, so we write it as dA =/(x) dx\ and 
in the case of y, we similarly have dA = g(y) dy for the area of the thin hori¬ 
zontal strip. Formulas (2) can therefore be written in the streamlined form 


fx dA 
/ dA 


y = 


_ SydA 
S dA ' 


x = 


and 


(3) 
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We emphasize that each dA in these formulas is understood to be the area of a 
thin strip parallel to the appropriate axis, in order to guarantee that all points in 
the strip will be essentially at the same distance from this axis. It is also under¬ 
stood here that the process of integration expressed by these symbols is extended 
over the region under discussion. The limits of integration are omitted deliber¬ 
ately, and don’t really need to be written down unless we are performing actual 
calculations in a specific case. 

Example 1 Find the centroid of a rectangle. 

Solution If the rectangle has height h and base b, then we can place the coor¬ 
dinate system so that the origin is at the lower left corner and the point ( b , h) is 
at the upper right corner, as shown in Fig. 11.6. Since the area of this rectangle 
is hb, we have 


f 

Jo 


h dx 


hb 


hb 


hx 2 


dA = h dx 


>(jb,\h) 


(b , li ) 


dx 


Figure 11.6 


hb 



J_ 

2 


b. 


In just the same way we find that y = \h , so the centroid is the point (\b, \h), 
which is clearly the center of the rectangle. 


In general, it appears that the centroid of a region must lie on a line of sym¬ 
metry of the region, if such a line exists. This is easily seen to be true, as fol¬ 
lows. If L is a line of symmetry of a region R , then we can choose this line to 
be the y-axis (Fig. 11.7), and we wish to convince ourselves that x = 0. If dA is 
an arbitrary thin vertical element of area at position x, then by symmetry there 
is a corresponding element of area at position —x; and since x dA + (—jc) dA = 
0, we have 

f x dA = 0, and therefore x — ^ f = 0. 

J J dA 



Further, if a region has two distinct lines of symmetry, then the conclusion we 
have just reached tells us that the centroid must lie on both lines and is therefore 
the point of intersection of these lines. Accordingly, in every case where a geo¬ 
metric figure has a “center” in the usual sense of the word, this center is the cen¬ 
troid. However, as our next example shows, centroids are easily calculated for 
many regions that are not ordinarily considered to have centers at all. From this 
point of view, the centroid of a region is a far-reaching generalization of the con¬ 
cept of the center of a geometric figure. 

Example 2 Find the centroid of the region in the first quadrant bounded by the 
axes and the curve y = 4 -x 2 (Fig. 11.8). 


Solution 
gion is 


By using the vertical strip in the figure, we see that the area of the re- 


A = J dA = f () (4 ~ x 2 ) dx = 


4x 


- r 3 


16 

3 ’ 



o 


Figure 11.8 
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SO 


X = 

Similarly, using 


jxdA = _3_ f 2 
A _ 16 Jo 


x{4 — x 2 ) dx 



2 

4' 


a horizontal strip not shown in the figure, we have 


- fydA 3 a/ ^— 

y = — = -j o y^-> d y- 


To evaluate this integral we make the substitution u = 4 — y, so that y = 4 — u 
and dy = — du, and we also change the limits of integration from y = 0, 4 to 
u = 4, 0: 

y = ( 0 yV4-y dy = f° u 1/2 (4 - u)(~du) 

= [ (4w 1/2 - w 3/2 ) du = w 3/2 — u 512 

16 Jo 16 [3 5 Jo 

_ J_ / 64 _ 64 \ = 2 

16 V 3 5 ) 5' 


The integration here is a bit complicated because it uses a horizontal strip, and 
this forces us to solve the equation of the curve for x in terms of y. We therefore 
describe an alternative method for computing y that uses the vertical strip shown 
in the figure and the result of Example 1. Since the centroid of this rectangular 
strip is located at its center, the moment of the strip about the x-axis is \y dA = 
y y 1 dx , and therefore 


y = 


f {-y 2 dx 
A 


_3_ 

32 

_3_ 

32 



- 8* 2 + x 4 ) dx 


[ 32 _M + 32l 
3 5 J 


2 

5’ 


as before. 



One more word about centroids. We have discussed centroids of plane regions. 
We can just as easily speak of the centroid of an arc in the xy -plane or of a re¬ 
gion in three-dimensional space. The definitions and formulas are so similar to 
what we have already done that we won’t burden students with detailed expla¬ 
nations. However, we do remark that in finding the centroid of an arc (Fig. 11.9) 
it may be helpful to think of the arc as a piece of curved wire of constant den¬ 
sity 1 (= mass per unit length), so that the mass of a portion of the wire is sim¬ 
ply its length. With ds understood to be the element of arc length in the sense of 
Section 7.5, we therefore have 


fx ds 

X ~Jdi 


and 



(4) 


Each denominator here is the total length of the arc, and the numerators are the 
moments of the arc about the y-axis and x-axis, respectively. 


Figure 11.9 
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PROBLEMS 

Find the centroid of the plane region R that is bounded by: 

1 y = x 2 , y = 0, x= 2. 

2 y — 4x — x 2 and y = x. 

3 y = Va 2 — x 2 and y = 0. 

4 y = sin jc and y = 0 (0 < x < 7r). 

5 x 2 = ay and y = a. 

6 x 2 = ay and y 2 = a.r. 

7 y = >^jc, y = 0, jc = 8. 

8 jc 2 + y 2 = a 2 and jc + y = a (first quadrant). 

9 jc 2 + y 2 = a 2 , x = a, y = a. 

10 y = 1/jc, y = 0, jc = 1, and jc = 2. 

11 Find the centroid of the first-quadrant part of the curve 

x 2/3 + y2/3 = a 2!3 

12 Find the centroid of the semicircular arc y = Va 2 - jc 2 . 

13 The semicircle under y = vb 2 — x 2 is removed from the 
semicircle under y = Va 2 — jc 2 , where b < a. Find the 
centroid of the remaining region. Find the limit of y as 
b —► a, and compare with the result of Problem 12. 

14 Let y = /(jc) be a nonnegative function defined on the in¬ 
terval a < jc < b. If the region bounded by this curve, the 
jc-axis, and the lines jc = a, jc = b is revolved about the 


jc-axis, show that the resulting solid of revolution has its 
centroid on the jc-axis with 

f xf(x) 2 dx 

_ Ja 

P/M 2 dx 

Ja 

15 Use the result of Problem 14 to find the centroid of (a) 
a cone with height h and radius of base r; and (b) a hemi¬ 
sphere of radius a. 

16 It is known from elementary geometry that the three me¬ 
dians of a triangle intersect at a point that is two-thirds 
of the way from each vertex to the midpoint of the op¬ 
posite side.* Show that this point is the centroid of the 
triangle. Hint: Place the axes so that the vertices are 
(0, 0), ( a , 0), and (b, c). It suffices to find the centroid 
(3c, y) and show that this point lies on the median from 
(b, c). Why is this so? Why does physical intuition tell 
us to expect this result? 

*See Problem 35 in Section 1.2. 


Two beautiful geometric theorems connecting centroids with solids and surfaces 
of revolution were discovered in the fourth century A.D. by Pappus of Alexan¬ 
dria, the last of the great Greek mathematicians. 

-—-- OF PAPPUS 

First Theorem of Pappus Consider a plane region that lies completely on one side 
of a line in its plane. If this region is revolved about the line as an axis, then the vol¬ 
ume of the solid generated in this way equals the product of the area of the region and 
the distance traveled around the axis by its centroid. 


11.3 
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This is easily proved by the following argument. Let the axis of revolution be 
the jc-axis, as shown in Fig. 11.10. Then the distance y of the centroid from this 


axis is defined by 

y> 

- _ />’ dA _ J> dA 
y f dA A ' 




which is equivalent to 





Ay = J y dA 

y 

\y 

I 


or 



1 

1 

1 

X 


A • 27ry = J 2ny dA. 


Figure 11.10 
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All that is needed now is to observe that this equation is precisely the assertion 
of the theorem, because 2iry is the distance traveled by the centroid and the in¬ 
tegral on the right is the volume of the solid as calculated by the shell method 



Example 1 Find the volume of the torus (doughnut) generated by revolving a 
circle of radius a about a line in its plane at a distance b from its center, where 
b> a (Fig. 11.11). 

Solution The centroid of the circle is its center, and this travels a distance 2irb 
around the axis. The area of the circle is va 2 , so by the first theorem of Pappus 
the volume of the torus is 

V = 7 ret 2 • 2 ub = 27T 2 a 2 b. 


(See Problem 31 in Section 10.4.) 

Figure 11.11 - 


Second Theorem of Pappus Consider an arc of a plane curve that lies completely 
on one side of a line in its plane. If this arc is revolved about the line as an axis, then 
the area of the surface generated in this way equals the product of the length of the arc 
and the distance traveled around the axis by its centroid. 



The proof is similar to that given above. Again we take the axis to be the 
jc-axis (Fig. 11.12), and we start with the definition of the distance y from this 
axis to the centroid of the arc, 

- _ h ds _ Jy ds 
} f ds s' 

which is equivalent to 

sy = J y ds 


or 


s ■ 2iry = J 2iry ds. 

And again this is exactly the assertion of the theorem, because the integral on 
the right is the area of the surface of revolution. 

Example 2 With the aid of this theorem it is easy to see that the surface area of 
the torus described in Example 1 is 

A = 27 Ta • 27 jb = AiP-ab. 


Apart from their aesthetic appeal, the theorems of Pappus are useful in two 
ways. When centroids are known from symmetry considerations—as in the ex¬ 
amples— we can use these theorems to find volumes and areas. And also, when 
volumes and areas are known, we can often use these theorems in reverse to de¬ 
termine the locations of centroids. Both types of applications are illustrated in 
the following problems. 
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PROBLEMS 

1 Use the known formulas V = jna 3 and A = 4na 2 for the 
volume and surface area of a sphere of radius a to locate 
the centroid of (a) the semicircular region under y = 
va 2 - x 2 \ (b) the arcy = \a 2 — x 2 . Compare with Prob¬ 
lems 3 and 12 in Section 11.2. 

2 Use the centroids found in Problem 1 to find the volume 
and surface area generated when the semicircular region 
and the arc are revolved about the line y = a. 

3 By Problem 10 in Section 7.5, the total length of the curve 
x 2 ' 3 + y 2/3 = a 2/3 is6a. Use this fact and the result of Prob¬ 
lem 11 in Section 11.2 to find the area of the surface gen¬ 
erated by revolving this curve about (a) the x-axis; (b) the 
line x + y = a. 

4 A square with side a is revolved about an axis lying in its 
plane which intersects it at one of its vertices but at no 
other points. What should be the position of the axis to 
yield the largest volume for the resulting solid of revolu¬ 


tion? What is this largest volume? What is the corre¬ 
sponding surface area? 

5 A regular hexagon with side a is revolved about one of its 
sides. What is the volume of the resulting solid of revo¬ 
lution? What is the area of the surface of this solid? 

6 The regular hexagon in Problem 5 is revolved about an 
axis through a vertex which is perpendicular to the line 
from the center to that vertex. Find the volume and sur¬ 
face area of the resulting solid of revolution. 

7 Use Pappus’ first theorem to find (a) the volume of a cylin¬ 
der with height h and radius of base r\ (b) the volume of 
a cone with height h and radius of base r. 

8 It is known from elementary geometry that irrL is the lat¬ 
eral area of a cone of base radius rand slant height L. Ob¬ 
tain this formula as a consequence of Pappus’ second the¬ 
orem. 


Consider a rigid body rotating about a fixed axis. For example, the body might 
be a solid sphere spinning about a diameter, or a solid cube swinging back and 
forth like a pendulum about a horizontal axis along one of its edges. In order to 
study motions of this kind, it is necessary to introduce the concept of the mo¬ 
ment of inertia of the body about the axis. Our purpose in the next few para¬ 
graphs is not only to define this concept, but also to explain its intuitive mean¬ 
ing so that students can understand why it matters. 

When a rigid body moves in a straight line, all its constituent particles move 
in the same direction with the same velocity. On the other hand, when a rigid 
body rotates about an axis, its constituent particles move around circles of dif¬ 
ferent sizes and have different velocities, and for this reason we expect the prob¬ 
lem of describing the body’s motion to be more difficult. Fortunately, however, 
this situation is simpler than it seems, and it turns out to be possible to study ro¬ 
tating bodies by using ideas and formulas that are completely analogous to those 
already familiar for the case of linear motion. 

We begin with a brief review of the linear formulas. Consider a particle of 
mass m moving in a straight line (Fig. 11.13). If its position is given by the vari¬ 
able s, then v — ds/dt and a = dv/dt are its velocity and acceleration. A force F 
acting on the particle is related to the acceleration by Newton’s second law of 
motion. 
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F = ma or a = — F. 

m 


(1) 


The second form of this equation is useful for its clear expression of the idea that 
the acceleration of the particle is caused by the force and is proportional to this 
force. This form also helps us think of the mass m of the particle as a measure 
of its capacity to resist acceleration, because if the force F is the same and m in¬ 
creases, then a decreases. 

Now consider a particle of mass m rotating around a fixed axis in a circle of 
radius r (Fig. 11.14). If its angular position is given by the angle 9 as measured 



Figure 11.14 
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from some fixed direction, then co = dO/dt and a = dcoldt are its angular teloc - 
ity and angular acceleration. These rotational quantities are related to the corre¬ 
sponding linear quantities v, and a, as measured along the circular path, by 
means of the equations s = r6, v — no, and a = ra. The twisting effect of the 
tangential force F is measured by its torque T = Fr, which is the product of the 
force and the distance from its line of action to the axis. We have seen that force 
produces linear acceleration in accordance with equation (1). In just the same 
way, torque produces angular acceleration in accordance with the corresponding 
equation 

T = la, (2) 

where the constant of proportionality I is called the moment of inertia. I can be 
thought of as a measure of the capacity of the system to resist angular accelera¬ 
tion, and in this sense it is the rotational analog of mass. 

These remarks describe the conceptual role of the moment of inertia. To dis¬ 
cover what its definition must be in order to fit it for this role, we transform (2) 
by replacing T by Fr and a by a/r, and then we replace F by ma: 

a a 

Fr = I- mar = I - 
r r 

The last equation tells us that I must be defined by the formula 

I = mr 2 . (3) 

In this section we are mainly concerned with learning how to use integration to 
calculate the moment of inertia, about a given axis, of a uniform thin sheet of 
material of constant density 8 (= mass per unit area). It may be helpful to think 
of such a sheet as a thin plate of homogeneous metal. Our method is to imagine 
the plate divided into a large number of small pieces in such a way that each 
piece can be treated as a particle to which formula (3) can be applied. We then 
find the total moment of inertia by integrating—or adding together—the indi¬ 
vidual moments of inertia of all these pieces. 



Figure 11.15 


Example 1 A uniform thin rectangular plate has sides a and b and density 8. 
Find its moment of inertia about an axis that bisects the two sides of length a 
(Fig. 11.15). 


Solution Introduce coordinate axes as indicated in the figure, with the y-axis as 
the axis of rotation. We concentrate our attention on the thin vertical strip shown 
in the figure because all of its points are essentially at the same distance x from 
the axis of rotation. The moment of inertia of the slrip about the axis is x 2 • 8b dx, 
so the total moment of inertia of the plate is 


fa/2 „ 

r 1 si 

— jr 

x 2 '8b dx — 8b 

J-a/2 

3 



= 72 &3 *- 


(4) 


It is customary to write the moment of inertia in a form that displays the total 
mass M. In this case M = dab, so 


/ = jiMa 2 . 
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Two thin vertical strips that are symmetrically placed with respect to the axis of 
rotation have the same moment of inertia. In equation (4) we could therefore have 
written the integral in the form 



which makes possible a slightly simpler calculation. 


Example 2 A uniform thin circular plate has radius a and mass M. Find its mo¬ 
ment of inertia about a diameter. 

Solution Introduce coordinate axes as shown in Fig. 11.16. If the density of the 
plate is denoted by 8 , then the moment of inertia of the indicated strip about the 
y-axis is x 2 • <52y dx = x 2 • 82Va 2 — x 2 dx, so the total moment of inertia is 



To evaluate this integral we make the trigonometric substitution x = a sin 6 , so 
that dx = a cos 6 dO and 



(As always, students should verify the omitted details of this calculation for them¬ 
selves.) 


Example 3 Find the moment of inertia of the circular plate in Example 2 about 
an axis through the center and perpendicular to the plate. 

Solution This time the axis is to be imagined as protruding out of the page from 
the center of the circle (Fig. 11.17), and we divide the area into thin rings with 
centers at the center of the circle, as shown. The total moment of inertia is there¬ 
fore 



Remark 1 We recall that a particle of mass m moving with velocity v has ki¬ 
netic energy given by the formula 


K.E. = \mv 2 , 


and also that this energy is the amount of work that must be done on the parti¬ 
cle to bring it to a stop. On the other hand, if the particle rotates in a circle of 
radius r, then v= ra) and we have 


y 



Figure 11.17 


K.E. = \mr 2 a) 2 = \lco 2 , 


and again this is the work required to stop the rotating particle. By comparing 
these formulas we reinforce the idea that the moment of inertia 7 plays the same 
role in rotational motion as is played by the mass m in linear motion. 
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Remark 2 In addition to its importance in connection with the physics of ro¬ 
tating bodies, the moment of inertia also has significant applications in structural 
engineering, where it is found that the stiffness of a beam is proportional to the 
moment of inertia of a cross section of the beam about a horizontal axis through 
its centroid. This fact is exploited in the design of the familiar steel girders called 
“I-beams,” where flanges at the top and bottom of the beam—as in the letter I 
—increase the moment of inertia and hence the stiffness of the beam. 


PROBLEMS 


1 A uniform thin rectangular plate of mass M has sides a 
and b. Find its moment of inertia about one of the sides 
of length b. 

2 A uniform thin plate of mass M is bounded by the curve 
y = cos jc and the*-axis between * = -it12 and x = 7t/2. 
Find its moment of inertia about the y-axis. 

3 Find the moment of inertia of a uniform thin triangular 
plate of mass M, height h, and base b about its base. 

4 Find the moment of inertia of the triangular plate in Prob¬ 
lem 3 about an axis parallel to its base and passing 
through the opposite vertex. 

5 A uniform thin circular plate has radius a and mass M. 
Find its moment of inertia about an axis tangent to the 
plate. 

6 Find the moment of inertia of a uniform straight wire of 
mass M and length a about an axis perpendicular to the 
wire at one end. 


7 A uniform wire of mass M is bent into a circle of radius 
a. Find its moment of inertia about a diameter. 

8 Find the moment of inertia of a uniform solid cylinder 
of mass M, height h, and radius a about its axis. Hint: 
Use the shell method. 

9 Find the moment of inertia of a uniform solid cone of 
mass M, height h, and radius of base a about its axis. 

10 Find the moment of inertia of a uniform solid sphere of 
mass M and radius a about a diameter. 

11 If the moment of inertia of a body of mass M about a 
given axis is / = Mr 2 , then the number r is called the ra¬ 
dius of gyration of the body about that axis. This is the 
distance from the axis at which all the mass of the body 
could be concentrated at a single point without changing 
its moment of inertia. Referring to Problems 8-10, find 
the radius of gyration about the indicated axis of (a) the 
cylinder; (b) the cone; (c) the sphere. 


CHAPTER 11 REVIEW: DEFINITIONS, CONCEPTS 

Think through the following. 4 Pappus’ theorems. 

1 Moment about an axis. 5 Moment of inertia. 

2 Center of mass. 

3 Centroid. 


ADDITIONAL PROBLEMS FOR CHAPTER 11 

SECTION 11.1 

1 Consider the plane distribution of particles whose cen¬ 
ter of mass (x, y) is defined by equations (6) and (7) in 
Section 11.1. If Ax 4- By + C = 0 is any line in the 
plane, then we may suppose (introducing a factor if nec¬ 
essary) that A 2 + B 2 = 1; and by Additional Problem 
21 in Chapter 1 we see that the signed distance from 
this line to (x k , y k ) is 

d k = Ax k + By k + C, 

this being positive on one side of the line and negative 
on the other. 

(a) Show that the entire mass m = 2 m k of the system 


can be concentrated at the center of mass ( x , y) with¬ 
out changing the total moment 2 m k d k about the ar¬ 
bitrary line. 

(b) Use part (a) to show that the total moment is zero 
about every line through (x, y). 

2 Consider again the plane distribution of particles dis¬ 
cussed in Problem 1. 

(a) If the axes are translated as shown in Fig. 11.18, 
then the old coordinates and the new coordinates of 
a fixed point P are connected by the transformation 
equations 


x = X + a, 


y=Y+b. 
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Calculate the center of mass in the new coordinate 
system, and show that it is the same point as before, 

(b) If the axes are rotated through an angle 0 as shown 
in Fig. 11.19, then the old coordinates and the new 



coordinates of a fixed point P are connected by the 
transformation equations 

x = X cos 6 — Y sin 6, y = X sin 6 + Y cos 6. 

Show that the center of mass as calculated in the 
new coordinate system is the same point as before, 

(c) Deduce that the location of the center of mass is in¬ 
dependent of the coordinate system that is used to 
calculate it. 

SECTION 11.2 

3 Find the centroid of the plane region R that is bounded 
by 

(a) y = x 2 and y = x; 

(b) y = lx 2 and y = x 2 + 1; 

(c) y = 2 x — x 2 and y = 0; 

(d) y = x - x 4 and y = 0; 

(e) y 3 = x 2 and y = 2; 

(f) y = x 3 and y = 4x (x > 0); 

(g) y = e x ,y = -e x , x = 0, x = 1. 

*4 Find the centroid of the part of the curve y = x 2 that 
lies between x = 0 and x = b. 

SECTION 11.3 

5 Consider a rectangle with height 2 a and base 2b placed 
in thexy-plane with its sides parallel to the axes and its 


center at the point (0, c), where c > V a 2 + b 2 . If this 
rectangle is rotated counterclockwise through an angle 
0 about the point (0, c) and then revolved about the 
x-axis, show that the volume and surface area of the re¬ 
sulting solid of revolution are the same for all values of 
9. What are they? 

6 A regular hexagon inscribed in the circle x 2 + y 2 = 1 
has one of its vertices at the point (1, 0). If this hexa¬ 
gon is revolved about the line 3x + 4y = 25, find the 
volume and surface area of the resulting solid of revo¬ 
lution. 

SECTION 1 1.4 

7 Show that the moment of inertia of a uniform thin plate 
in the xy-plane about an axis perpendicular to this plane 
at the origin is equal to the sum of its moments of in¬ 
ertia about the two coordinate axes. Use this fact to find 
the moment of inertia of a uniform thin square plate of 
mass M and side a about an axis through its center and 
perpendicular to its plane. 

8 A uniform thin plate of mass M has the curve 



as its boundary. Use the method of Problem 7 to find 
its moment of inertia about an axis through the origin 
and perpendicular to its plane. 

9 Consider a uniform thin plate of mass M in the xy-plane. 
Let / be its moment of inertia about a line L in this 
plane, and let 7 0 be its moment of inertia about a par¬ 
allel line Lq through the centroid. Show that 

/ = I 0 + Md 2 , 

where d is the distance between L and L 0 (this is called 
the parallel axis theorem). Hint: Place the coordinate 
system so that Lq is the y-axis and L is the line x = d. 

*10 Consider a uniform solid body of mass M in three-di¬ 
mensional space. Let / be its moment of inertia about 
a line L, and let lo be its moment of inertia about a par¬ 
allel line Lo through the centroid. Then the parallel axis 
theorem stated in Problem 9 holds in exactly the same 
form: 

I = Iq + Md 2 , 

where d is the distance between L and Lo. Establish this 
fact, and apply it to find the moment of inertia of (a) a 
uniform solid sphere of mass M and radius a about a 
tangent; (b) a uniform solid cube of mass M and edge 
a about an edge. Hint: See Problem 7. 
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12.1 

INTRODUCTION. THE 
MEAN VALUE 
THEOREM REVISITED 


INDETERMINATE 
FORMS AND 
IMPROPER 
INTEGRALS 


In the next few chapters we will need better methods for computing limits than 
any we have available now. Accordingly, our main purposes in the first part of 
this chapter are to understand the types of limit problems that lie ahead and 
to acquire the tools that will enable us to solve these problems with maximum 
efficiency. 

In Section 2.5 we saw that the limit of a quotient is the quotient of the limits, 
in the following sense: If 

lim f(x) = L and lim g{x) = A/, 


then 


i- M. = A 

™ g(x) M y 


(1) 


provided that M =£ 0. Unfortunately, however, it is a fact of life that many of the 
most important limits are of the form (1) in which both L = 0 and M = 0. When 
this happens, formula (1) is useless for calculating the value of the limit and this 
limit is said to have the indeterminate form 0/0 at x = a. The expression “inde¬ 
terminate form” is used because in this case the limit on the left of (1) may very 
well exist, but nothing can be concluded about its value without further investi¬ 
gation. This is shown by the four examples 

x x 2 x x sin \lx 

i i» » 

XXX* X 


each of which is a quotient of two functions that both approach zero as x—> 0. 
We see from these examples—by canceling x’s from the numerators and de¬ 
nominators—that such a quotient may have the limit 1, or 0, or «>, or it may 
have no limit at all, finite or infinite. 

Indeterminate forms can sometimes be evaluated by using simple algebraic de¬ 
vices. For example, 


3* 2 -lx + 2 
S3 X 2 + 5* - 14 


( 2 ) 


has the indeterminate form 0/0, and this limit is easy to calculate by factoring 
and canceling, 
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3* 2 -lx + 2 (x- 2)(3x - 1) 3x — \ _ 5 

£3 x 2 + 5x- 14 " 1-2 U-2 )(jc + 7) ™ * + 7 9' 

In other cases, more complicated methods are required Thus, the limit 


is another indeterminate form of the type 0/0, and in Secions 2.5 and 9.2 geo¬ 
metric arguments were used to show that the value of this important limit is 1. 
In this connection we point out the suggestive fact that the limit (3) can also be 
evaluated by noticing that it is the derivative of the function sin x at x = 0: 


lim 

»o 


sin jc 
x 


lim 

x —>0 


sin x — sin 0 
x-0 


d 

= — sin x 
dx 


,v=0 


COS x] x =o = cos 0 


1 . 


Indeed, every derivative 


f'{a) = lim 


f(x)~f(a) 

x — a 


(4) 


is an indeterminate form of the type 0/0, since both numerator and denominator 
of the fraction on the right approach zero as x approaches a .* 

These remarks suggest that there is a close but hidden connection between 
indeterminate forms and derivatives. And so there is. But to understand this con¬ 
nection, it is first necessary to recall the Mean Value Theorem. 

As we learned in Section 2.6, this theorem states that if a function y = /(;c) is 
defined and continuous on a closed interval a < x < b, and differentiable at each 
point of the interior a < x < b, then there exists at least one number c between 
a and b for which 


f'ic) 


Ab) -m 

b - a 


or equivalently, 


fib) -fid) =f\c)ib - a). 


This assertion is best understood in geometric language (see Fig. 12.1); it says 
that at some point on the graph between the endpoints, the tangent line is paral¬ 
lel to the chord joining these endpoints. From this point of view the theorem 
seems obviously true and is difficult to doubt; but in fact, as we saw in Section 
2 .6, it is a rather deep theorem whose validity depends in a crucial way on the 
stated hypotheses. 

In most of our work we try to avoid dwelling on the theoretical parts of cal¬ 
culus. Here, however, we must make an exception, because the central fact of 
this chapter (L’Hospital’s rule, in the next section*) cannot be understood unless 
we know what the Mean Value Theorem says. 



"Students should examine formula (4) together with a suitable sketch, in order to convince them¬ 
selves that this formula can be taken as the definition of the derivative of an arbitrary function f(x) 
at a point x = a We have not had occasion to use this version of the definition before, but it will be 
particularly convenient for our work in the present chapter. 
f L’Hospital is pronounced “LOW-pee-tal.” 
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12.2 

THE INDETERMINATE 
FORM 0/0. 
L’HOSPITAL’S 
RULE 


We remarked earlier that there is a close connection between indeterminate forms 
and derivatives. We begin to explore this connection with the following simple 
theorem: ///(;c) and g(x) are both equal to zero at x = a and have derivatives 
there , then 


£x) _ f{a) 
™ g{x) g'(a) 


/'(*) 

g\x) 



0 ) 


provided that g'(a) =£ 0. To prove this, it suffices to use f(a) = 0 and g(a) = 0 
to write 


M m-m _ Lfw-mvk-a) .ru) 

g(x) g(x) - g(a) [g(x) - g(a)]!(x - a) ^ g'( a y 


as stated. 

As examples of the use of (1), we easily find the limits (2) and (3) in Section 

12 . 1 , 


5x‘ 


1 • 

X 2 + 5* - 14 “ 2x + 5 


x=2 


( 2 ) 


and 


lim ■ 
*->() 


cos JC 


= cos 0=1. 


x=0 


(3) 


As another example, we have 


tan 6x 

lim —t--- 

e z — I 


6 


sec 2 6x 
2e lx 


x=0 



(4) 


a result that would have been hard to find in any other way. 

Formula (1) requires the existence of the derivatives of the functions /(jc) and 
g(jc) at the single point x = a. At other points these functions need not have 
derivatives, nor indeed even be continuous. However, if the derivatives exist in 
an interval about a and are continuous at a , then we can obtain formula (1) in 
another way, by applying the Mean Value Theorem separately to the numerator 
and denominator, 


/(*) = /(*) ~ f(a) = f\c i)U “ a) = f(C]) f\a) 

g(x) g(x) ~ g(a ) g\c 2 )(x - a) g\c 2 ) g\a) ’ 


as x^> a. Here C\ and c 2 lie between jc and a, so both approach a as jc —» a. 

What purpose is served by giving a second alternative proof of formula (1) 
when the first proof is perfectly satisfactory? The point is this: Formula (1) is a 
good tool to have, but is still only of limited value, because it often happens in 
the problems we consider that f (a) = g' (a) = 0, and in this case the right side 
of (1) is meaningless. However, we can use our second proof to get around this 
difficulty as follows. Suppose that c\ and c 2 in (5) can be taken equal to one an¬ 
other so that the first part of (5) can be written as 

A*) = /(*) ~ f(a) = f'(c)(x - a) = f\c ) 
g(x) g(x) - g(a) g'(c)(x ~ a) g\c) ’ 


where c is between jc and a. Then in forming the limit as jc —> a, (6) permits us 
to replace the quotient 
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44 by the quotient 
gW g to 

L y Hospital’s rule states that under certain easily satisfied conditions this re¬ 
placement is legitimate, that is. 


lim 


/to =lim £to 

gto *-*. g'(x) 


(7) 


provided the limit on the right exists. Students should remember that /(«) = 
g(a) = 0 is assumed here, and we also mention that even though ordinary two- 
sided limits are usually intended in (7), one-sided limits are allowed. 

It may be helpful to students if we now give a formal statement of our main 
result. 


Theorem (L’Hospital’s Rule) Let a be a real number and let f(x) and g(jt) be func¬ 
tions that are differentiable on some open interval containing a. Assume also that 
g'(jc) ^ 0 on this interval, except perhaps at the point a itself. If f{a) = 0 and g{a) = 
0, then 


lim 

x—>a 


/(*) = ,j /'(*) 

g(x) g\x) ■ 


(7) 


provided the limit on the right exists.* 


L’Hospital’s rule is named after the French mathematician—a pupil of John 
Bernoulli—who published it in his book Analyse des infiniment petits (182 pp., 
Paris, 1696), which was the first calculus textbook and enjoyed wide popularity 
and influence. 


Example 1 At the beginning of this section we evaluated the limits (2), (3), and 
(4) by using formula (1). These limits can also be found by using L’Hospital’s 
rule (7): 

lx 2 -lx+ 2 6x-l 5 

lim 9 ^ -— = lim -—— = —, 

x->2 x 2 + 5jc — 14 -v —>2 2jc + 5 9 


sin x cos x 

lim-= lim —-— = 1, 

-v—*0 X x->0 1 


tan 6x .. 6 sec 2 6jc 

lim —r--- = lim ——-r-— = 3. 


x — e lx - 


2e 2 


The reason (7) works so smoothly in these problems is that in each case the 
second limit exists and is easy to find by inspection, since the functions involved 


*For those students who are interested in the proof of L’Hospital’s rule, we here explain as briefly 
as possible the details of the reasoning that underlies (7). We assume—as stated — that f(a) = g(a) = 
0, that jc approaches a from one side or the other, and that on that side the functions f{x) and gU) 
satisfy the following three conditions: (i) Both are continuous on some closed interval I having a as 
an endpoint; (ii) both are differentiable in the interior of /; and (iii) g'U) ^ 0 in the interior of I. With 
these hypotheses, (6) is an immediate consequence of a technical extension of the Mean Value The¬ 
orem known as the Generalized Mean Value Theorem; and if x is now allowed to approach a from 
the side under consideration, then (7) follows from (6) as indicated above. Those tenacious students 
who like to nail everything down will find a proof of the Generalized Mean Value Theorem in Ap¬ 
pendix A.4. 
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are continuous. The point we wish to make here is that whatever (1) can do, (7) 
can do just as easily; and as the next example shows, (7) is much more power¬ 
ful and often works easily when (1) doesn’t work at all. 


Example 2 L’ Hospital’s rule (7) proves its value in limit problems like 


lim 

x^O 


1 - COS X 


Here formula (1) is useless, as we see from the failure of the attempted calcula¬ 
tion 

1 - cos * _ sin *1 _ 0 

Isa * 2 - 2x \x =0 ~ 0 

The reason for this failure is that (1) assumes that g'(a) =£ 0, and this condition 
is not satisfied here. However, by (7) we have 


lim 

x —>o 


= lim 

x->0 


sin .r 
2x ’ 


if the second limit exists. But this second limit is again of the form 0/0, so 
L’Hospital’s rule applies a second time and permits us to continue and reach the 
correct answer, 


lim 

x —>0 


lim 


sin x 
~~2x~ 


cos 
lim 

-X—>0 


2' 


Another limit that behaves in this way is 


lim 
x— >o 


V7TT - (i + \x) 


= lim 

.x—>0 


iOt+l)-" 2 

2x 


1 

2 


= lim 

.x—»o 


4(*+ i r 3/2 
2 


8 ' 


The limits in Example 2 illustrate the great advantage L’Hospital’s rule (7) has 
over formula (1): It is valid whenever the limit on the right exists, regardless of 
whether g\a ) is zero or not. Thus, as these problems show, if f\a) = g\a) = 0, 
then we have another indeterminate form 0/0 and we can apply L’Hospital’s rule 
a second time, 


rim rn = lim m 

g( X ) x^a g (X) 


lim 

x—>a 


fix) 

g\x) ’ 


provided the last-written limit exists. As a practical matter, the functions we en¬ 
counter in this book satisfy the conditions needed for L’Hospital’s rule. We there¬ 
fore apply the rule almost routinely, by continuing to differentiate the numerator 
and denominator separately as long as we still get the form 0/0 at x = a. As soon 
as one or the other (or both) of these derivatives is different from zero at x = a, 
we must stop differentiating and hope to evaluate the last limit by some direct 
method. 


Example 3 A careless attempt to apply L’Hospital’s rule may yield an incorrect 
result, as in the calculation 
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lim 


sin 4;t 


. _ v = lim 

>0 2 * + 3 *-*0 


4 cos 4 jc 


4 

2 


2 . 


The correct answer is 


.. sin 4x 

lim --- 

*->o 2 jc + 3 



In this problem the numerator and denominator of the given quotient are not both 
equal to zero at x = 0, so L’Hospital’s rule is not applicable. 


Our methods work in just the same way for limits in which x —> °o; that is, if 
f(x) —» 0 and g(x) —» 0 as x —> ©°, then 


r f{x) r f'(x) 
lim ~= lim J —r—, 

X ^°° g(x) X^oo g (*) 


( 8 ) 


if the limit on the right exists. To see this, we put x = 1 It and observe that 
t —> 04- (recall that this notation means that t approaches zero from the right). 
Briefly, L’Hospital’s rule (7) now gives 

ffL = .. = lim /'(*) dxfdt 

i™ g(x) g(l It) g'(x) dx/dt’ 

which yields (8) after dx/dt is canceled. 

Finally, in both forms of L’Hospital’s rule, as expressed in formulas (7) and 
(8), it is easy to see that the procedure remains valid if the value of the limit on 
the right is °o or — oo. 


PROBLEMS 


Find the following limits. 

, .. sin 3x 

1 lim —-. 

*->o sin x 

v x ~ 2 

2$ 6x 2 - IOjc-4- 

VJT9 - 3 


3 

5 

7 

9 

11 




fx + 1 - (1 + TJC) 


lim 

x —>0 

.. e x -\-x 

lim- 5 - 

x->0 x 2 

X 3 

lim —-. 

»o sin x — x 

3x 


13 lim 


*-»o tan x 


2 

4 

6 

8 

10 

12 

14 


In x 
lim- 

x— >1 X 1 

P x — £ 

lim 
»o 


lim 

x-*l 


lim 

x —>0 


sin 5x 

4x 3 -5x + 1 
In x 

sin -1 3x 


x 

- 1 - JC 


;c—>0 1 - COS TTX 

e 2x - 1 


15 

lim 

x — >0 

sin 2 x + 8jc 
e 2x — 1 

16 

lim 

x —>6 

17 

lim 

;c—>0 

x — sin x 
x — tan jc* 

18 

lim 

X—>TT 

19 

lim 

\/x 

20 

lim 

*->o 

sin 7 t/x’ 


lim 

;c —>0 

lim ' , r . 

>0 sin 5x 

lim - ' n ^ an ^ . 

X^n/A Sin X — COS X 

~x-2 - 2 
v 2 __ 


36 

In (cos 2x) 
(x - 7r) 2 
a x — b x 


21 

tan 2jc - 2jc 
lim 

x->o x — sin x 

22 

lim 

x->0 

sin x 3 
sin 3 x ' 

23 

sin 2x — 2 sin x 
lim . „ _ . 

»o sin 3jc — 3 sin x 

24 

lim 

x->\ 

1 1 


..... («* ~ l) 3 

(jc — 2)e x + x 4- 2’ 


26 In Fig. 12.2, P is a point on a circle with center O and 
radius a . The segment AQ equals the arcA^ and the line 
PQ intersects the line OA at B. Show that OB approaches 
2a as P approaches A along the circle. Hint: A QAB is 
similar to A PRB. 
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27 In Problem 26, let f(6) be the area of the triangle ARP 
and let g(9) be the area of the region that remains after 
the triangle ORP is removed from the sector OAP. Find 
formulas for the functions f{9) and g(9) and evaluate 
lim e _>of(9)/g(0). 

28 L’Hospital’s rule (7) works in just the same way if the 
conditions f(a) = g(a ) = 0 are replaced by the conditions 
lim x ^ a fW = lim^agC*) = 0. Explain. Use this idea to 
evaluate 


lim 

x —>0 


(1 + x) l/x — e 
x 


29 The formula 


f'W = lim 

/i—»o 


/(* + /»)-/(*) 

h 


is one version of the definition of the derivative. By treat¬ 
ing the right side as an indeterminate form, derive this 
formula from L’Hospitafs rule. 


12.3 

OTHER INDETERMINATE 
FORMS 


For certain applications it is important to know that L’Hospital’s rule remains 
valid for indeterminate forms of the type oo/oo. That is, if the numerator and de¬ 
nominator of the quotient f(x)/g(x) both become infinite as x—» a, then 


r /to r f'W 

lim —— = lim - , 

gW *~> a g W 


( 1 ) 


provided the limit on the right exists. The argument is a bit tricky, and is sketched 
in Remark 2 so that those who wish to skip it can conveniently do so. Just as in 
Section 12.2, one-sided limits are allowed and (1) extends immediately to the 
case in which x—> °o; also, it remains valid if the limit on the right is °o or — «>. 


Example 1 Show that 


x p 

lim — : = 0 

*—*>° £ x 


( 2 ) 


for every constant p. 


Solution We begin by observing that if p < 0, then this limit is not an indeter¬ 
minate form and its value is easily seen to be zero. On the other hand, when 
p > 0, the limit is clearly an indeterminate form of the type oo/oo. L’Hospital’s 
rule (1) for the case in which x —> °o therefore gives 


x p 

lim —— = lim 

x —>°° e x x-*°o 


px p ~ l 
e x ’ 


if the limit on the right exists; and if this process is continued step by step, we 
can reduce the exponent for x to zero or a negative number, and the desired con¬ 
clusion (2) now follows from the above remark about this case. This example 
gives us important insight into the nature of the exponential function: as x —> °o, 
e x increases faster than any positive power of x, however large, and therefore 
faster than any polynomial. 


Example 2 Show that 


lim 

X—>°° 



(3) 


for every constant p > 0. 

Solution This limit is clearly an indeterminate form of the type oo/oo, so by 
L’Hospital’s rule we have 
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lim -^ 77 - = lim 

X->oo X P JC—>oo 


1/jc 

px [) ~ 1 


= lim —- = 0 . 
px p 


Expressed in words, (3) tells us that as jc —» ©©, In jc increases more slowly than 
any positive power of jc, however small. 


We have discussed the limits (2) and (3) before, by clumsier special methods, 
in Sections 8.3 and 8.4. But our present treatment of these important facts is 
clearly preferable, because the powerful method of analysis based on L’Hospi- 
tal’s rule extends easily to many similar limits. 

The expressions 

0 • oo^ co — oo ? 0°, °°^> 1°° 

symbolize other types of indeterminate forms that sometimes arise. The product 
/(jc)g(jc) where one factor approaches zero and the other becomes infinite (0 • °o) 
can be reduced to 0/0 or oo/oo by putting the reciprocal of one factor in the 
denominator. The difference between two functions which are both becoming 
infinite (©© — oo) can often be manipulated into a more convenient form. A power 
y = f(x)*W that produces an indeterminate form of one of the other types is best 
handled by taking logarithms, 

In y = In f(xf (x) = g(x) In f(x). (4) 

This reduces the problem to the more familiar form 0 • °o; and since y = e lny , 
we then use the continuity of the exponential function to infer that lim y = 
lim e lny = e hm ln > ’. These generalities are illustrated in the following examples. 


Example 3 Evaluate 


lim x In jc. (5) 

jc-X)+ 


Solution Here jc is required to approach zero from the positive side because In jc 
is defined only for positive jc’s. Since In jc —> — ©© as jc— > 0 + , it is clear that (5) 
is an indeterminate form of the type 0 • ©©. The value of this limit is not obvious, 
because as jc — > 0 +, we cannot tell whether the product jc In jc is influenced more 
by the smallness of jc or by the largeness (in absolute value) of In jc. However, 
we can easily convert the limit into an indeterminate form of the type °o/°o and 
apply L’Hospital’s rule (1), as follows: 


lim jc In jc = lim 77 — = lim 

jc— >0+ jc— >0+ I/jc jc— >0+ 


1/jc 
1 /jc 2 


= lim 

x— >0+ 


i-x) = 0 . 


Thus, the smallness of jc turns out to dominate the behavior of the product 
jc In jc near jc = 0 . 


Example 4 Evaluate 

lim (sec jc — tan jc). ( 6 ) 

x— >tt/2 

Solution This is of the type 00 - do. We convert it into an indeterminate form 
of the type 0/0 and apply L’Hospital’s rule, 
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lim (sec x — tan x) = lirn 

x-*tt/2 x-*tt/2 


\ COS X 


sin^_\ 
cos x ) 


1 - sin x ,, 

= lim -= lim 

COS X 


“COS X 


*-47r/2 — sin x 


0. 


Example 5 Find lim ;c ->o+ x% - 

Solution This limit is of the type 0°, and we reduce it to the simpler type 0 • 00 
by taking the logarithm. To do this most conveniently, we write y = x x and 
observe that 

In y = In x x = x In x—> 0 as »0 + , 

by Example 3. This tells us that 

x x = -y = e ln - v —> e° = 1, 

by the continuity of the exponential function. Therefore we have 


Example 6 Find lim^oo x l/x . 

Solution This limit is of the type °o°. We write y = x xtx and observe that 
In y — In x 1 


Ux _ ]nx A 

r y -—> 0 as x - 


by Example 2. This tells us that 

X l/x = y = e ]ny e° = 1, 

or equivalently, 

lim x l/x = 1. 

*—>00 


( 8 ) 


Example 7 Show that 


for every constant a. 


lim (1 + ax) x,x = e a 


(9) 


Solution If a = 0 this limit is not an indeterminate form, and the statement is 
clearly true because each side has the value 1. If a =£ 0, the limit is an indeter¬ 
minate form of the type I 00 . In this case we write y = (1 + ax) [/x and observe 
that 


lim In y = lim 
*—»o *—>0 


In (1 + ax) 
x 


= lim 
*—>0 


a/( 1 + ax) 
1 


= a. 


This implies that 


which is (9). 


(1 + ax) {,x = y = e lny — > e a , 
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Remark 1 The UHospital habit. Like any mathematical procedure, L’Hospi- 
tal’s rule should be used intelligently, and not purely mechanically. We should 
try to control the bad habit of automatically applying L’Hospital’s rule to every 
limit problem that comes up. Often there is an easier way, for instance, the use 
of familiar limits or simple algebraic transformations. 

(a) The limit 


2x 5 + 3* 2 + 4 


is of the type oo/oo, and can be found by repeated use of L’Hospital’s rule. But it 
is much simpler to divide both numerator and denominator by x 5 and write 

6x 5 -2 = 6 — 2/x 5 6-0 

2x 5 + 3x 2 + 4 2 + 3/;t 3 + 4/;t 5 ”* 2 + 0 + 0 

(b) The limit 


is of the type 0/0. L’Hospital’s rule can be applied, and works, but it is much 
easier to notice that 

sin 3 x _ I sin ;c \ 3 p _ 


because we already know that (sin x)/x —» 1 as x —> 

(c) The limit 


lim 

*—>00 


Vx 2 +1 

X 


0 . 


is of the type 00 / 00 , and L’Hospital’s rule gives 


lim 


Vj 2 + 1 


lim X/V *- 2 + 1 = lim 


*"*“ Vx 2 + 1 lim (Vx 2 + 1/x)' 


This brings us back to the limit we started with, and gets us nowhere. However, 
it is very easy to insert the denominator into the radical and write 


Remark 2 The argument for L’Hospital’s rule (1) in the case 00/00 can be briefly 
sketched as follows. Let f(x) and g(x) both become infinite as x—> a from one 
side or the other, and suppose that f'(x)/g'(x ) —> L. We want to show that also 
f(x)/g(x) —> L. For x near enough to a on the side under consideration (see Fig. 
12.3), f'(x)/g'(x) can be made as close as we please to L between x and a. If x 
is between x and a, and if f(x) and g(x) are assumed to satisfy the simple condi¬ 
tions (i) to (iii) stated in the footnote in Section 12.2, then 

/(*)-/(*) = f'(c) 
g(x) - g(x) g\c) 

for some c between x and x. Since c is also between x and a, we know that 


x c x a 


Figure 12.3 
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f'(c)/g'(c ) is close to L. Now hold x fixed and let x —» a. Then f(x) and g(x) grow 
very large,/(x )//(*) and g(x)/g(x) become very small, and 



fix) -f{x) 

g(x) ~ g(x) 


M r i zfjxmx) 

g(x) [ 1 - g(x)!g(x) 


is close to f(x)/g(x). It follows that f(x)/g(x) is close to f'(c)/g'(c), which in turn 
is close to L, so f(x)/g(x ) is itself close to L when x is close to a, and this is what 
we wanted to establish. 


PROBLEMS 


Evaluate the following limits by any method. 

l r 18x3 

1 lim 


3 + 2x 2 -6x 3 ’ 
tan x 


2 lim ln (ln 


3 lim 

>7r/ 

5 lim 


*->7 t /2 1 + Sec X* 

tan x 


x^tj /2 tan 3 x’ 


7 

lim 

*-> 0 + 

ln x 

8 

CSC x’ 

9 

lim 

*->0+ 

ln (sin 2 x) 
ln x 

10 

11 

lim x sin — 

*->00 x 

12 

13 

lim (x 2 - \)e~ x \ 

14 

15 

lim , 

X —A 00 

e x 1 n x. 

16 

17 

lim 

*-> 0 + 

sin x ln x. 

18 

19 

lim 

X —>00 

/ A 2 A 2 


\X 1 X + 1 

21 

lim I 

x—>0 \ 


22 

23 

lim 

*-> 0 + 

(sin x) x . 

24 

25 

lim 

*-»o+ 

^-tan.x 

26 

27 

lim (e x - 1 

x->0+ 

28 

29 

lim 

*-> 0 + 

(sin x) si ™. 

30 

31 

lim (ln x) 1/x . 

x— >00 

32 

33 

lim (1 + e ax ) lti , a > 0.34 

X->°° 

35 

lim (1 + ax) 1/x , a > 

0.36 


x->°° ln x 

4 lim ]H*-. 

X 2 

6 lim — 3 -. 

X —>00 £ 3 - k 

(In x ) 10 
lim -. 

x—>°° X 


x -^ tt /2 




*->0 




16 lim x - tan 1 xj. 

■ esc (5 sin 2 x). 

x sin x] 


1 


x—> 1 y x 

lim (tan 

*-> 0 + 


ln x 


vJC 2 


x— >0+ 

lim x 1 
*-» 0 + 

lim (1 

x->0 

lim 

x— >7r/2 — 


\1-cosjc 


r 100 a \/x 


37 

lim (cos x) 1/x 

x ->0 

38 

lim 

x —> tt /2 

(sin x) tanv . 

39 

lim x 1/(1 ~ x2 \ 

X —>1 

40 

lim 

*~»o+ 

(cos Vx) 1/X . 


41 In spite of the evidence piled up in Problems 23-30, 
indeterminate forms of the type 0 ° do not always have 
the value 1. Show this by calculating 


42 


lim xP /]n -\ 

x-> 0 + 


where p is a nonzero constant. 

(a) Sketch the graph of the function y = /(x) defined 
by 


/« = 


e ~vx 2 

0 


ifx^O, 


if x = 0 . 


(b) Show that lim x~ n e~ ]/x2 = 0 for every positive in¬ 
teger n. 

(c) Show that fix) as defined in (a) has an /ith deriva¬ 
tive f n \x) for every positive integer n and every x 
=£ 0. [We do not ask for a general formula for/^fx), 
but students should carry the calculations far 
enough to show that f^ n \x) is always given by a for¬ 
mula of a certain form, involving certain constant 
coefficients.] 

(d) Use parts (b) and (c) to show that/^fO) = 0 for 
every positive integer n. 

*43 As x —» 0 +, show that 


cot x- —»0 and 

x 


1 

cot X H -—» °o, 

X 


but that 


cot x - ~^ cot x + ~j 


2 

3' 


44 Use (4) in the text to explain why 1 # , 0 1 , and 0°° are not 
indeterminate forms. 
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When we write down an ordinary definite integral as defined in Chapter 6, 

f f(x) dx, (1) 

Ja 

we assume that the limits of integration are finite numbers and that the integrand 
f(x) is continuous on the bounded interval a < x < b. If f(x) > 0, we are thor¬ 
oughly familiar with the idea that the integral (1) represents the area of the 
region shown on the left in Fig. 12.4. 

In the next section and in Chapter 13 it will be necessary to consider so-called 
improper integrals of the form 

[7W dx, (2) 

in which the upper limit is infinite and the integrand f(x) is assumed to be con¬ 
tinuous on the unbounded interval a < x < «>.* We define the integral (2) in the 
natural way suggested on the right in Fig. 12.4; that is, we integrate from a to a 
finite but variable upper limit f, and then we allow t to approach °o and define 
(2) by 

f f(x) dx = lim f f{x) dx. 

Ja t—Ja 

If this limit exists and has a finite value, then the improper integral is said to con¬ 
verge or to be convergent, and this value is assigned to it. Otherwise, the inte¬ 
gral is called divergent. If f(x) > 0, then (2) can be thought of as the area of the 
unbounded region on the right in Fig. 12.4. In this case the area of the region is 
finite or infinite according as the improper integral (2) converges or diverges. 


Example 1 



dx = lim 

t —»°o 



dx = lim [—e x ]o = lim 

r—**= t —»oo 



= 1 . 


This improper integral converges, because the limit exists and is finite. 


*The word “improper” is used because of the “impropriety” at the upper limit of integration. If we 
wish, we can speak of (1) as a proper integral because it has no improprieties, but this is neither nec¬ 
essary nor customary. 



12.4 

IMPROPER INTEGRALS 


Figure 12.4 
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Students often tend to abbreviate this calculation by writing 
f~e~*dx= l-e~*]$= ~ + 1 = 1, 

instead of writing out the limits as we have done in Example 1. This shorthand 
rarely causes any real difficulties. However, in our work in this section we will 
always write out the limits for the sake of emphasizing that improper integrals 
are defined as limits. 


Example 2 


dx I 

'' dx 

_ r 

—2 = hm 

—r = lim 


Jl x Z t->°° J 

1 

X 


This improper integral also converges. 




1 . 


Example 3 



ft dr 

lim — = lim [In x]\ = lim In t = °o. 
t—J i x t—t— 


This integral diverges, because the limit is infinite. 


Example 4 


I cos x dx = lim | cos x dx = lim sin f, 

JO /—»oo Jo r—>oo 

which does not exist. This integral diverges, because the limit does not exist. 


Our next example generalizes Examples 2 and 3 and contains specific infor¬ 
mation that will be needed in Chapter 13. 


Example 5 If p is a positive constant, show that the improper integral 



converges if p > 1 and diverges if p < 1. 


Solution The case p — 1 is settled in Example 3, so we assume that p ± 1. In 
this case we have 


1°°^ = lim I 

'< dx 

= lim 

- x \-p 

J 1 X p Moo J 

1 xP 

t —>oo 

.1 -p _ 


= lim 

f —>oo 


t'-p - 1 

1 -p 




if p > 1 


oo if p < 1, 


and this completes the proof. 


We consider the geometric meaning of Example 5 by examining Fig. 12.5. The 
fundamental facts are these. The integral 
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Figure 12.5 


converges because the curve approaches the x-axis “rapidly enough” as x —> ©o. 
And integrals such as 

\~}-dx and 
h x Ji x m 


diverge because the curves do not approach the x-axis “rapidly enough” as 
x —> co. When the exponent p is allowed to decrease through values greater than 
1 (for instance, p = 4, 3, 2, 1.5, etc.), then it is easy to see that the correspond¬ 
ing graph of y = \/x p to the right of x = 1 rises; also, the calculation shows that 
the area of the unbounded region under this graph increases but remains finite. 
When p reaches 1 this area suddenly becomes infinite, and it remains infinite for 
all values of p < 1. It is indeed remarkable that a region of infinite extent can 
have a finite area, as happens here when p > 1. We will comment further on this 
phenomenon in Remark 1 below. 

Another type of improper integral arises when the integrand/(x) is continu¬ 
ous on a bounded interval of the form a<x<b but becomes infinite as x 
approaches b, as shown in Fig. 12.6. In this case we can integrate from a to a 
variable upper limit t which is less than b. This integral is a function of t, and 
we can now ask whether this function approaches a limit as t —> b. If so, we use 
this limit as the definition of the improper integral of f(x) from a to b, 

f fix) dx = lim [ f(x) dx. 

Ja t—>b Ja 

As before, this integral is called convergent if the limit exists and is finite, and 
divergent otherwise. 


Example 6 


r 

Jo vr^x 



dx 

Vl - JC 


= lim [-2Vl - x]'o 


= lim [— 2VT"-7 + 2] = 2. 



Figure 12.6 


This improper integral clearly converges. 


There are several other types of improper integrals which we mention only 
briefly because the ideas are essentially the same as those already described. 
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If the impropriety of an integral occurs at the lower limit, we use t as the lower 
limit and then let t —> — or t a, as the case may be. If the integrand misbe¬ 
haves at several points, then the improper integral — if it exists—is obtained by 
dividing the original interval into subintervals. 

Finally, if f(x) is continuous on the entire real line, then we write, by defini¬ 
tion , 

|_ M /W dx = f_^/(x) dx + f g fix) dx, 

where convergence for the improper integral on the left means that both integrals 
on the right converge. An integral from — oo to oo can be split at any convenient 
finite point just as well as at the point x = 0. 

Example 7 


f°° dx _ f () 

dx f~ 

dx 

J-oo 1 + X 2 J-°° . 

1 +x 2 Jo 

1 + X 2 


= lim P-r^T + lin. 

t ->-~ it 1 + x l /->» Jo 1 + x l 

= lim [tan~ l ;t]?+lim [tan^ 1 ;c]o 

M—“ f— 

= lim (-tan -1 t) + lim tan -1 t = -f —J] + ^ = it. 

t-> — oo f->oo \ 2 1 2 


Remark 1 Students may still be skeptical that a region of infinite extent can 
have a finite area. If so, then the following example may help. Consider the re¬ 
gion under the curve y = 1/2* for 0 < x < ©o. This region is shaded in Fig. 12.7, 
and clearly has a smaller area than the combined area of all the rectangles shown 
in the figure. But these rectangles have base 1 and heights 1, t, so their 

combined area is exactly 2, as indicated: 

! + i + W+-'- = 2.* 

It follows that the shaded region — of infinite extent! — has finite area less than 

2. The area of this region can even be computed exactly; since 2* = ( e Xn2 ) x = 
e x]n2 , it is 

’"This is an infinite geometric series of a kind often studied in high school algebra courses. We dis¬ 
cuss these series in much more detail in Chapter 13. 



Figure 12.7 


1 


2 


3 


4 
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lim 

/-)«> J 

e ~x\ril 

b 

lim 

r_j_ 

ii m 

in 2 

lim 

t— >oo 

1 

In 2 

1.4427. 


' ~T ~P ‘ 


1 

In 2 


Remark 2 Generally speaking, improper integrals play a more substantial role 
in higher mathematics than they do in calculus. We mention two important ex¬ 
amples—which we do not pursue any further in this book—to give students 
some idea of what we’re talking about. 

(a) The improper integral 

F( p) = Jf x p ~ l e~ x dx 

(the symbol on the left is capital gamma in the Greek alphabet) is called the 
gamma function. This is a very interesting function which is studied in advanced 
calculus and elsewhere. It has innumerable applications to physics as well as to 
geometry, number theory, and other parts of pure mathematics. 

(b) The improper integral 

F(p) = \~ e-r* f(x) dx 

has many significant applications to electric circuits, vibrating membranes, and 
heat conduction, and to the solution of certain types of differential equations. It 
is a function of p associated with the given function/(*), and is called the Laplace 
transform off(x).* 


*See Chapter 9 in the author’s book Differential Equations with Applications and Historical Notes, 
2nd ed. (McGraw-Hill, 1991). 


PROBLEMS 


In each of the following problems, determine whether or not 
the improper integral converges, and find its value if it does. 

dx 


dx 

Jt3 m 


3 [--£ 

Js x' 

- f“ 1 . 1 , 

5 —sin — dx. 

Ji x l x 

i. 


1 

9 

10 


x x 
dx 

] e jc(ln x) 2 ’ 

f( 


: dx. 


Wx v7T3 


y 


dx. 


(° 

Jo (1 +Jt) 3 ' 

I sin x dx. 

JO 

r°° dx 

Je x In x 

e~ x cos x dx. 
Jo 


11 

r°° dx 

12 f“ 4* 


J 1 x(x + 2)' 

h xV\6 + X 2 

i 

f 2 In x dx 

14 P dx 

i j 

Jo Vx 

Jo 4 — X 2 ' 

15 

f \x\e~ xi dx. 

J — oo 

16 e~ x cos x dx. 

J — oo 

17 

Let p be a positive < 

constant. Determine the values of p 


for which the improper integral 
n dx_ 

Jo x p 

is convergent, and those for which it is divergent. 
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18 Consider the region under the graph of y = 1 /x for x > 1. 
Even though this region has infinite area, show that the 
solid of revolution obtained by revolving this region about 
the x-axis has finite volume, and compute this volume. 

19 Consider the region in the first quadrant under the curve 
y = l/(x + l) 3 . Find the volume of the solid of revolu¬ 
tion generated by revolving this region about (a) the 
x-axis; (b) the y-axis. 

20 The region under the curve y = 4/(3* 3/4 ) for * > 1 is re¬ 
volved about the x-axis. Find the volume of the solid of 
revolution generated in this way. 

21 Show that the surface area of the solid of revolution de¬ 
scribed in Problem 20 is infinite. As a result of these 
calculations, we see that a container in the shape of this 
surface can be filled with paint (it has finite volume), but 
nevertheless its inner surface cannot be painted (it has 
infinite surface area). Hint: Use the obvious inequality 



to show that 


surface area > f = °°. 

3 J i x 3/4 

22 If a > 0 and the graph ofy = ax 2 + bx + c lies entirely 
above the x-axis, show that 


f 

j —© 


_ dx _ 

ax 2 + bx + c 


2tt 

V4 ac - b 2 


23 (A comparison test) Let /(x) and g(x) be continuous 
functions with the property that 0 ^/(x) ^ g(x) for a ^ 
x < Show that 

(a) if fa #(*) dx converges, then /(x) dx also con¬ 
verges; 

(b) if Jq/(x) dx diverges, then J“g(x) dx also diverges. 

24 Use the comparison test in Problem 23 to determine 
whether each of the following integrals converges or di- 


verges: 




<„r 

dx 

(b) 

>< 

Os 

1 

V* 3 + 5 ’ 

, r- 

cos 4 5 jc 

*(d) 

f“ In x , 

(c) 1 

, dx\ 

X 3 

— dx. 
Je X Z 


12.5 

THE NORMAL 
DISTRIBUTION. GAUSS 


Suppose a measurement or experiment is performed many times, and that its re¬ 
sult is a number. We can think, for example, of weighing the babies born in a 
certain hospital during a given year, or of measuring the annual rainfall in a cer¬ 
tain city over a number of years. Suppose the possible results of our measure¬ 
ment or experiment are numbers x that lie in an interval a < x < b. To record 
our results we can divide the interval [a, b] into n subintervals of equal length, 
say a = xq < x\ < X 2 < * * * < x n = b, and then count the number of times m * 
that our result is a number between x*_ 1 and x *. When this way of arranging the 
data is represented by a step function whose height is ra* over the kth subinter¬ 
val, the resulting graph is called a histogram. 

In Fig. 12.8 the birth weight data in the table on the left—taken from genuine 
vital statistics—is displayed in the histogram on the right. The total number of 
babies born in this hospital in this year was 2555. To find the average birth weight 
directly, we would have to calculate the number 

sum of all birth weights 
total number of babies 


But our table doesn’t provide individual birth weights, so without access to the 
original data this calculation is beyond our power. However, by using the mid¬ 
point of each weight interval, we find that the sum of all the birth weights is 
approximately 

(1.5)(12) + (2.5)(18) + (3.5)(46) + (4.5)(158) 

+ (5.5)(422) + (6.5X828) + (7.5)(491) + (8.5)(429) 

+ (9.5)(133) + (10.5)(18) = 17,419.5 lb. (1) 

The average birth weight, also called the mean , is therefore approximately 
17,419.5/2555 = 6.82 lb. In the histogram each term of the sum (1) is the prod¬ 
uct of the x-coordinate of the midpoint of a subinterval and the area of the cor¬ 
responding rectangle. 
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If we reconstruct our histogram by using a larger and larger number of smaller 
and smaller subintervals, we expect the graph to approach the graph of a smooth 
function/(x). We can now adjust the unit of length along the vertical axis so that 
the total area under the curve is 1. This gives a function y = f(x ) called the fre¬ 
quency density. This function has two characteristic properties: 

f(x) >0 and f f{x) dx = 1 . (2) 

Ja 

Also, if a < c < d < 6, then the integral 

\ d c f(x) dx (3) 

gives the ratio of the number of times the measurement produces a value between 
c and d to the total number of measurements, that is, the relative frequency of 
the result c ^ x < d. In the same way,/(*) dx can be thought of as the propor¬ 
tion of results that lie between x and x + dx. From this point of view, the inte¬ 
gral (3) can be interpreted as the probability that a randomly chosen measure¬ 
ment will have a result between c and d, and f(x) is then called a probability 
density function. 

In order to gain further insight into these concepts, let us for a moment think 
of f(x) as the mass density function of a rod of total mass 1 that lies along the 
x-axis between x = a and x = b. Then f(x) dx is the element of mass, xf(x) dx is 
the moment of this element of mass about the origin, and the integral 

X = f xf(x) dx (4) 

Ja 

is the center of mass of the rod, since f f(x) dx = 1. Also, the integral 

Ja 

I = [ (X- X) 2 f(x) dx 

Ja 


Figure 12.8 


(5) 
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is the moment of inertia of the rod about the line x = x as axis. We know from 
our experience in Chapter 11 that this quantity is small if most elements of mass 
are nestled close to the axis, and larger otherwise. 

In the case of a general probability density f(x ) with properties (2), the inte¬ 
gral corresponding to (4), 

m = f xf(x) dx, 

Ja 

is called the mean . As we know, the mean m is the point on the jc-axis where the 
region under the probability density graph, if it were made out of cardboard and 
placed in a horizontal position, would balance on the line x = m. The square root 
of the integral corresponding to (5), 

cr = J ^ (* “ m) 2 f(x) dx, 

is called the standard deviation. If cr is small, the results of our measurements 
cluster around the mean m\ and if a is large, then a significant portion of these 
results are farther away from m. 

In the general mathematical theory of probability of which these ideas are only 
a hint, it is customary to consider probability densities that are defined for all x , 
so that no limitations are placed on the possible results of the measurement or 
experiment under consideration. A probability density is then defined to be any 
function that satisfies the conditions 

f(x) > 0 and 

and the mean m and standard deviation 

m = xf(x) dx and 

Of course, these integrals are improper 
tion 12.4. 


[ f(x)dx= 1, (6) 

a are defined by 

o 2 = J (x - m) 2 f(x) dx. (7) 

integrals in the sense discussed in Sec- 


SEVERAL IMPORTANT IMPROPER INTEGRALS 



To reach our goal of understanding the normal distribution we must first con¬ 
sider several properties of the function y = fix) = e~ x 2 , whose bell-shaped graph 
is sketched in Fig. 12.9. We begin by pointing out that this function is even, which 
means that/(— x) = fix), so the graph is symmetric about the y-axis. Also, the 
values of the function are all positive, it has a maximum y = 1 at x = 0, and the 
graph has two points of inflection at * = ±yV2 (check this by calculating y"). 
It is clear that 

lim e~ x2 = 0, (8) 

because e~ x2 = \/e x2 and e x2 -^ooasi^ ±°°. Also 


lim xe x2 = 0, (9) 

X —»±°o 


because for \x\ > 1 we have \xe x2 \ = \x\e xl < x 2 e xl , and we know that 
\\m x ^ ±00 x 2 e~ xl — lim z _ >00 ze~ z = 0 by Example 1 in Section 12.3. 
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It is a remarkable fact that the area under the curve y — e x2 has the finite 
value 

J e~ x ~ dx = V 77 -, (10) 

because 

e~ xl dx = jVn. (11) 

This astonishing formula connecting e and tt is best established by using double 
integration in polar coordinates. The details of the proof are given in Example 3 
of Section 20.4, but for the present we simply accept it. 

Next, we use the definition of an improper integral to write 

| xe~ xl dx — lim f xe ~ x2 dx = lim[— y e~ x2 ] ( 0 
Jo ?-)co Jo 2 u 

= lim (j-j e~' 2 ) = y. 

/—>00 

(0 

Here we used (8). Similarly I xe x2 dx = —j, so by putting these two integrals 
together we obtain 

f°° xe~ x ~ dx = 0.* (12) 

j—00 

Finally, an integration by parts with u = x, dv = xe ~ x2 dx gives 

J x 2 e~ x2 dx = -jxe~ x2 + y J e~ x 2 dx , 
so 

f x 2 e~ x2 dx = —^rte~ t2 + ~ f e~ x2 dx. 

Jo 2 2 Jo 

By (9) and (11) we now have 



Since the integrand x 2 e x2 is an even function, we conclude that 

[ x 2 e~ x2 dx = (13) 

j—00 


THE NORMAL CURVE 

Let m be any number and a any positive number. Then the function 


*This result also follows without calculation by observing that the integrand is an odd function, that 
is, it has the property /(—;c) = — f{x). 
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f(x) = — X —= e~ < - x ~ m '> 2l2al (14) 

oV2tt 

is called the normal (or Gaussian ) probability density function with mean m and 
standard deviation a. Since clearly/(x) > 0 for all x, to verify what is implicitly 
stated here we must show that 

f{x)dx= 1, (15) 

J — oo 

f xf(x) dx = m, (16) 

J — OO 


and 


[ (x - m) 2 f(x) dx = o 2 . (17) 

J— OO 

To prove these facts we use the change of variable t = (x — m)/oV2, so that 
t varies from — oo to ^ as i varies from — °o to °o and 

x = m + a\ / 2 t, dx = a\/2 dt, f(x) = —-— e~ r \ 

By using (10), (12), and (13) we establish (15), (16), and (17) as follows: 



and 


[ f(x) dx = — ^ — [ e f 2 crV 2 dt = — [ e t2 
J — aV2n J -°° Vtt J -~ 

[ xf(x) dx = —-— [ (m + (7V2r)e _/ W2 dt 
J ~°° ctV27t J ~°° 


dt 


m 

\Ti r 


/: 


e * 2 dt + a — te * at = m, 


f“ (x - m) 2 f(x) dx = —!— f~ 2o 2 f 2 e-'VV2 dt 
= dt = a 2 . 

Vtt J — 

The graph of (14) is called the normal (or Gaussian) curve with mean m and 
standard deviation a. It is symmetric about the line x = m, because the function 
(14) has the same values for x\ = m + a and xi = m — a. Also, the curve is bell¬ 
shaped, and the function assumes its maximum value of l/crV27r = 0399/a at 
x = m. Further, the curve has two points of inflection at the points x = m + a 
and x = m — a. To see this we calculate 


/'« 


_ X ~m e -( x -m)V2o* 

CT 3 V27T 


and 


/"« = 


1 e -(,x-m) 2 t2(T 2 _|_ 

CT 3 V27T 



(* ~ m ) 2 e -(x-m)V2cr 2 

<T 5 V27T 

e -(x-m) 2 l2cr 2 
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Figure 12.10 Changes in fix) as m 
varies and as cr varies. 


This formula tells us that the second derivative is positive for \x — m\ > a and 
negative for \x — m\ < a, which proves the statement about points of inflection. 

Normal curves with cr = 1 and m = 0, 2, -2 are shown on the left in Fig. 
12.10, and with m = 0 and cr = y, 1, 2 on the right. We observe that these curves 
are wide and flat for large cr, and narrow and peaked for small cr. For the spe¬ 
cial case in which m — 0 and cr — 1, we obtain the important standard normal 
probability density 

<AW = -4= e~* V2 . (18) 

The graph of this function is shown in Fig. 12.11. We notice that for — 1 < x < 1 
(within one standard deviation of the mean) we obtain 68.2 percent of the area un¬ 
der the curve, and for — 2 < x < 2 (within two standard deviations of the mean) 
we obtain 95.4 percent of the area under the curve. It is an interesting fact that 
these percentages hold for the areas under all normal curves within one or two stan¬ 
dard deviations of the mean. 

When/(x) is any probability density, the function of t defined by 

Fit) = f f(x) dx 

is called its distribution function. According to our previous interpretation, F(t) 
is the probability that x lies in the interval (—°°, t]. In particular, the normal dis - 



Figure 12.11 The standard normal 
curve (m = 0, cr = I) 
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tribution function (or simply the normal distribution) with mean m and standard 
deviation <x is the function 

F(t) = —!— P e-^-^ 2 ' 2 ^ dx. (19) 

<tV277 ” 

In the simplest special case, in which m = 0 and <x= 1, it is customary to denote 
this by 

O(r) = —L_ P e ~ xV2 dx, (20) 

V27T 

and to refer to it as the standard normal distribution. Tables have been constructed 
for the function <F(f) by the methods of numerical integration, and these tables 
can be used to solve many problems in science and mathematics involving prob¬ 
ability and statistics. Students who wish to explore these important ideas are urged 
to take an advanced course on mathematical probability. 

We have hinted at a procedure here, and it might be helpful to give a brief ex¬ 
planation of how this procedure works. To say that the quantity x is normally dis¬ 
tributed means that its density function is well approximated by (14) for suitable 
choices of m and a. The probability that x lies in the interval a<x<b is de¬ 
noted by P(a < x < b) and is given by 

P(a < x < b) = —!- f e~~ (x ~ m)2,2cr2 dx. (21) 

crV2^ Ja 

If we make the substitution t = (x — m)!cr, then a and b become 

, a — m , , , b — m 

a = - and b = -, 

cr er 

and the integral just written is transformed into 

P(a<x<b) = P(a' </<£>') = r e~ ,m dt 

Vln la ' 

= $(&') - <D(a')- 


This quantity can now b e calculated by using tables to look up the numerical val¬ 
ues of <t>(b') and <J>( a '). 

Many phenomena in science and society are normally distributed, and can 
therefore be modeled and calculated by using this machinery—for instance the 
heights of men of the same age in a large population, the speeds of molecules in 
a gas, the results of measuring a physical quantity many times, and so on. 



Example 1 The mean annual rainfall in New York City is 42 in. The annual rain¬ 
fall over many years is closely approximated by the normal density function with 
m — 42 and standard deviation cr = 2, 


f(x) = 


1 (*- 42 ) 2/8 

2V^ 


A sketch of this normal curve is shown in Fig. 12.12. Use this information to 
compute the proportion of years with rainfall between (a) 40 and 44 in; (b) 38 
and 46 in. 
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Solution (a) The proportion of years with rainfall between 40 and 44 in is 



r 44 

J40 


e -(x-42)2/8 dx _ 


With the change of variable t = (x — 42)/2 — and access to a table of values of 
O(f)—this becomes 

—— [' e ~' 212 dt = <t>(\) — <t>(— 1 ) 

V^7T 1 

= 0.8413 - 0.1587 = 0.6826. 


(b) Similarly, the proportion of years with rainfall between 38 and 46 in is 
(with the same change of variable) 


1 


2V^7t 


/38 


,-(*- 42 ) 2/8 dx 


-Tbl 


-tV 2 


dt 


V 277 


= 0 ( 2 ) - 0 (- 2 ) 

= 0.9772 - 0.0228 = 0.9544. 


Example 2 An examination is sometimes considered to have done its job of 
spreading student grades fairly if the frequency histogram of grades can be ap¬ 
proximated by a normal density function. Some teachers who go to this trouble 
then use this histogram and approximating curve to estimate m and cr, and as¬ 
sign the letter grade A to grades greater than m + cr, B to grades between m and 
m + cr, C to grades between m — cr and m, D to grades between m - 2a and 
m - cr, and F to grades below m - 2a. This is what is meant (or used to be 
meant) by grading on the curve. This approach to calculating grades is probably 
almost extinct in the modern era of grade inflation. 


Remark 1 How does it happen that these probability discussions are saturated 
with various forms of the function e~ x2 l We attempt to answer this question by 
showing how the normal probability density function (14) can be derived from 
simple and reasonable assumptions. 

Consider the experiment of a marksman repeatedly shooting at a target whose 
bull’s-eye is the origin of the xy-plane (Fig. 12.13), and suppose that we are only 
interested in the jc-coordinates of the points of impact. These ^-coordinates pro¬ 
vide an ideally simple example of quantities distributed in the pattern we wish 
to examine, being bunched together around x = 0 and tapering off symmetrically 
to the sides. 

If fix) is the probability density function of these jc-coordinates, then/(;c) dx 
is the probability for any particular shot that its jc-coordinate lies in the interval 
from x to x + dx. Similarly the probability of the y-coordinate lying in the in¬ 
terval from y to y + dy is g(y) dy , where g(y) is the probability density in the 
y-direction. Now, assuming that the x- and y-deviations from the bull’s-eye are 
independent of each other, then the product of the two probabilities, 

lfix) dx][g(y) dy] = f(x)g(y) dx dy =f(x)g(y) dA , 



Figure 12.13 


is the probability that the bullet hits the element of area dA shown in the figure. 
Assuming further that the experiment possesses circular symmetry, this proba- 
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bility will be the same for any equal element of area at the same distance r in 
any direction from the bull’s-eye. This amounts to assuming that fix)giy) is a 
function only of r 2 , 

fix)giy) = h(r 2 \ (22) 

where r 2 = x 2 4- y 2 . 

Differentiating both sides of (22) first with respect to x and then with respect 
to y gives 

f'(x)g(y) = h\r 2 ) • 2x and fix)g\y) = h\r 2 ) • 2 y. 

By eliminating h\r 2 ) from these equations we obtain 


or 


f'(x)g(y) _ f(x)g'(y) 
2x 2 y 

fix) = g’(y) 

2xf(x) 2ygiy) ' 


(23) 


Since the left side is a function of x alone, and the right side is a function of y 
alone, (23) implies that both sides are constant, in particular 


I^L = C 

2 xf(x) 


or 


fix) 

fix) 


= 2cx. 


Integration now gives 


In f(x) = cx 2 + d 


or 

fix) = e d • e cx2 = De cx2 (24) 

where D = e d . But fix) is a probability density function, so we must have 


f" ftodx= 1. 


(25) 


and this implies that c must be negative. We are free to write c in the form c = 
— l/2(j 2 for a positive constant cr, and (24) now becomes 

fix) = De~ x2/2(r2 . 

By integrating this from -<»to oo, changing the variable of integration from* to 
t = x/crV2, and using (10) and (25), we obtain 

D J~ e ~ x2/2a2 dx = Da V2 e~ t2 dt = DcrV2 Vtt = 1. 

Therefore D = Mcr \[2 tt and our function takes its final form, 


fix) = 


1 e -xV2o2 
(J V27T 


which is the normal probability density (14) with mean m = 0. 


Remark 2 Formula (11) can be thought of as a special case (let a —» oo) of a 
little-known formula called Laplace's continued fraction : 
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f e x2 dx = 7r - 
Jo 2 


2 a + ■ 


a 4- - 


2 a + • 


a + • 


2 a + 


We mention this not because it has any practical importance for us in our pre¬ 
sent work, but rather because it would be unconscionable—almost immoral—to 
deprive the student of the opportunity of seeing one of the most wonderful and 
beautiful individual facts in the whole of mathematics.* 


*See p. 357 of H. S. Wall, Analytic Theory of Continued Fractions (Van Nostrand, 1948; reprinted 
by Dover, 1967). In this reference Laplace’s formula is given among the special cases of the contin¬ 
ued fraction of Gauss for the quotient of two hypergeometric functions. Hypergeometric functions 
are discussed on p. 200 and in Problem 1 on p. 203 of the present writer’s book. Differential Equa¬ 
tions with Applications and Historical Notes (McGraw-Hill, 2nd ed., 1991). 


NOTE ON GAUSS 

The German Carl Friedrich Gauss (1777- 
1855) was the greatest of all mathematicians and perhaps 
the most richly gifted genius of whom there is any record. 
The profound creative activity of this awe-inspiring figure 
embraced all of mathematics—geometry, number theory, 
algebra, and analysis—as well as physics and astronomy. 
The fame of the town of Gottingen in Germany as the lead¬ 
ing center of mathematics in the world until the 1930s dates 
from the time of Gauss, who was professor there and di¬ 
rector of the Gottingen astronomical observatory. His fun¬ 
damental contribution to the concept of normal distributions 


arose from his work on the theory of errors in making phys¬ 
ical measurements.* It is strange and ironic that this colos¬ 
sal figure in the intellectual history of western civilization 
should be almost entirely unknown among most educated 
people. 


*For more information on this, see pp. 78-83 of Carl Friedrich 
Gauss by Tord Hall (MIT Press, 1970); or pp. 138-140 of Gauss: 
A Biographical Study by W. K. Blihler (Springer-Verlag, 1981). A 
brief general account of Gauss’s life and work can be found in Sec¬ 
tion A.25 of the present writer’s book, Calculus Gems (McGraw- 
Hill, 1992). 



PROBLEMS 

1 Find the value of the constant k for which each of the fol¬ 
lowing is a probability density function on (-<», °°): 

ke~\ x \ - —^—r, kxe~* 2 . 

1 + x* 

2 Several functions fix) are defined by the following ex¬ 
pressions for 0 ^ x ^ 1 and are identically zero for all 
other values of x: 1, 3x 2 , 5x 4 , e — e x , 77/2 sin 7 tx Verify 
that each is a probability density function. 


3 Verify that each of the following is a probability density 
function and find its mean m\ 

(a) fix) = i(x + 1) for 0 < jc < 2 and 0 elsewhere; 

(b) fix) = |(1- \x 2 ) for —2 < a < 2 and 0 elsewhere. 

4 Verify that the function defined by fix) = \x for 0 ^ 
x ^ 2 and 0 elsewhere is a probability density function 
and find its mean m and standard deviation cr. 

5 In Example 2, use Fig. 12.12 to estimate the percentages 
of students who receive the grades A, B, C, D, F. 
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CHAPTER 12 REVIEW: DEFINITIONS, CONCEPTS 


Think through the following. 

1 Mean Value Theorem. 

2 L’Hospital’s rule for 0/0. 

3 L’HospitaPs rule for other indeterminate forms. 

4 Improper integrals. 


5 Probability density function. 

6 Mean and standard deviation. 

7 Normal probability density. 

8 Standard normal probability density. 


ADDITIONAL PROBLEMS FOR CHAPTER 12 


SECTION 12.2 

Find the following limits. 

,. sin 5x 
I lim ——— 

x->o sin 2x 


v jc 2 + jc - 30 
lim — — -. 

x ~* s V* - 1 - 2 


lim 


jc — 4 


lim 

a->2 


lim 


4 Vx + 4 - 2 
tan (2jc -4) 


8 


10 lim 


n^TTT - (i + jjc) 
3? • 

^/x + 16 - (2 + ^jc) 


lim 


In (jc - 1 ) 


*->2 jc — 2 
sin toc 


lim 


i 1 


- Y 2 


jc 2 + 2jc - 3 
~-3 2jc 2 + 3jc - 9‘ 


lim 

X —>1 


JC 3 -h JC 


2 _ 


In jc 


II 


lim 


In (jc - 2 ) 


x—>i‘+ (x - 3) 2 

cin 


13 lim 

X —>0 

15 lim 


jc — sin jc 
e Vx ! 


17 H 


19 lim 


sin 1/jc ’ 

1 - cos JC 
jc sin jc 
sin " 1 jc 


20 lim 

x —>0 


jc-»o+ sin 2 3jc ’ 

2 cos jc - 2 + jc 2 


21 lim 


3jc 4 
1 — sin jc 


23 lim 

x —>2 


x—>tt/2 COS JC 

3^jc- 1 


12 

14 

16 

18 


25 lim 

x —>0 

26 lim 


3U - 2) 2 

sin 2 jc + 2 cos jc - 2 
w‘o cos 2 jc — jc sin jc — 1 
sin jc — tan jc 

x->0 


x~ 

COS X - 


27 lim 

X—>0 


l+^ 2 


.. jc sin (sin jc) 

lim -——-. 

x->o l — cos (sin jc) 

r ^ " 1 

lim-:-. 

-*-*o x sin jc 


tan 


-l 


lim i ~ 
*->o+ 1 — cos 2jc 


lim 


16 


~ 16+ V^-2 


22 

24 


lim - 


2jc 


*->o tan 


lim .. 

x->l JC Z 


In jc 


r 2 - 


28 


lim - 
*->o 


29 

31 

33 

35 

37 

39 

41 


lim 

*->o 

lim 

X —> 1 

lim 


^ c 


>o 1 — cos 4jc 

jc 3 + 3e [ ~ x — 4 

i jc — In jc — 1 

jc 2 tan jc 

T-46 tan jc — jc 

.. In (jc + 1 ) 

lim — t~ -—. 

;t—>o e 3x — 1 

- 1 


lim - 9 

x->l JC* — 1 


lim 

x->0 


x - tan 


-l 


JC^ 

tan 2 (1 /jc) 


In 2 (1 + 4/jc) * 


30 

32 

34 

36 

38 

40 


1 ^ LUS A 


lim -- -y. 

x^>7r (jc — TTy 


lim 


- sin jc 


->o x tan jc 


lim 


lim 

*-»0 


lim 

x—>0 


lim 


1 


1 — cos 2\ r a . 

2jc 2 
jc — sin jc 
1 - cos jc‘ 


11111 c • 

x—>it 1 + COS JJC 


42 Consider the circular sector of radius 1 shown in Fig. 
12.14. The point C is the intersection of the tangent 
lines at A and B . If f(6) is the area of the triangle ABC 
and g(0) is the area of the region that remains when 
the triangle OAB is removed from the sector, evaluate 
lim^o fWg(O). 


B 



43 Show that 

.. jc 2 sin ( 1/jc) 

lim- 

*->o jc 

is an indeterminate form of the type 0/0 that exists but 
cannot be evaluated by L’Hospital’s rule. What is the 
value of this limit? Does this example show that L’Hos- 
pital’s rule is false? 
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pital’s rule is false? 

44 Use L’Hospital’s rule to establish the following formu¬ 
las for the direct calculation of the second derivative: 

(a, + + 


,, a + It) ~ 2f(x) +f(X ~ H) 

(b) f (x) = lim- 

h —>0 


h 2 


45 If n is a positive integer, show that 

. nx n + l - (rc + 1 )jc" +1 w(n + 1 ) 

.*->1 (* — l) 2 2 

(For the meaning of this rather strange-appearing result, 
see Problem 46.) 

46 If n is a positive integer and x + 1, the formula 

1 _ r n + l 


1 + X + X 2 + X 3 + • • • + *" = 


1 - X 
X n + ] - 1 
X — 1 


is familiar from high school algebra. Differentiate it to 
obtain 


62 lim tan x In x. 

*-> 0 + 

63 lim (x —“) tan 3x. 

x—> 7r/2 ^ 2 / 

64 lim (lx — 7 r) sec x. 

x~>tt/2 

66 lim x(e Ux — 1). 

x— >00 

68 lim (~^r ~ ^ 


mo \x z x sin x 


69 lim 


1 


70 lim 

>0 


71 lim 

x—*0 


■—>0 \ 1 — cos X 

1 + X 


x 4 

_ 1 

In (1 + x) x 
1 


65 lim tan x In (sin x). 

x->tt/2 

67 lim sin x In (sin x). 

m0+ 


In (1 + x) e x - 1 


72 lim (x - Vx 2 + x). 73 lim x 

x —>00 x— >0+ 


74 lim (sin x) tan + 

*-> 0 + 

76 lim (x 2 - l) x_1 . 


jc->0+ 

75 lim (e x - l) sinx 

x->0+ 

77 lim (cos x) cos + 

x->n/2- 


+ 2x 4- 3x 2 + • ■ • + nx n ~ 1 

UX n+1 — (« + l)x n + 1 


0* - l) 2 


, (*) 


and then take limits as x —> 1 and use Problem 45 to 
derive the formula 


1+2 + 3 + ••• + « = 


/?(/7 + 1) 


*47 Multiply equation (*) in Problem 46 by x, differentiate, 
etc., and thereby derive the formula 


l 2 + 2 2 + 3 2 + • • • + n 2 = 


n{n + \)(2n + 1 ) 


SECTION 12.3 

Evaluate the following limits by any method. 

In (1 - lx) 


3x 2 + 9 
48 lim - * x - 
x + e 


49 lim 


tan 7 tx 


CA ,. 2 +sec* ff1 ,. In* 100 

50 lim -. 51 lim -. 

x->3tt/2 tan * 


* + In * 
52 hm-:-. 

x->°o x In * 


53 lim 


In * 


*-> 0 + cot * 


54 lim 


In (sin x) 


jc->o+ In (sin lx)' 


55 lim 


In * 


0 2x ' 


56 lim . 57 lim 

*->°° Vx x—>oo g x 

58 lim x 2 \n x. 59 Hm xP In x, p > 0. 

m 0+ jr->0+ 


60 Hm x 2 e l/x . 

x->0+ 


61 lim * sin —, p =£ 0. 


78 lim (1 - tan x) 

X—>7t/4- 


79 lim (x + sin x) tan + 

jc->0+ 

80 lim (ln*) sinU_1) . 

mi + 

82 lim * axt> , b > 0. 

m0+ 


1 — tan* 


84 lim (x + O 


ax\b/x 


86 Hm (1 + x) e \ 

X—>oo 

88 lim (1 + - ) . 
*->o \ x I 

90 lim (cot xY- 

Jt-»0+ 

92 lim (1 -2*) 3/ + 
94 lim (1 + 1 -) 5X . 


96 lim (1 + 2*) coU . 

x->0 


98 lim (cos 2x) ] 

mO 

99 Show that 


Ah c 2 


81 lim [In (1 + x)}\ 

M0+ L v ' 

83 lim x x * [x xX = x M ]. 

m0+ 

85 lim (1 + xP ) Vx , p > 0. 


87 lim (1 + esc x) ; 

jc-»0+ 

89 Hm (esc xY. 

m0+ 

91 lim x ,n(1 + ,/x) . 

X—>00 


93 lim 1 + - 

x-*=° \ X 


95 lim <e x + 3x) Ux . 

;t-»0 

97 lim (I + 3xf scx . 

x—>0 ’ 


t sin 2 jc 


lim 

X—»oo 


* + sin * 


is an indeterminate form of the type that exists but 
cannot be found by L’Hospital’s rule. What is the value 
of this limit? 
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100 Find the value a must have if 


lim 


x + a \ x 


= 4. 


SECTION 12.4 

Determine whether or not each of the following integrals con¬ 
verges, and find its value if it does. 


101 j 

\ 2 e 3v dx. 

102 

f°° dx 
h x 3 • 

103 J 

f~ dx 

104 

x 2 dx 

Jo x 3 + 1 ’ 

105 j 

e~ x sin jc dx. 
b 

106 

xe~ x dx. 

107 J 

f°° x dx 
'o x 4 + 1 

108 

xe~* 2 dx. 

109 J 

f°° dx 

110 

C°° dx 

'2 4 + x 2 ' 

h x 2 - r 

111 j 

f°° x 2 dx 
'0 e xi 

112 

r°° In x 
- dx. 

Je X 

113 J 

j" 00 dx 

114 

f°° dx 

e x In xVIn x 

* 


115 


117 


119 


r 

E 

E 


dx 


2 In x 


dx. 


x dx 


116 


118 


E 

f 


77/2 dx 


sin x 
8 dx 

Vl6 - .T 2 


120 Let p be a positive constant. Determine the values of 
p for which the improper integral 
f 1 dx 

Jo (1 - xV> 


is convergent, and those for which it is divergent. 

121 Show that the region in the first quadrant under the 
curve v = l/(x + l) 2 has a finite area but does not have 
a centroid. 

122 If x is a positive constant, show that 

[ e~ a ' x2 dx = — f e~ x ~ dx. 

Jo a Jo 


Without performing any actual integrations, use this fact 
to show that the centroid of the region between the curve 
y = e~ a ~ x ~ and the x-axis is (0, V2/4). 

















INFINITE SERIES 
OF CONSTANTS 


We have touched briefly on this subject several times before, but now the time 
has come to confront it directly. 

An infinite series, or simply a series, is an expression of the form 

Cl\ + + #3 +***'+' + * * * , (1) 


13.1 

WHAT IS AN 
INFINITE SERIES? 


where the three dots at the end indicate that the terms continue indefinitely. In 
other words, there are infinitely many numbers a n (one for each positive integer 
n) and (1) is the indicated sum of this infinite collection of terms. The number 
a n is called the nth term of the series, and is usually some simple function of n. 
We include it in (1) if we wish to make an explicit statement of the law of for¬ 
mation of the terms. However, if this law of formation is clear from the context, 
we can write (1) more informally as 

a \ + a 2 3" a 3 3" * ’ ’ or n\ + ci 2 + ‘ * • • 


We will often use the sigma notation of Section 6.3 to write the series (1) in the 
compact form 

X 

n= 1 

The is read “the sum from n = 1 to infinity of a n r 

Needless to say, it is quite impossible to perform the operation of addition an 
infinite number of times—life isn’t long enough—so (1) cannot be interpreted 
literally and its meaning must be approached in a subtler way. It was one of the 
great achievements of nineteenth-century mathematics to discover that a perfectly 
reasonable and satisfactory meaning can be given to (1) by using the concept of 
the limit of a sequence. If we exercise suitable caution, this meaning allows us 
to work with infinite series just as easily as if they involved only a finite num¬ 
ber of terms. In many cases we will actually be able to find the number that is 
the exact sum of the series, and these sums often turn out to be very surprising 
indeed. 

We will get to all this in the following sections, but first we briefly consider 
a few of the many natural ways in which infinite series arise in mathematics. 
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A We usually assume that we understand the real number system, in particu¬ 
lar, what is meant by an infinite decimal. However, it is often overlooked that an 
infinite decimal is defined as an infinite series, 

. ai a 2 a 3 + w + w + ... 

where each of the a’s is understood to be one of the 
Everyone knows that 

T = 0.333 ..., (3) 

but not everyone is sure why, or even what this means. This is not at all sur¬ 
prising, because (3) cannot be fully understood without some acquaintance with 
infinite series, enough to use (2) to evaluate the right side of (3). We will discuss 
this and other related issues in Section 13.3. 


+ -77- + 
10 ” 


( 2 ) 


ten digits 0, 1, 2,. . . , 9. 


B The elementary long division of 1 — jc into 1, i.e., 

1 + x + ;t 2 + • • • 

1 - xW 

I -X 


X 



tells us that 


1 

--= 1 + jc + * 2 + • • • + x n ~ ] + - 1 -. ( 4 ) 

\ ~ X \ —X 

This process can be carried out to as many steps as we wish, and it is natural to 
wonder how the function on the left of (4) is related to the infinite series that 
seems to be forming on the right. That is, is it true that 

—= \+ x + x 2 + ■■■'? (5) 

1 - X 

Many readers have seen this series before, in connection with geometric pro¬ 
gressions in elementary algebra. The series on the right of (5) is usually called 
the power series expansion of the function on the left, because it contains steadily 
increasing powers of jc. 

C Other important power series expansions are now readily available to us, even 
though full verifications are not. For example, if we replace x by - x in (5) we 
obtain 


-j—I— = 1 - x + X 2 - X 3 + • • • ; (6) 

1 + x 

and if we now replace x by x 2 in (6), the result is 

. ~ 2 = 1 - x 2 + x 4 - x 6 + • • • . 

1 + ;c z 


(7) 
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The left sides of (6) and (7) remind us of the familiar formulas 


It 


dx 


+ x 


= In (1 + jc) 


and 


It 


dx 


+ x 2 


= tan 


By integrating the right sides of (6) and (7) as if they were polynomials (but re¬ 
member: they are not polynomials!), we obtain the power series expansions 


x ^ x 3 

In (1 +x)=x — + — 


( 8 ) 


and 


tan 1 x = x — 


x 3 x 5 x 7 

T + T“7 + 


(9) 


If these formulas are valid for* = 1 — and this is a very big “if”—then by putting 
x = 1 and reversing the resulting equations we get 


i-h-f 


= In 2 


( 10 ) 


and 


——+-—-+■■■=—. 
3 5 7 4 


( 11 ) 


Equation (11), connecting n with the positive odd numbers, was discovered by 
Leibniz in 1673 (Appendix 3 at the end of Chapter 10), and was one of the most 
beautiful mathematical discoveries of the seventeenth century. It made a deep im¬ 
pression on the minds of the earliest workers in the field of calculus, as it does 
on us. 

We emphasize that these derivations of (8), (9), (10) and (11) do not consti¬ 
tute acceptable mathematical proofs, because the validity of the procedures used 
in obtaining them has not been established. At this stage and with only these sup¬ 
porting arguments, they have only the status of conjectures. It must be remem¬ 
bered that there is a very wide chasm in mathematics between conjecture and ac¬ 
tual knowledge, and we wish to know. 


D Finally, infinite series arise in a very insistent way in the study of differen¬ 
tial equations. To see how this happens, let us consider the simple equation 


dy. 

dx 


y- 


( 12 ) 


This equation asks for a function which is unchanged by differentiation, and we 
know that y = ce x is such a function for every constant c; in fact, we know that 
these are the only functions with this property (Section 8.3). But to emphasize 
the point we wish to make, let us pretend that we don’t know any solutions and 
try to guess one. Since polynomials are the simplest functions of all, we might 
try one of these first. But we have no idea what degree to choose for this hoped- 
for polynomial solution. This suggests the use of a bit of creative vagueness, so 
we leave the degree unspecified and try to find coefficients ao, a\, 02 , < 33 , < 34 , . . . so 
that 


y = ao + a\x + + nyx 3 + + • • * 


( 13 ) 
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will be a solution of (12). By differentiating (13) term by term we obtain 

= a\ + 7ci2X + 3a?x 2 + Aa 4 x 3 + • • • ; ( 14 ) 

and substituting (13) and (14) in (12) gives 

ct\ + 7ci2X + 3a?x 2 + Aa 4 x 3 + ■ ■ ■ = a 0 + a x x + o^r 2 + a?x 3 + • • • . (15) 

If we now equate coefficients of equal powers of x in (15), we get 
a\ = flo, 7ci2 = a\, 3a?, = a 2 , Aa 4 — a?, . . ., 

so 


_ _ a \ _ #0 
a\ - a () , a2 — — - 


_ ci2 _ ao _ a 3 _ a o 

^3 ^ 2.3 ’ ^4 ^ 2-3.4’ 


( 16 ) 


At this point we remind students of the factorial notation introduced in Sec¬ 
tion 3.6. If n is a positive integer, we write 

n\ = 1 • 2 ■ 3 • • • n (17) 

and call this n factorial. Thus, 1! = 1, 2! = 1 ■ 2 = 2, 3! = 1 • 2 • 3 = 6, 4! = 
1 • 2 • 3 • 4 = 24, 5! = 1 ■ 2 ■ 3 ■ 4 ■ 5 = 120, and so on. The definition (17) is 
meaningless in the case n = 0, but for many reasons it is customary to define 0! 
by 0! = 1. We shall be using factorials often in the next few chapters, so there 
will be ample opportunity for students to become thoroughly familiar with this 
notation. 

Returning to our problem, we can use the factorial notation to write equations 
(16) as 


«i = a 0f 


a 2 


ao 

2!’ 


«3 




a 4 = 


ao 


3!’ * 4!’ 

Our tentative solution (13) of the differential equation (12) therefore becomes 

v _ „ + „ r + ^0 v2 + ^0 3 + ^0 4 . . . . 

y — flo + ao.\ + + 3!* + A\ X + 

/ JC 2 X 4 

= «o( 1 + x + Y + ¥ + ¥+ -- 

where a () is an arbitrary constant. On comparing this with the known solutions 
ce \ we are led to the natural conjecture that the exponential function e x equals 
the infinite series shown in parentheses: 


JC 2 X 4 


( 18 ) 


It turns out that this formula—the power series expansion of e x —is indeed true. 
In fact, for all values of x we can calculate e x as accurately as we please from 
the series by taking enough terms, and this is how numerical tables for e x are 
constructed. However, students should clearly understand that this discussion is 
merely suggestive, and is by no means a valid proof. Proofs will come later. 

In attempting to solve other differential equations in this way, we are led to 
other series, some representing known familiar functions but many representing 
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previously unknown functions. As an example of the former, we know from Sec¬ 
tion 9.6 that the important differential equation 


° 


(19) 


has the general solution 

y = c i sin x + c 2 cos x. 


But if we try to solve (19) by means of power series, in the manner suggested 
by the above discussion, then we obtain the power series expansions of the sine 
and cosine, 


sin x = 



( 20 ) 


and 


cos x = 



( 21 ) 


The validity of these two expansions will be proved in Section 13.4, and this ap¬ 
proach to solving differential equations will be explored further in Chapter 14. 


In working with infinite series, as in almost any part of calculus, there are 
many things that we want to be able to do freely with the tools we are studying. 
The role of the theory is mostly to justify the various procedures that are neces¬ 
sary for carrying out our purposes—such procedures, for example, as the term- 
by-term differentiation and integration used above. This situation was well ex¬ 
pressed by the famous financier J. P. Morgan in describing the role of lawyers in 
his business operations. “I don’t hire lawyers to tell me what I can’t do,” he said. 
“I hire them to find legal ways for me to do what I want to do.” 

Our overall aim in this chapter and the next is to establish power series as fa¬ 
miliar and reliable tools. The main reason for this is that we want to be able to 
accept the power series that arise from differential equations as legitimate and 
fully satisfactory solutions of these equations. 

Of course, every power series is an infinite series of functions. But when x is 
given a specific numerical value, as in obtaining (10) and (11) above, then the 
power series becomes an infinite series of constants. We therefore undertake a 
careful study of series of constants in this chapter to provide a solid foundation 
for our work with power series in the next chapter. 

Nevertheless, our interest in series is not confined to their practical value for 
applications, and in the course of our work we will touch on many fascinating 
topics in pure mathematics that are well worth studying for their own sake. Thus, 
we will see that series are linked to some of the most interesting parts of the the¬ 
ory of numbers, concerning prime numbers, irrational numbers, the nature of the 
constants e and i r, and similar matters. We wish to keep the structure of this chap¬ 
ter as simple as possible and still put the full richness of the subject within easy 
reach of the interested reader. For this reason we place most of this optional ma¬ 
terial in the appendices at the end of the chapter, where it can be examined or 
not according to the wishes of the individual student. 
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13.2 

CONVERGENT 

SEQUENCES 


Any reasonably satisfactory study of series must be based on a careful definition 
of convergence for sequences. However, the behavior of most sequences is easy 
to understand without elaborate explanations, and a genuine theory of conver¬ 
gent sequences would be an unwelcome obstacle blocking our way to the main 
concepts of this chapter. We will therefore discuss sequences rather briefly, and 
try to steer a middle course between excessive informality and tedious detail. 

If to each positive integer n there corresponds a definite number x n , then the 
jc„’s are said to form a sequence. We think of the x„s as arranged in the order of 
their subscripts. 


XU X2, ■ ■ • , X n , . . . , 

and we often abbreviate this array to {*„}. It is clear that a sequence is nothing 
but a function defined for all positive integers n, with the emphasis placed on the 
subscript notation x n instead of the function notation x(n). The numbers consti¬ 
tuting a sequence are called its terms. Thus, x\ and X 2 are the first and second 
terms of the given sequence, and x n is the nth term.* 

Example 1 In each of the following we define a sequence {*„} by giving a for¬ 
mula for its nth term: 

(a) x n = 1, that is, 1, 1, 1, . . . ; 

(b) x n = [\- (-1 )”]/2, that is, 1, 0, 1, 0, ...; 

(c) x n = 1 In, that is 1, y, y, . . . ; 

(d) x n = (n — 1 )/n , that is 0, j, f, f, . . . ; 

(e) x„ = ( —1 ) n+] /n, that is, 1, — y, y, — y, . . . ; 

(f) x„ = 1 + -1 + -1 + • ■ ■ + 

(g ) A ) ,= 1 + i + i+- -+^ ; 

(h) Xn = (i + 2j" 

A sequence like (a), in which all the terms are equal, is called a constant se¬ 
quence. Not every sequence has a simple formula, or even any formula at all. 
This is shown by the sequence { d n }, where d n is the nth digit after the decimal 
point in the decimal expansion of it. 


It is sometimes convenient to relax the definition and allow a sequence to start 
with the zeroth term xo, or even with the second or third term X 2 or * 3 , instead 
of requiring it to begin with the first term x { . One reason for this is that we want 
to include sequences like that defined by x„ = 1 /ln n , where x\ is meaningless. 
In any case, we continue to call the term with subscript n the nth term. 

A sequence {*„} is said to be bounded if there are two numbers A and B such 
that A < x n < B for every n , and in this case A is called a lower bound and B an 


*The words “sequence" and “series” have essentially the same meaning in ordinary speech, but in 
mathematics their meanings are quite distinct. A sequence is merely an infinite list of numbers 
arranged in order, with a first, a second, and so on, whereas a series is an infinite sum of numbers. 
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upper bound for the sequence. A sequence that is not bounded is said to be un¬ 
bounded. In Example 1, it is easy to see that sequences (a) to (f) are bounded, 
but it is less obvious that (g) is not (hint: t + T>4+T = T> l + i + T + {> 
i + i + i + an d so on )- The sequence (h) is also bounded, but this is not 

evident on inspection and will be established below. 

Our main interest is in the concept of the limit of a sequence. Roughly speak¬ 
ing, this refers to the fact that certain sequences {x, ? } have the property that the 
numbers x n get closer and closer to some real number L as n increases. Another 
way of stating this is to say that \x n — L\ gets smaller as n gets larger. As an il¬ 
lustration, consider the sequence [x n ] whose nth term is x n = (n — 1 )/n: 


These numbers seem to “approach” the number 1 as we move farther and farther 
out in the sequence. As a matter of fact, for each n we have 


x n - \ = 


n - 1 


and the number 1 In, and therefore \x n — 11, can be made as small as we please 
by taking n sufficiently large. We express this behavior by saying that the se¬ 
quence has the limit 1, and we write 

1 , 


lim 

n 

It is helpful to visualize this behavior in the manner suggested by Fig. 13.1. 

The general definition is as follows. A sequence [x n ] is said to have a num¬ 
ber L as limit if for each positive number e there exists a positive integer no with 
the property that 

|jc„ — L\< e for all n > n 0 . (1) 


When L is related to {x n } in this way, we write 
lim x n = L, or more briefly, 


lim x n = L, 


and we say that x n converges to L. This is also expressed by saying that x n ap¬ 
proaches L as n becomes infinite, which we can write as 


—-— 

*1 

Figure 13.1 


x n —>L 


This notation is often abbreviated even further, to x n —> L. 

This definition requires that each e, no matter how small, must have at least 
one corresponding no that “works” for it in the sense expressed by (1). In gen¬ 
eral, we expect that for smaller e’s, larger «o’s will be needed; that is, when the 
required measure of closeness is made smaller, we must go farther out in the se¬ 
quence to satisfy it. 

A sequence is said to converge or to be convergent if it has a limit. A con¬ 
vergent sequence cannot have two different limits, because it is not possible for 
x n to be as close as we please to both of two different numbers for all sufficiently 
large n’s. 

A convergent sequence is bounded, but not all bounded sequences are con¬ 
vergent. The sequence 1, 0, 1, 0,... of Example 1(b) is a bounded sequence that 
is not convergent. 


l 


x 2 x 3 x 4 *** 
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It is not always easy to decide whether a given sequence converges, and if it 
does, what its limit is. The following facts are often useful in problems of this 
kind: If x n —» L and y n —> M, then 

lim (x fl + >■„) = L + A/, lim (jc„ — y n ) = L — M, lim x t} y n = LM, 
and, with the additional assumption that AL ^ 0, 


These facts can be rigorously proved by carefully using the definition and the 
properties of inequalities. We omit the details. By using these rules, we can eas¬ 
ily perform such feats as calculating 

2n 3 + n — 5 _ 2 + \/n 2 5 !r? = 2 + 0- 0 ^2 

lm In 3 - In 2 + 4 ” lm 7 - 2/n + 4/w 3 _ 7-0 + 0 _ 7’ 

where the essential first step is to divide both numerator and denominator by the 
highest power of n occurring in the denominator. 

The usual intuitive idea of convergence—that ;t, 7 —means that x n can 
be made “as close as we please” to L by taking n “sufficiently large”—is 
natural and necessary, and is the way most mathematicians really think about 
this concept. Accordingly, in most of our work with sequences we shall rely 
on common sense to tell us how much detail is needed to make an argument 
convincing. 

Example 2 If \x\ < 1, then lim x n = 0. Most people are willing to accept this 
on the grounds that “a number numerically less than 1 which is raised to higher 
and higher powers gets smaller and smaller.” But if a more detailed argument is 
desired, it can be given as follows. The assertion is clear if x = 0, so assume that 
0 < \x\ < 1. Then \x\ = 1/(1 + a) for some a > 0, so by the binomial theorem 
we have 


1 



= (1 + a) n = 1 + na + positive terms > na. 


We see from this that \x n \ < 1 !na\ and since Mna —» 0, we clearly have x n —» 0. 


Example 3 For every x, lim x n /n\ = 0. This is not at all obvious, because even 
though n\ increases rapidly as n grows, for large values of jc it is quite conceiv¬ 
able that x n might grow even more rapidly. To demonstrate that x n /n \ —> 0 as 
n oo, we may assume that jc > 0 (why is this permissible?). To start the argu¬ 
ment we choose a fixed positive integer m so large that x/m < and then we put 
a = x m /m\. For any integer n> m we write n = m + k and observe that 

* *_ (l\ k 

n\ a m - 1-1 m + 2 m + k \ 2 / 

As n —» °o, k also —» °o, so a(\) k —» 0, and we conclude that x n !n\^> 0. 


Example 4 The fact that lim (Vn + 1 - Vn) = 0 will probably seem reason¬ 
able after a little thought (it only says that Vn is nearly equal to Vh + 1 for 
large n), but a definitive argument may not be so easy to find. Such an argument 
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can be constructed by writing the quantity V/T-TT — Vw as a fraction with de¬ 
nominator 1 and rationalizing the numerator, as follows: 

Vn + 1 — Vn _ Vn + 1 ~ Vn V/j -f 1 + Vn 
1 1 Vw + 1 + Vn 


Vn + 1 + Vh 


In working with sequences in connection with infinite series, we will often 
need to be able to recognize that a sequence is convergent even though we know 
nothing about the numerical value of the limit. In such a case we cannot make 
any direct use of the definition of a limit. We now discuss a very important method 
for handling such situations. 

A sequence {*„} is said to be increasing if 

x\ ^ *2 ^ *3 - ‘ ‘ ‘ - x n - *n+\ ^ ‘ • ' , 

that is, if each term is greater than or equal to the one that precedes it.* Among 
the sequences listed in Example 1, (a), (d), (f), and (g) are clearly increasing; 
(h) is also increasing, but this is not obvious on inspection. 

Increasing sequences are pleasant to work with because their convergence be¬ 
havior is particularly easy to determine. We have the following simple criterion: 
An increasing sequence converges if and only if it is bounded. This criterion is 
not only simple, but also extremely important, because the theory of convergent 
series given in the rest of this chapter stems directly from it. 

This criterion is quite easy to establish. Imagine the terms of the sequence plot¬ 
ted on the real line, as shown in Fig. 13.2, with each term to the right of (or on) its 
predecessor. If the sequence is unbounded, then its terms simply march off the 
page, and the sequence clearly cannot converge. This proves half of the criterion, 
the “only if” part. To establish the other half, we assume that the sequence is 
bounded with an upper bound B, as shown in the figure, and we must produce a 
limit for the sequence. Very briefly, we see geometrically that then’s, which move 
steadily to the right and yet cannot penetrate the barrier at B, must “pile up” at some 
point L < B, so L is the limit of the sequence and the sequence converges to IV 
This convergence criterion has many important applications, one of which is 
given in the following example. For this we will need the formula for the sum 
of a geometric progression, 

1 + x+x 2 + • • • + x n ~ 1 = 1 x+\. (2) 

1 - Jt 

This formula is merely a rearrangement of equation (4) in Section 13.1. It can 
also be established in another way, by dividing x — 1 into x n — 1. 


Figure 13.2 A bounded increasing 
sequence. 


*Some writers require the terms of an increasing sequence to satisfy the strict inequality x n < x n+ \ 
for all n. However, our definition allows an increasing sequence to be stationary, in the sense that 
adjacent terms may be equal. 

+ A much more detailed argument can be given here, but we have no wish to strain the patience of 
reasonable people. See Appendix A. 1 for some of the details. 


L B 

i 
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Example 5 Our purpose here is to prove that 


hmU+T + T+.-.+i-^, 


( 3 ) 


We accomplish this by discussing together the two closely related sequences { x n } 
and {y„} defined by 


*“( 1 + i)" and y " = 1 + iT + T + -" + ir- 

We will demonstrate that both of these sequences are increasing and bounded, 
and therefore convergent, and furthermore that they converge to the same limit. 
Our first step is to show that { x n } is increasing and bounded. By the binomial 
theorem, x n can be expressed as the following sum of n + 1 terms: 


, 1V 1 . n(n ~ 1) \ n(n - \ )(n - 2) 1 

*n = 1 + — = 1 + n- — +-XT-’ T +-- 3 + 

n n 2! n z 3! rv 


n{n - !)■■■[/!-(it - 1)] 1 


= 1 + 1 + 


* KKKM )- 


+ ■ 


1 - 


n ~ 1 


(4) 



As we pass from x n to x n +\ by replacing n by n + 1, it is easy to see from this 
sum that each term after 1 + 1 increases, and also that another term is added, so 
x n < x n +\. Further, a term-by-term comparison of (4) with y n shows that x n < y n . 
By applying formula (2), we see that the y n 's have 3 as an upper bound, 


y n = 1 + 1 + 2 


1 


2*3 


2 • 3 ■ 


< 1 + 1 + — + 
2 2 2 


,-.-= 1+2 (l <3 ’ 


and therefore the x n ’s also have 3 as an upper bound. Since {*„} is an increas¬ 
ing sequence with 3 as an upper bound, we know that it converges. Its limit is 
of course the number e y which was introduced in a somewhat different way in 
Section 8.3, 

lim (1 + — ) = e. (5) 

n->oo y n ) 

Since x n <y n <y n +\ < 3, we see that {y„} is also abounded increasing sequence 
which approaches a limit y > e. All that remains is to show that y < e y because 
this will yield our main conclusion that y — e. The argument is as follows. If m < 
n and we consider only the first m + 1 terms of (4), then we have 



If m is held fixed and n is allowed to increase, then we obtain 
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}'m - 1 + 1 + + ■ • • + “7 — e, 

2! 3! m! 

so y < e. We conclude from this that y = e, or 

1 


lim [ i + 1 + — + ■ • ■ + 


which is (3). We observe that 


lim f 1 —-Y = —, 

n ^oo \ n e 


because this limit can be written as 

1 V j+1 


= lim 
n—>°° \ n + 


lim , * 

n — >°° \ n + 1 

The additional fact that 


is an immediate consequence of (6). 


ji y +I _ . n/(n +1) _ 1 

~n^L (l + \/ n ) n _ 7' 


lim 1 - 


( 6 ) 


(7) 


Example 6 Most students will recall that a prime number, or simply a prime, 
can be defined as an integer p > 1 that has no positive factors (or divisors) ex¬ 
cept 1 and p. These numbers form one of the most interesting of all sequences, 

2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, ... . (8) 

Indeed, the fact that there are infinitely many of them, so that they actually do 
constitute a sequence, is itself a famous theorem of number theory proved by Eu¬ 
clid. The sequence (8) is clearly not convergent, and it may appear that the con¬ 
cept of a convergent sequence has little or no relevance to the primes. However, 
this impression is quite wrong, for students who wish to pursue the subject will 
find that the convergence behavior of certain sequences is very close to the heart 
of the modern theory of prime numbers. We support this remark by stating with¬ 
out proof the following very profound theorem about the approximate size of the 
/ith prime: If p n denotes the /ith prime number, then p n is “asymptotically equal” 
to n In n, in the sense that 

lim —t 5 — = 1. (9) 

n In n 

Readers who are interested in these matters are urged to take a course in num¬ 
ber theory. More information about (9) can be found in Section B.16 of the au¬ 
thor’s book, Calculus Gems (McGraw-Hill, 1992). 


PROBLEMS 


l 


State whether each of the indicated sequences converges 
or diverges, and if it converges, find its limit: 


(a) 


(b) 

1 +(-!)" 

n 

(e) 

3" 

2" + 10'° ’ 

(f) 

(c) 

IT 

sin — ; 

5 n 

(d) 

io lo '°\4 

n + 1 

(g) 

In (n + 1) — In n; 

(h) 


Vn + 2 . 
2\4 
n 2 

V4« 4 + 5 ’ 
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(*)-TT 

n n + 1 


(k) cos 
(m) 

(o) n sin - 


(2 n + 1 ) 2 7 t 


(j) 

( 1 ) 

(n) 

(P) 


cos mr\ 
5 n 3 - 


2 n 


n 4 + 3 n 2 — 10 ’ 
sin (n\e n ) 
n + 1 

(l - v«)(3 + yjj) 

4 n + 5 


Show that n\/ri' —>0. Hint: Write it out, and look. 

The limits of many sequences can be found by replac¬ 
ing the discrete variable n by a continuous variable jc 
and applying L’Hospital’s rule for the case * —> °o. Use 
this method to show that 

(a) —^ 0 ; (b) Vn^> 1 ; 

n 

(c) if \a\ < 1 ; then m n —» 0 ; 

(d) if k is any positive integer, then n k /e n -» 0 ; 

(e) if a is any real number, then (1 + a/n) n —> e a . 

State whether each of the indicated sequences converges 
or diverges, and if it converges, find its limit: 


(a) 3 3/ ”; 

(c) n/T\ 

(e) n 2 iy i \ 

(g) (n + 10 ) l/(,I+1()) ; 
(i) n 2 cos mr. 

Find lim x„ if 


(b) e 


- 10/h. 


(d) 


In (n + 1 ) 


n 

(f) /i 1/( » +1 >; 
(h) n 2 sin mr\ 


(a) x„ 

(b) x n = n 


Va?(V^? + ci - Vw); 

r 


a + ■ 


- ^4 


If 0 < a < b, show that lim V a" + b n = b. 

If/(;c) = lim„^oo ( 2 / 7 r) tan -1 nx, show that f(x ) = x/\x\ 
if jc =£ 0 and/(0) = 0. Sketch the graph of this function. 
If the terms of a sequence {*„} are positive numbers, 
show that: 

(a) the sequence is increasing if x„+\!x n > 1 for all n; 

(b) the sequence is decreasing if x n +\!x n < 1 for all «. t 
Use Problem 8 to show that lim x n exists if 


(a) x„ = 


(b) x„ = — 

tr 


(c) = — 

n 


3 - 5 • • • (2 n - 1) 

2 • 4 • 6 • • • (2 n) ; 

2 ■ 4 • 6 • • ■ (2 n) 


1 ■ 3 ■ 5 • • 
2 • 4 • 6 ■ 


(2/i - 1) 
• • (2/Q 


■ 3 • 5 ■ • • (2n — 1) 


’’Naturally, a sequence (jc„) is said to be decreasing if 
X\ > X 2 ^ ^3 — ’ - * — *n — X n + 1 ^ ’ ’ , 

i.e., if each term is less than or equal to the one that precedes it. 


10 Find the value of 


(a) lim 

(b) lim 


(n + \) n 
n" +l ' 

(n + 1 )ln n — n In (n + 1) 
In n 


11 Show that 

n + 1 (n + l ) 2 (n + I ) n 

n 2 n? ri ' +1 

12 Show that 


(a) 1 + 


1 \ 2 "+3 


2 n + 3 


-> e\ 


(b) (1 + -V)" -» e; (c) (l + —> e 2 \ 


n* 




2 n 


*13 Consider a suitable number of circles of equal size 
packed in n rows inside an equilateral triangle, as shown 
in Fig. 13.3. If c n denotes the number of these circles, 
then it is clear from the geometry of the situation that 
c\ = 1, c 2 = 1 + 2, C 3 = 1 +2 + 3, and so on. If A is 
the area of the triangle and A n is the combined area of 
the c n circles, show that 


lim + = 

n->°o A 


TT 

2 V 3 ' 



14 The sequence 


1.1 +y. 1 + • • • 

can be defined recursively by x x = 1, x, l+ \ = 1 + 

-—-— for n > 1. Assuming that lim x n = L exists, 
1 +x„ 

show that L = V2. This gives the continued fraction 
expansion 

V2 = 1 + - 

2 + 


2 + • ■ • 
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Most people are familiar with the fact that 

l+y + T + i+ *'' = 2. (1) 

However, since we cannot add infinitely many numbers in the same way that we 
can add finitely many, the meaning of (1) is evidently quite different from the 
meaning of a statement like 

1+2 + 3+4=10. 


13.3 

CONVERGENT AND 
DIVERGENT SERIES 


What (1) really means is that the sequence of partial sums on the left, that is, the 
sequence of numbers 


1 +i = ly, 

1 + y + y = if, 
i+I + i+ i = il 

converges to the number 2 on the right. This suggests the approach we adopt for 
the general case. 

If ai, <22, ... is a sequence of numbers, then the expression 

X a ” = a ' + ^2 + * • * + a n + * * * (2) 

71 = 1 

is called an infinite series, or simply a series, and the af s are called its terms. 
We emphasize that until a meaning is assigned to it by a suitable definition, the 
expression (2) is merely a formal collection of symbols arranged in a certain way, 
because the indicated operation of adding infinitely many numbers has no mean¬ 
ing in itself. To attach a numerical value to (2) in a natural and useful way, as 
suggested in the preceding paragraph, we form the sequence of partial sums 

s\ = a u 

s 2~ a \ + a 2> 


s n ~ a\ + 02 + * * * + a n . 


The series (2) is said to converge, or to be convergent, if the sequence { s n } con¬ 
verges; and if lim s n = 5, then we say that the series converges to s or that s is 
the sum of the series, and we express this by writing 

a i + a2 + * * * + <?n + * * * = s or X a n ~ s. 

n= 1 

If the series does not converge, then we say that it diverges or is divergent, and 
no sum is assigned to it. 

At this point a few remarks about notation and usage are in order. As we in¬ 
dicated, the statement that the series 2«=i a n converges to the sum s is usually 
written ^Zn=\ a n = s - Thus, the notation i<3„ is used with a dual meaning: to 
specify a series regardless of convergence or divergence, and also (if the series 
converges) to denote its sum. Students will find that this ambiguity causes no 
difficulty in practice. 
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Another matter concerns the indexing (or numbering) of the terms. It is often 
more natural to number the terms of a series beginning with n = 0; that is, we 
write some series in the form 

X °n = a o + a { + •••+«„+•• • 

n = 0 

(and in this case we also write s n = ao + a\ + • * * + a n ). For example, the se¬ 
ries on the left of (1) can be written as 

V 1 

or 2- 2«» 
n= 0 L 

but the latter form is somewhat neater. It is a triviality that any general statement 
about series written as 2„=i a n has an exact analog for series written as 
For this reason, when no ambiguity is likely or when the distinction is immate¬ 
rial, we often omit the limits of summation and for the sake of simplicity write 
2 a n instead of = i a n or These remarks also apply to series of the form 

YZ=k a n for any integer k > 2. 

We now briefly consider several fundamental examples. 


X 


Example 1 Probably the simplest and most important of all infinite series is the 
familiar geometric series 

^ x n = l + X + x 2 +■■■ . (3) 

n = 0 

By a slight alteration of equation (2) in Section 13.2 (replace n by n + 1), the 
nth partial sum of this series is given by the closed formula 

1 — x n+l 

s n = 1 + X + * 2 + • • • + x n = — - (4) 

1 — X 


if x =£ 1. If \x\ < 1, we see from this that s n —> l/( 1 — x), so for these jc’s we have 
1 + X + * 2 + •••+*" + ••• = ——. (5) 

1 — Jt 

The series (3) therefore converges to the sum 1/(1 — x) for \x\ < 1, and is read¬ 
ily seen to diverge for all other values of x. This answers the question raised in 
part B of Section 13.1. 

It is now easy to understand the full meaning of formula (1): the series on the 
left is a geometric series with x = so by (5) we have 


Similarly, 


and 




= 2 . 



5_ 

3 


1 - (—f) 


3_ 

5' 


Further, if we write the repeating decimal 0.333 . . .as an infinite series and ap¬ 
ply (5) at the right stage, then we get the result mentioned in Section 13.1, 
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io 2 + io 3 + 






Example 2 Another series whose behavior is particularly simple is 




To establish convergence and verify that the sum is 1, we use an ingenious trick 
due to Leibniz and observe that 


n(n + 1) n n + 1 ‘ 

This enables us to write the nth partial sum as 



n + 1 ’ 


which makes it obvious that s n —> 1. Any series whose nth partial sum collapses 
in this way into a closed formula is called a telescopic series. 


As these examples suggest, the most direct method for studying the conver¬ 
gence behavior of a series is to find a closed formula for its nth partial sum. The 
main disadvantage of this approach is that it rarely works (first the good news, 
then the bad news!), because it is usually impossible to find such a formula. It 
is this situation that forces us to rely mostly on various indirect methods for es¬ 
tablishing the convergence or divergence of series. 

The main indirect method rests on the convergence criterion for sequences dis¬ 
cussed in Section 13.2, that is, on the fact that an increasing sequence converges 
if and only if it is bounded. Thus, if the terms of our series are all nonnegative 
numbers, then we clearly have s n < s n + a n +\ = s n +\ for every n, and therefore 
the s„ s form an increasing sequence. It follows in this case that the sequence 
{s„} of partial sums —and with it the series —converges if and only if the s n 's 
have an upper bound. Our next example furnishes a good illustration of the use 
of this simple but important idea. 

Example 3 The harmonic series 



(6) 


diverges because its partial sums are unbounded, as we saw in Example 1(g) at 
the beginning of Section 13.2. To establish this in a bit more detail, let m be a 
positive integer and choose n > 2 m+ 1 . Then 















442 


INFINITE SERIES OF CONSTANTS 


t> 1 +y + j 



1 


2^2 + I 





+ 


f_L- 
\2 m + 1 



>l + 2 .i + 4 .l + 

2 4 8 


• + 2 "' • 


2 m+ ' 


■ = (m + 1) —. 


This proves that s n can be made larger than the sum of any number of {’s and 
therefore as large as we please, by taking n large enough, so the s n 's are un¬ 
bounded and (6) diverges. A series that behaves in this way is often said to 
diverge to infinity, and we express this behavior by writing 


v 1=1+1 + !+... = , 

■-1 n 23 


A great many interesting series—some convergent and others divergent— 
can be obtained from the harmonic series by thinning it out, that is, by delet¬ 
ing terms according to a systematic pattern. For instance, if we remove all 
terms except reciprocals of powers of 2, what remains is the convergent 
geometric series 




and if we remove all terms except reciprocals of primes, then—as we shall see 
in a later section—the resulting series diverges. 


I 


Pn 


J_ 

2 



+ ■ ■ ■ 


— oo. 


The simplest general principle that is useful in deciding whether a series con¬ 
verges or not is the nth term test : 


If the series 

X a n = a 1 + 02 + ■ ■ • + + ■ ■ * 

« = 1 

converges, then a„ —> 0 as w —> ©o; or equivalently, if a n does not approach zero as 
n —> °°, then the series must necessarily diverge. 

To prove this, we merely observe that a n — s n — s n -\^>s — s — 0. This result 
shows that a n —»0 is a necessary condition for convergence, in the sense that it 
follows from the convergence of the series Unfortunately, however, it is not 
a sufficient condition; that is, it does not imply the convergence of the series. 
This is easy to see by considering the harmonic series 21/rc, which diverges even 
though 1 /ai —> 0. The nth term test is essentially a divergence test. As examples 
of its use, we mention the series 

X (-i) n+i = i — i + i —1+ -- 

n = 1 
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and 

^- = 1 + I + 3 
+ I 2 3 4 

The first diverges because the sequence (— 1 ) n +1 does not converge at all, and so 
cannot converge to zero, and the second diverges because n/(n + 1)^1 =£ 0. 

Remark Repeating decimals. The procedure for converting any rational num¬ 
ber a/b (in lowest terms) into its decimal expansion is well known: Divide b into 
a. Let us carry out this procedure in the case of the rational number -y, which is 
often used as the simplest rational approximation to i r, correct to two decimal 
places: 

3.142857142857 .. . 

7122.0000000. 

23 
1 0 
_7 
30 
28 
20 
J_4 
60 
56 
40 
35 
50 
49 

10 • ■ • . 

The successive remainders here are 1, 3, 2, 6, 4, 5, 1; as soon as 1 appears a sec¬ 
ond time, the cycle begins all over again and generates the repeating block of 
digits 142857. This example illustrates—and almost proves—the fact that the 
decimal expansion of any rational number is repeating .* The proof of this gen¬ 
eral statement consists of little more than noticing the phenomena displayed in 
the example. When b is divided into a , the remainder at each stage is one of the 
numbers 0, 1, 2, . . . , b - 1. Since there are only a finite number of possible val¬ 
ues for these remainders, some remainder necessarily appears a second time, 
and the division process repeats from that point on to give a repeating decimal. 
We also note that if the remainder 0 appears, then the decimal terminates, but 
a terminating decimal can always be thought of as repeating, as in 0.25 = 
0.25000 .... 

The converse of this statement is also true: Any repeating decimal is the ex¬ 
pansion of a rational number: To see why this is so, let us examine a typical re¬ 
peating decimal, say 3.7222 .... If we split off the nonrepeating part, write the 
repeating part using powers of 10, and use formula (5) at the proper stage, then 
we obtain 


’"Certain rational numbers have two distinct decimal expansions, e.g., { = 0.25000 . . . = 0.24999.. .. 
This situation is analyzed in Problems 14 and 15 following. 
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37 37 2 2 2 

3.7222 .. . « + 0.0222 . . . = 44 + -^ + ^ + ^- + 

10 10 10 2 10 3 10 4 


37 

10 

37 


2 

100 


1 + W + (lo) 2 + ■ 


— .5Z + l0 * 

= To tt: 


— + — 
10 90 


335 

90 


67 

18’ 


37 2 H) 

10 + 100 ’ 9 


which is a rational number expressed as a fraction in lowest terms. It is evident 
that a similar procedure works equally well for any repeating decimal, so the 
statement at the beginning of this paragraph is clearly true. 

We can summarize our results by saying that the rational numbers are exactly 
those real numbers whose decimal expansions are repeating. Equivalently, the 
irrational numbers are exactly those real numbers whose decimal expansions are 
nonrepeating. 


PROBLEMS 


1 


2 


3 


There is nothing to prevent us from forming the geo¬ 
metric series 1 + x + x 2 + • • • for any real number x. 
Show that this series diverges whenever \x\> 1. 
Determine whether each of the following geometric series 
is convergent or divergent, and if convergent find its sum: 

(a) 1 + j + ■ ■ • ; (b) 9 + 3 + 1 + • • • ; 

(c) 2 + }+•■■; (d)i+ 

(e) i ; (f) X (I)": 


<8>X h 


4 \" 


(h) 


!„(¥)• 


(j) X 


v ( 8 Y 

»=0 V 5 V 3 / ’ VJ/ (3 - V5)" 1 

,k, „?„ 7 (+ 

Show that formula (4) is essentially equivalent to the fol¬ 
lowing factorization formula of elementary algebra: 


x n - \ = (x- \)(x n ~' + jc "“ 2 + •••+*+ 1 ). 


Give an independent verification of this factorization for¬ 
mula. 

4 A certain rubber ball is dropped from a height of H ft. 
Each time it bounces it rises to a height rh, where h is 
the height of the previous bounce. Find the total distance 
the ball travels. 

5 Which of the following series are convergent and which 
are divergent? 

(a) sin 7r + sin 277 + • • • + sin nir + • • • . 

.... 277 , . . nrr 

(b) sin — + sin —— + • • • + sin —— + • • • . 

2 2 2 


(c) COS 77 + COS 277 + • • • + COS H77 + ' * ’ . 

/JN 77 , 377 , , (2 n - 1)77 , 

(d) cos — + cos ——!-■•■+ cos---f • ■ • . 

2 2 2 

(e) In V3 + In ^3 + In ^3 + ■ • • + In V3 + • • • . 

(f) In V3 + In ^3 + In ^3 + • • ■ + In + • • • . 

1 2 3 

(g) 10 + 3 + 10 + 6 + 10 + 9 + ‘ * * 

+---+ • • • . 

10 + 3 n 

(h) I + i + £+■■•. 

6 Convert each of the following repeating decimals into a 
fraction (in lowest terms): 

(a) 0.777 . . . ; (b) 0.151515 . . . ; 

(c) 0.639639639 . . . ; (d) 2.3070707 .... 

7 If a and b are digits, show that 

(a) O.aaa . . . = (b) O.ababab . . . = 

9 99 

8 The decimal 0.101001000100001 .. . , in which the l’s 
are followed by successively longer chains of 0’s, looks 
as if it is nonrepeating, and therefore defines an irrational 
number. Construct an argument that converts this im¬ 
pression into a certainty. Hint: Assume that the decimal 
is repeating. 

9 (The fly problem) Two bicyclists start 20 mi apart and 
head toward each other, each pedaling at a steady 10 
mi/h. At the same time a fly traveling 40 mi/h starts from 
the front wheel of one bicycle and flies to the front wheel 
of the other, then turns around and flies back to the front 
wheel of the first, and continues back and forth in this 
manner until the bicycles collide and he is crushed be¬ 
tween the wheels. How far has the fly flown? The hard 
way to solve this problem is to express the total distance 
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as an infinite series and find its sum. There is also an 
easy way. Do it both ways.* 

10 Find the sum of each of the following series: 


(a) ^ + ■ 


27 


+ 16 
81 


(b) 18-6 + 2- } + 


15 A terminating decimal such as f = 0.375 = 
0.375000.. . can also be written as a repeating decimal 
ending in an infinite chain of 9’s if the last nonzero digit 
is decreased by one unit, as in 0.375000. . . = 
0.374999 .... Prove this by using formula (5). 

16 Show that the number 

0.1234567891011121314151617 


(c) 


sin 6 sin 2 6 sin 3 6 
2 4 T” 


(d) 2 + x 2 + (2 + x 2 ) 2 + (2 + * 2 ) 3 + ' " ' 

11 Describe all convergent series of integers. 

12 In the series } — f + f — £ + ■ ■ * , the numerators are 
the successive positive integers, the denominators are the 
successive odd numbers starting with 3, and the signs 
alternate. 

(a) Write the series using the sigma notation. 

(b) Show that the series diverges. 

13 Express each of the following numbers as a repeating 
decimal: 

(a) (b) f; (c) f; (d) (e) 

14 Show that a positive rational number a/b (in lowest terms) 
has a terminating decimal expansion if and only if the 
positive integer b has the form b = 2 m 5 n , where the ex¬ 
ponents m and n are nonnegative integers. Check this 
statement against the results of Problem 13. 


"There is a famous anecdote about the first time one of the most bril¬ 
liant scientists of the twentieth century heard this problem, and how 
he reacted to it. See P. R. Halmos, “The Legend of John von Neu¬ 
mann,” Amer. Math. Monthly, 80 (1973), pp. 386-387. 


in which all the positive integers are written down in or¬ 
der after the decimal point, is irrational. 

17 Unrestrained deficit spending by the federal government 
inflates the nation’s money supply and leads in directions 
that politicians prefer not to think about. However, much 
of the money spent by the government is spent in turn 
by those receiving it, those receiving this twice-spent 
money spend some of it in their turn, and so on indefi¬ 
nitely. This produces a chain reaction that economists call 
the multiplier effect, and results in much greater total 
spending (and therefore total income) than the govern¬ 
ment’s original expenditure. Suppose the original ex¬ 
penditure is E dollars, and that each recipient of spent or 
respent money spends 100c percent of it and saves 100^ 
percent. The numbers c and s are called the propensity 
to consume and the propensity to save; both numbers are 
between 0 and 1, and c + s = 1, since all money is ei¬ 
ther spent or saved. In this way the income of the entire 
country is increased by kE dollars, where the factor k is 
called the multiplier. These basic concepts of macroeco¬ 
nomics were introduced by the English economist John 
Maynard Keynes (1883-1946). Show that k= \/s > 1. 
For example, if c = 0.9 and s = 0.1, then k = 10, and 
for every $1 spent by the government, the national in¬ 
come is increased by $10. 


It is interesting to observe that both the harmonic series 
and the geometric series 


1 +7+T+T+ 


have positive terms that decrease toward zero, and yet the first diverges while the 
second converges. This suggests the subtlety we will encounter as we penetrate 
further into the study of infinite series. 

One of the attractions of our subject is that it offers so many results that stir 
the imagination and stimulate curiosity. For instance, it seems reasonably clear 
from the preceding observation that a series of positive terms will converge if its 
terms decrease “rapidly enough.” This is true. For example, we next see that the 
divergent harmonic series can be made to converge by squaring the positive in¬ 
tegers in the denominators, which makes the terms themselves smaller. 


13.4 

GENERAL PROPERTIES 
OF CONVERGENT 
SERIES 


Example 1 The series of the reciprocals of the squares. 
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^ 111 

Z „ 2 1 + 2 2 + 3 2 + 




i^ + ¥ + 


= i+ -+-+ — +—+... 
4 9 16 25 


converges with sum < 2. This follows at once from the fact that the partial sums 
form an increasing sequence with 2 as an upper bound: 


Sn ~ 1 + 


1 


1 


2-2 3-3 


< 1 + 

= 1 + 

= 2 - 


1 • 2 + 2 • 3 + " ' + (n - \)n 



1 


n 


n 


Further, it is known that the sum of this series is ir/6: 





7T 2 

~6~‘ 


This very remarkable fact was discovered by Euler in 1736. His method of dis¬ 
covery (which uses the power series expansion of sinx—also discovered by him) 
is described in Appendix 1 at the end of this chapter. 


Another way to change the harmonic series into a convergent series is to change 
the signs of alternate terms. This produces a series whose sum (another aston¬ 
ishing fact!) is In 2: 

1 — y + j — j + ■ ■ ■ = In 2. 

In Section 13.1 we obtained this formula from the power series expansion of the 
function In (1 + x), which will not be solidly established until Chapter 14. How¬ 
ever, in Section 13.6 it will be possible to give an easy rigorous proof by an en¬ 
tirely different method. 


Example 2 If we recall that 0! = 1 and 1! = 1, then it is clear that the series 


n=0 rtm 

has partial sums sq- 1, S\ =2, and, for n> 2, 


s n = 1 + 1 + — + ~—~ + 
2 2-3 


1 


2 ■ 3 • • ■ n 

If each factor in the denominators is replaced by 2, then we see that 

S n < 1 -F 1 + — + — t- +••■-(- —- r 

n 2 2 2 2 ft ~ * l 

= 1 +2fl -i) = 3 -i<3, 


2 n 
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so the series converges with sum < 3. By Example 5 in Section 13.2 we know 
that the sum of this series is actually e: 


Xi=i + ' + i + 


2! 3! 


• + 


We will use this fact below to prove that e is an irrational number. 


( 1 ) 


These examples and those in the previous section provide a small supply of 
specific series of known convergence behavior, where this behavior is decidable 
by rather elementary means. The value of these familiar series for determining 
the behavior of new series by various methods of comparison will begin to ap¬ 
pear in Section 13.5. First, however, there are several simple properties of con¬ 
vergent series in general that need to be mentioned explicitly. 

The effective use of infinite series rests on our freedom to manipulate them by 
the various processes of algebra. However, we will soon see that carelessness can 
easily lead to confusion and disaster. It is therefore of prime importance to know 
exactly which operations are permissible and which are traps for the unwary. 

If Z“=i a n converges to we write 

CL\ + <22 + ' * * + a n + ' * * = s (2) 

and call s the “sum” of the series. This well-established terminology is perhaps 
unfortunate, for it tends to foster the belief that an infinite series can be treated 
as if it were an ordinary finite sum. In reality, of course, 5 is not obtained sim¬ 
ply by addition, but is the limit of a sequence of finite sums, and the properties 
of series must be based on this definition and not on any tempting but mislead¬ 
ing analogy. As we shall see, many properties of finite sums do carry over to se¬ 
ries, but we must always be careful not to assume this without proof, because 
some do not. 

As an example of the pitfalls that lie around us, consider the familiar fact that 
rearranging the order of the terms of a finite sum has no effect on the numerical 
value of that sum. In contrast to this, in Problem 10 we ask students to see for 
themselves that the sum of a convergent infinite series can be altered by writing 
its terms—exactly the same terms!—in a different order. This astounding (and 
fascinating) behavior illustrates the need for caution. It also emphasizes the del¬ 
icacy of the concepts we are working with, and gives us fair warning that we 
cannot hope to study infinite series successfully without giving a reasonable 
amount of attention to the underlying theory. 

We begin by pointing out that in dealing with finite sums we can freely insert 
or remove parentheses, as in the expressions 

1 - 1 + 1 = (1 - 1 ) + 1 = 1 - (1 - 1 ) = 1 , 


but this is not true for infinite series. For instance, the series 1 — 1 + 1 — 1 + 
• • • clearly diverges, but 

(1 - 1 ) + (1 - 1 ) + • • • = 0 + 0 + • - • 

converges to 0, and 

1 — (1 — 1 ) — (1 — 1 )-= 1 - 0-0 - 
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converges to l. 5 " These examples show that the insertion or removal of paren¬ 
theses can change the nature of an infinite series. However, in the case of a con¬ 
vergent series like (2), any series obtained from it by inserting parentheses, such 
as 


a 1 + (a2 + <23) + (#4 + (25 + < 36 ) + ' ' * , 

still converges and has the same sum. The reason for this is that the partial sums 
of the new series form a subsequence of the original sequence of partial sums, 
and therefore necessarily converge to the same limit. In the same way, we see 
that parentheses can be removed if the resulting series converges. 

We next remark that if a\ + + • ■ * converges to then a\ + 0 + a 2 + 0 + 

• • • also converges and has the same sum, because the two sequences of partial 
sums are $i, $ 2 * • • • and s\, s\, s 2 , s 2 , . . . , and the repetitions in the latter do not 
interfere with its convergence to Similarly, any finite number of 0’s can be in¬ 
serted or removed anywhere in a series without affecting its convergence behav¬ 
ior or (if it converges) its sum. 

It is important to observe that when two convergent series are added term by 
term, the resulting series converges to the expected sum; that is, if X~=i a n = s 
and 2~ =] b n = t, then X~=i (a n + b n ) = s + t. This is easy to prove, for if s n and 
t n are the partial sums, then 

{a\ + b\) + (a 2 + b 2 ) + • ■ ■ + (a n + b n ) 

= (a\ + <32 + ■ * • + a n ) + {b\ + b 2 + * • • + b n ) 

Sn tn ^ S t. 

Similarly, X~=i (a n — b n ) = s — t and 2^= \ca n = cs for any constant c. It is also 
convenient to know that if 


<2\ + a 2 + • • • = 5 , 


then 


< 2 0 + a\ T a 2 + • • • = <3 0 + s and a 2 + a -3 + • • • = s — a\. 

The first statement is clear from the fact that 

lim (cio + <3i + a 2 4- • • • 4- <3 n ) = lim a 0 + lim (a\ + a 2 + * ■ • + a n ) = <3o + s , 

and the second follows in the same way. Thus, any finite number of terms can 
be added or subtracted at the beginning of a convergent series without disturb¬ 
ing its convergence, and the sums of the various series are related in the expected 
ways. 

We now use several of the properties of series discussed above to prove the 
following theorem of Euler: e is irrational . 


*Let us rearrange these numbers into the bizarre calculation 

0 = 0 + 0+ 0 + -- -=(I - I) + (1 - 1) +(I — !) + ■■• 
- I - 1 + l — I + I - 1 + ‘ * • 

= 1 - (I - I) - (1 - I)- 

= 1-0-0- 


Guidobaldo del Monte (1545-1607), patron and friend of Galileo, thought that this result proves the 
existence of God, because “something has been created out of nothing.” 
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Our starting point is equation (1), 


*=1 + 1 +77+- • • +-T + 
2! n\ 


from which it follows that the number 


1 


1 


e- i - i - j, - 


1 


n\ (w + 1 )! (h + 2 )! 


(3) 


is positive for every positive integer n. We assume that e is rational, so that e = 
p/q for certain positive integers p and q, and we deduce a contradiction from this 
assumption. Let n in (3) be chosen so large that n> q, and define a number a 
by 


= n\ 


e “ 1 “ 1 “ T ” 


Since q divides n\, a is a positive integer. However, (3) implies that 

1 1 


r = #l4 


(n + 1)! 


(n + 2)! 

1 


< 


n + 1 
1 

n + 1 


(n + 1 )(n + 2) 


■ + • 


1 

n + 1 

1 


(n + l) 2 

1 


1 


n + 1 
1 


{n + l) 2 


n + 1 1 — \/(n + 1) n 

This contradiction (there is no positive integer < 1 In) completes the argument. 

Further information about irrational numbers ( 7 r is irrational, etc.) is given in 
Appendix 2 at the end of this chapter. 


PROBLEMS 

1 If 2 a n converges and 2 b n diverges, show that 
2( a n + b n ) diverges. Hint: Assume that it converges and 
deduce a contradiction. 

2 Decide whether each of the following series converges 
or diverges, and give convincing reasons for your an¬ 
swers: 



<f) — In 2 "' (8) — 2n 2 ' 


3 


For each of the following series, find the values for x 
for which the series converges and express the sum as 
a simple function of x: 

(a) ax + ax 3 + ax 5 + ••*,« 0; 


(b) 


x 2 



(‘>* + TT7 + (TT^ + - 

(d) In x + (In x) 2 + (In x) 3 + 
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4 Show that 


jc 2 + 


. +* 2 ( 1 + JC 2 ) 2 

converges for all jc and find its sum. 

Show that 'Z\/e n converges but 21/(e ln/1 ) diverges. For 
what values of jc does 2 e ,vc converge? 

Show that 

(a) ^ [tan -1 (n + 1) - tan 1 n\ = 7t/4; 


1 + 3 ~ 2 T i + 'l— ■ • • = j In 2. (**) 

Satisfy yourself (a) that (**) is valid; (b) that the series 
can be produced by rearranging the terms of the series 
(*), so that the first two positive terms of (*) are fol¬ 
lowed by the first negative term, then the next two pos¬ 
itive terms by the second negative term, etc; and (c) that 
the value of the sum of the series (*) has been myste¬ 
riously multiplied in this way by the factor f/ 

*11 In this problem we ask the student to give a solid proof 
of the validity of the power series expansions 


H-I = CO. 

n > 


(b) 2 ln 

n=\ 

If f(n) —> L, show that 


Z [/(«)-/(«+ i)]=/(i)-z. 

/!= I 

and use this to establish the indicated sums of the fol¬ 
lowing telescopic series: 


1 


(b) Y ("I )' 1+l * 


\_ 

= 2 ’ 

In + 1 
n(n + 1) 


(c) I 


2 n + 1 
ri rt 2 (n + l) 2 


(d)I 


1 


12 ' 


(4 n - 1)(4 n + 3) 

8 It follows from Example 1 that the series 

+ • 


1 


22 + 32 + 5 2 + ?2 + 




where the denominators are the squares of the succes¬ 
sive primes, converges. Why? 

A decimal a {) .a\a 2 ... a„ ... is simply an abbreviated 
way of writing the infinite series 


V an — 1 _i_ ^2 , 

— 1 O ' 1 a ° + 1 n + 1 <^2 


10 I0 2 


where it is understood that a 0 is an arbitrary integer and 
each of the afs for n> 1 is one of the digits 0 , 1 , 2 ,... , 
9. Show that every decimal converges. 

10 Consider the series 


1 


+ ? “ i + 7 - { + • • ■ = In 2 , (*) 


and write under it, as follows, the result of multiplying 
through by the factor 


1 + • * • = 7 In 2. 


Now add, combining the terms placed in vertical 
columns, to obtain the series 


sinx = x ~T + 5!" " 


and 


cos jc = 


2! 4! 


(*) 


(**) 


The machinery consists of the familiar formulas 

sin t dt = 1 — cos jc and cos t dt = sin jc, 

Jo Jo 

and also the following general property of definite in¬ 
tegrals: 

if a < b and fix ) < g(x), then f /(jc) dx < f g(jc) dx. 

Ja Ja 

(a) In the inequality cos jc < 1, replace jc by t and in¬ 
tegrate both sides of cos t < 1 from 0 to a fixed pos¬ 
itive number jc to obtain 

sin jc < jc. 

(b) In the same way, use the result of part (a) to obtain 


1 — cos jc < : 


or cos jc > 1 — 


jc 2 


2 2 ' 

(c) In the same way, use the result of part (b) to obtain 

JC 3 

sm * > * — — . 

(d) By continuing this process indefinitely, generate the 
two sets of inequalities 


sin x < x 


sin a > x — - 


siru<j:--+- 


3 5 7 

;r , x ' 


COS JC < 1 


COS X > 1 — — 


X 2 x 4 
cosxs\ - — + ^7 


, X 2 X 4 x b 

c ° SJ:a 


+ This phenomenon will be explored from a different point of view, 
and we hope clarified, in Sections 13.6 and 13.8. 
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(e) To complete the proofs of (* *) and (**) for the pos¬ 
itive value of x under consideration, show that it 
suffices to know that x n !n\ —> 0 as n —> ©o, which is 
Example 3 in Section 13.2 


(f) Finally, show that (*) and (**) are also valid for 
x < 0. 


The easiest infinite series to work with are those whose terms are all nonnega¬ 
tive numbers. The reason for this—as we saw in Section 13.3—is that the total 
theory of these series can be expressed by the following simple statement: If 
a n > 0, then the series Za n converges if and only if its sequence {$„} of partial 
sums is bounded . 

Thus, in order to establish the convergence of a series of nonnegative terms, 
it suffices to show that its terms approach zero fast enough to keep the partial 
sums bounded. But how fast is “fast enough”? One answer to this question can 
be stated informally as follows: at least as fast as the terms of a known conver¬ 
gent series of nonnegative terms. This idea is contained in a formal statement 
called the comparison test: If 0 < a n ^ b m then 


13.5 

SERIES OF 

NONNEGATIVE TERMS. 
COMPARISON TESTS 


Za ,j converges if Zb, 7 converges ; 

Zb n diverges ifZa n diverges . 

The proof is easy. The first step is to notice that if s n and t n are the partial sums 
of Za n and Zb n , then the assumption yields 

S n = a \ T #2 + * * ’ + U n ^ b\ + + * ' * + b n = t n . 

Our conclusion now follows at once from this inequality and the statement in the 
preceding paragraph, for if the tf s are bounded, then so are the sf s, and if the 
sf s are unbounded, then so are the tf s. 


Example 1 The comparison test is easy to apply to the series 

°o 1 00 1 


The first series converges, because 


1 


3 n + 1 3" 


and X1/3” converges; and the second diverges, because 

~ — T~ 0 ) 

n In n 

and 21 In diverges. [To verify (1) in the equivalent form In n < n, recall that the 
graph of y = In x lies below the graph of y = x.] 


It is worth remarking here that we can disregard any finite number of terms at 
the beginning of a series if we are interested only in deciding whether that se¬ 
ries converges or diverges.* This tells us that the condition 0 < a n < b n for the 
comparison test need not hold for all n, but only for all n from some point on. 


*On the other hand, if we are interested in the sum of a convergent series, then obviously we must 

take all of its terms into account. 
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As an illustration, suppose we want to show that 2(rc + \)/n n converges by com¬ 
parison with 21 In 2 . The inequality 

n + 1 _ 1 
n n ~ n 2 

is not true for all n, but it is true for all n > 4. The series therefore converges by 
comparison with the convergent series 21 In 2 . 

The comparison test is very simple in principle, but in complicated cases it 
can be difficult to establish the necessary inequality between the nth terms of the 
two series being compared. Since limits are often easier to work with than in¬ 
equalities, the following limit comparison test is a more convenient tool for study¬ 
ing many series: If ^Za n and 2 b n are series with positive terms such that 

( 2 ) 

t) n 

then either both series converge or both series diverge. To establish this, we ob¬ 
serve that (2) implies that for all sufficiently large n we have 


or 

\b n < fln < 2 b n . (3) 

Since the convergence behavior of a series is not affected by multiplying each of 
its terms by the same nonzero constant, our conclusion is an easy consequence 
of the inequalities (3) and the comparison test as extended in the preceding para¬ 
graph. Thus, for instance, if 26„ converges, then 22 b n converges and, by the sec¬ 
ond inequality in (3), 2<3„ also converges; etc. 


Example 2 The series 2(/i + 2)/(2 n 3 - 3) converges, because 21/2rc 2 converges 
and 

(n + 2)/(2 n 3 - 3) 2 n 3 + 4 n 2 

-1/2^- = as 

and 2 sin (lIn) diverges, because 21 In diverges and 


[Recall that lirn^o (sin x)/x = 1; see Section 9.2.] 


The limit comparison test is slightly more convenient to use if condition (2) 
is replaced by 

lim t 2 - = L, 
n->°° b n 

where 0 < L < oo. The proof is essentially the same and will not be repeated. 

Example 2 shows that in using the limit comparison test we must try to guess 
the probable behavior of 2 a n by estimating the “order of magnitude” of the rcth 
term a n . That is, we must try to judge whether a n is approximately equal to a 
constant multiple of the nth term of some familiar series whose convergence be¬ 
havior is known to us, such as 
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X X X X 4 or X -T- 

To apply this method effectively, it is clearly desirable to have at our disposal a 
“stockpile” of comparison series of known behavior. Our next example provides 
a family of series that is especially valuable for this purpose. We emphasize once 
again that the limit comparison test is used only if the terms of the series being 
tested are all positive numbers. 


Example 3 


If p is a positive constant, then the p-series 

00 1 111 

X _ 1 + 2/' + Jr + 4^ + ' ' ' 

n — l 


(4) 


diverges if p < 1 and converges if p > 1. 

To establish this, we notice first that if p < 1, then n p < n or \/n < \ln p , so 
(4) diverges by comparison with the harmonic series SIM. We now prove that 
(4) converges if p > 1 by showing that its partial sums have an upper bound. Let 
n be given and choose m so that n < 2 m . Then 


— s 2 m - 1 = 1 + 




(2 


• + 


(2 m - \) p 


2_ _4_ 2 m ~ 1 

” 2 p + 4 p + ’ ’ ' + 


If we put a — \/2 p 1 , then a < 1 since p > 1, and 


1 — a m 1 

s n < 1 + a + a 2 + • • ■ + a m ~ 1 = —-< --. 

1 - a 1 - a 


This provides an upper bound for the sfs, and the argument is complete. 


As an illustration of the use of this family of series, we see that 

r , 1 

Vrt 3 + 3 

converges, because the p-series (with p — f) SI In 312 converges and 

1/Vk 3 H- 3 _ I 

\/n V2 V n 3 + 3 * 

It i s worth noticing that 21 ln p does not necessarily converge if p is a variable > 
1. This is shown by the series 

y 1 

ZX n \ + \/n' 

which diverges because SIM diverges and 

1 = 1 

Vn ' 

[Recall that lim^oo n Un = 1 by Problem 3(b) in Section 13.2.] 
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We conclude this section with some observations on the process of rearranging 
the terms of a series, which was briefly discussed in Section 13.4. Suppose that 
'Zcin is a convergent series of nonnegative terms whose sum is s, and form a new se¬ 
ries 'Zbn by rearranging the afs in any way. For instance, ^Zb n might be the series 

flio + ^3 + <*5 + 01 + 06 + <*2 + * • ‘ • 

Let n be a given positive integer and consider the n\h partial sum t n = b\ + 
b 2 + • * * + b n of the new series. Since each b is some a, there exists an m with 
the property that each term in t n is one of the terms in s m — a\ + a 2 + * * * + 
a fn . This tells us that t n < s m < 5, so 2 b n converges to a sum t < s. On the other 
hand, the first series is also a rearrangement of the second, so by the same rea¬ 
soning we have s < t, and therefore t = s. This proves that if a convergent se¬ 
ries of nonnegative terms is rearranged in any manner, then the resulting series 
also converges and has the same sum. If this conclusion seems rather obvious 
and trivial to students, let them recall from Problem 10 in Section 13.4 that it 
isn’t true if we drop the assumption that the terms of the given series are non¬ 
negative numbers. 


PROBLEMS 


1 Establish the convergence or divergence of the following 
series by using the comparison test: 


(a) I 


1 


Vn(n + 1) 


(b) X 


1 


Vh 2 (h + 1) 


<c) 2 y 


Z „In n ’ 


(o Z 


n + 1 
n{n — 1) ’ 


^ (2 n + 3)" y- / n V 2 

(8) - (h)X(7mj- 

Determine by any method whether each of the following se¬ 
ries converges or diverges. 


2 X 


n 2 + r 


4 V sin -y. 

^ u 

6 

Vn 

n 2 + 5' 


3 X 


1 + 3 n 2 
n 3 + 700' 


v— 1 1 

5 Z, cos -3- 


^ X 


i 


(1 + !/«)"' 


8 X 


9 X—■ 

^ n 




n + Vn 


12 X™- 

^ n i 

v n 

14 X 


•6 X 


n 2 + 100‘ 

1 

5000/1' 


13 X 
15 X 


1000 


i/n + 1 Vn 2 + 5 ‘ 


n 10" 


>7 I ^ 3 "- 7 


n 3 — 2n + 5 * 
2 


v- V n “t" 2 v -1 n A 

18 X T F - ~T 7- ■ 19 X~-• 

Vn 3 + 3 Vn 3 + 5 " 17 

20 X 3 + C 2 ° S ” ■ 21 X ln C + l/« p ).P>0. 


22 VZXXX^ZX. 23 y (1 - C osl/n). 

^ n t— 1 


24 X Vn In 


n + 1 


25 Y -4 /j + J_ + . . . + _L\ p > o. 

^ nP \ 2 p n p J y 

26 If 2a,, is a convergent series with nonnegative terms, 
show that lLa n 2 also converges. With the same hypothe¬ 
ses, show by examples that ZVa,, is sometimes conver¬ 
gent and sometimes divergent. 

27 If p is a positive constant, show that 

(2n - 1)P + 3P + 5^ + ' ' ' 

converges if p > 1 and diverges if p < 1. 
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28 Show that 21/rc diverges by comparing it with the di¬ 
vergent series 2 In (1 + 1 In) of Problem 6(b) in Section 
13.4. Hint: Compare the graphs of the functions y = x 
and y = In (1 + x) for jt > 0. 

29 Use the idea of Problem 28 to show that 


S 11 

l 

converges. Also, 


Among the simplest infinite series are those whose terms form a decreasing se¬ 
quence of positive numbers. In this section we study certain series of this type 
by means of improper integrals of the form 

f j\x)dx = lim [ fix) dx. (1) 

We recall that the integral on the left is said to be convergent if the limit on the 
right exists (as a finite number), and in this case the value of the integral is by 
definition the value of the limit. If this limit does not exist, then the integral is 
called divergent. There is an obvious analogy between (1) and the correspond¬ 
ing definition for series, 

oo k 

y a„= lim Y a n . 

^ oo ^ 

n — 1 n = 1 

Our purpose is to exploit this analogy by using integrals to obtain information 
about series. 

Consider a series 


X a n - a \ + a 2 +•••+«„+•' ' (2) 

n= 1 

whose terms are positive and decreasing. In most cases the nth term a n is a 
function of n given by a simple formula, a n = fin). Suppose that the function 
y = f{x) obtained by substituting the continuous variable x in place of the dis¬ 
crete variable n is a decreasing function of x for x > 1, as shown in Fig. 13.4. 
On the left in this figure we see that the rectangles of areas a\, a 2 ,..., a n have 
a greater combined area than the area under the curve from x = 1 to x = n + 1, 
so 


a x + a 2 + * * * + a n > Jj f(x) dx > | f(x) dx. (3) 




(n + l) 2 
1 n(n + 2) 

find the sum of this series. 
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1 2 3 


n n + 1 n + 2 


Figure 13.4 
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On the right side of the figure we make the rectangles face to the left, so that 
they lie under the curve. If we momentarily ignore the first rectangle, with area 
a i, then we see that 

a 2 + ^3 + ' ‘ ‘ + a n < Jj fix) dx\ 

and including a\ gives 

a x + a 2 + • • • + a n < a x + fix) dx. (4) 


By combining (3) and (4), we obtain 

Jj fix) dx < a\ + a 2 + • • • + a n < a t + fix) dx. (5) 

The point of all this is that the inequalities (5) enable us to establish the inte¬ 
gral test : 


Iff(x) is a positive decreasing function for x > 1 with the property thatf(n) = a n for 
each positive integer n, then the series and integral 

X a n and Jj fix) dx 

n= 1 

converge or diverge together.* 

The argument is easy, for if the series converges, then the inequality on the left 
of (5) shows that the integral does also; and if the integral converges, then the 
inequality on the right shows that the series also converges. 


Example 1 The p-series revisited. If p is a positive constant, then we know 
from Section 13.5 that the p-series 


X 


nP 


= 1 + 


1 


2 p 


( 6 ) 


converges if p > 1 and diverges if p < 1. It is of some interest to give another 
proof of this as an illustration of the integral test. Since a n = 1 !n l \ we consider 
the function fix) = \Jx /J (which clearly satisfies all the stated conditions) and ex¬ 
amine the integral 


i; 


dx 

xP' 


If p— 1, this integral diverges, because 


/; 


dx 

x 


lim 

b—> oo 



lim In b = °°. 

b—>°° 


*This test is often called the Cauchy integral test, after its discoverer, the eminent nineteenth-century 
French mathematician Augustin Louis Cauchy (pronounced “Ko-shee”). In mathematical productiv¬ 
ity Cauchy (1789-1857) was surpassed only by Euler, and his collected works fill 27 fat volumes. 
He was a prolific contributor to number theory, algebra, and many branches of physics. However, his 
most important achievements were in the field of analysis. Together with his contemporaries Gauss 
and Abel, he was a pioneer in the rigorous treatment of limits, continuous functions, derivatives, in¬ 
tegrals, infinite series, and differential equations. 
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If p 1, then 


f°° dx 
’i xP 


ct> 

lim x p dx — 

b —->°o J 1 


lim 

b—>o° 



and the issue of convergence hangs on the behavior of b ] ~ p as b —> oo. If /? < 1, 
so that 1 — p> 0, then b x ~ p —> °° and the integral diverges. If /? > 1, so that 
1 — p < 0, then Z? * 1 —> 0 and the integral converges. By the integral test we now 

conclude again that the /7-series (6) converges if p > 1 and diverges if p < 1. 


It is clear that the integral test holds for any interval of the form x > k, not 
just for x > 1. We make use of this remark in our next example, which deals with 
a class of series whose behavior is not revealed by any of our previous tests. 


Example 2 The terms of the series 


X 


1 

n In n 


(7) 


decrease faster than those of the harmonic series. Nevertheless, it is easy to see 
by the integral test that (7) diverges, for 


f°° dx 
'2 x In x 


lim [*-£- 
b—>°° J2 X In Jt 


= lim [In In xf 2 

b—$oo 


= lim (In In b - In In 2) = oo. 

b-> °° 

More generally, if p is a positive constant, then 

X — 1 — 

„= 2 n ( ln n y 

converges if p > 1 and diverges if p ^ 1; for if p 1, we have 


dx 


= lim 


(b dx 


= lim 


2 jc(ln x) p J 2 jc(ln x) p &->< 


= lim 

b—*°° 


(In x) 1 ~p 

1 -p 

(In b) ] 'P - (In 2 ) l ~P 

1 -p 


and this limit exists if and only if p > 1.* 


We now return to the series (2) and squeeze some additional information out 
of the inequalities (5). By subtracting the integral that occurs on the left, these 
inequalities can be written as 

0 < a\ + ’ • + a n ~ f(x) dx < a\ t (8) 


*The series of this example are called Abel's series, after the great Norwegian mathematician Niels 
Henrik Abel, who first investigated them and determined their convergence behavior. Abel 
(1802-1829) died of tuberculosis at the age of 26 before the publication of his many brilliant dis¬ 
coveries made him world famous among mathematicians. His most memorable achievement was his 
proof (mentioned on p. 106) of the impossibility of solving the general fifth-degree equation by means 
of radicals. He also contributed to the rigorous theory of infinite series, and his discovery of elliptic 
and other transcendental functions launched a new era in mathematical analysis. 
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and this serves to focus our attention on the quantity in the middle. If we denote 
this quantity by F(n) so that 

F(n) = a\ + <22 + * * * + a n ~ /(•*) dx. 


then ( 8 ) becomes 


0 < F(n) < a { . 

From our present point of view, the key to this situation is the fact that {F(n)} 
is a decreasing sequence. This follows from the calculation 


F(n)-F(n + 1) = 


Cl\ + a 2 * * * + a n 



r 

fn+\ 

a, + a 2 + • 1 

"+a n+ 1 -Jj f(x)dx 



a n +1 0, 


where the reason for the last-written inequality can be understood by examining 
the left side of Fig. 13.4. Since any decreasing sequence of nonnegative numbers 
converges, the limit 


L = lim F{n) = lim 

n —>°o n—*°o 


a \ + a 2 “I" ' * ' + a n 



(9) 


exists and satisfies the inequalities 0 < L < a\. 

As our main application of these ideas, we deduce the existence of the im¬ 
portant limit 

lim (1 + ]r +•••+- — In A (10) 

n—*°° \ 2 n I 


This is easily seen to be the special case of (9) in which a n = 1 In and/( jc) = 1 lx, 
because 


f” dx 


= In x 


n 

= In n. 

l 


The value of the limit (10) is usually denoted by the Greek letter y {gamma), and 
is called Euler s constant : 


7= lim (l +^-+ •••+--Inn). (11) 

o° \ 2 n ) 

This constant occurs quite frequently in several parts of advanced calculus, 
especially in the theory of the gamma function, and is, along with it and e, one 
of the most important special numbers of mathematics. Its numerical value, 
7 = 0.57721 56649 01532 86060 .. . , has been calculated to many hundreds of 
decimal places. Nevertheless, no one knows whether y is rational or irrational. 

In order to describe some of the uses of Euler’s constant, it is convenient to 
introduce a notation which has been widely accepted in twentieth-century math¬ 
ematics. Let { a n } and {b n } be two sequences, and suppose that b n > 0. We say 
that “ a n is little-oh of b n ,” and symbolize this by writing 


a n = °(b n ), 
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if a n lb n —> 0. In particular, a n = o(l) means that a n —> 0. An equation of the form 
a n ~ b n + o(l) means that a n — b n = o(l), so a n and b n differ by a quantity that 
approaches zero as n —> °° In our work we will use the symbol o(l) to mean any 
sequence that approaches zero as n —> °° as in the calculation [a 4- o(l)] + 
2 [b + o(l)] = a + 2b + <?(1). 

With the aid of this notation, (11) can be written in the form 

1 + ^!r + •••+ — = In n + y + <?(1). (12) 

2 n 

Since In n —> °o as n —> c>o, this formula displays in a very transparent way the 
reason for the divergence of the harmonic series. It is also useful for many other 
purposes, as the following examples show. 

Example 3 We can use (12) to give a simple proof of the formula 

1 - y + j- ^-+ " = ln2 . (13) 

Let s n be the nth partial sum of this series, and observe that 



= [In 2w + y + o(l)] — [In w + y + o(l)] 


= In 2 + o(l)-»ln 2. 

The odd partial sums approach the same limit, because 

s 2n + 1 = + y J+ ~ \ ~* Hm S2n = ln 2 ’ 

so the proof of (13) is complete. We emphasize that this method establishes (13) 
on the basis of ( 12 ) alone, without making any use of the power series expan¬ 
sion of ln (1 + x) as given in Section 13.1. 


Example 4 We can also use (12) to obtain the remarkable formula 


1 



(14) 


which was the subject of Problem 10 in Section 13.4. The method is similar to 
that of Example 3. If s n is the nth partial sum of (14), then, since 2 n is the nth 
even number and 2 zi — 1 is the nth odd number, we have 
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= [In 4/2 + 7 + o(l)] - {[In 2 n + y + 0 ( 1 )] - {[In n + y + o (\)] 


= In 4/2 — In In 1 4 - o{ 1 ) = In ^ n + 0 ( 1 ) 

2 V2 n 


= In 2 3/2 + 0(l)->ln 2 3/2 = { In 2. 
It is easy to see that the partial sums 


*3n+ l - S 3 „ + 


and 


S3/I + 2 $3n + 


1 


4 / 2+1 4/2 + 

approach the same limit, so the proof of (14) is complete. 


4/2 + 3 


Remark The basic idea of the integral test is to compare sums with integrals 
by looking at their geometric meanings in terms of areas. This idea can also be 
used to prove the divergence of the series of the reciprocals of the primes, as 
mentioned in Section 13.3: 



This proof is a bit complicated; and since it is not essential to the main line of 
thought in this chapter, we place it in Appendix 3 at the end of the chapter. 


PROBLEMS 


Use the integral test to determine whether each of the fol¬ 
lowing series converges or diverges. 



2 X 

n~ 1 


1 

n(n + 1)’ 


3 X 


n 

n 2 + r 


4 X 

n = 1 

2 X 


/ 2 2 + 1 

tan~ l n 
2- 


5 X 


„e. +1 ' 


6 X 


»-1 3 " + 1 


1 + n 


8 X^ 

n = 2 U 


9 (a) The series (In n)/n diverges by comparison with 
the harmonic series, since 


1 < In n 
n n 


for n > 3 . Establish this divergence by means of the 
integral test. 

(b) If p is a positive constant, show that £“=3 (In n)/nP 
converges if p > 1 and diverges if p < 1. 


10 (a) Use the integral test to show that the series X“=j n/e n 

converges. 

(b) What is the sum of the series i n (a)? Hint: Assume 
that the geometric series 2 ^= 0 *" = 1 + x + x 2 + 
• • • = 1/(1 — x) can legitimately be differentiated 
term by term on the interval — 1 < x < 1 . 

11 The curve in Fig. 13.5 is the graph of y = 1/x Convince 
yourself that the combined area of all the infinitely many 
shaded regions is Euler’s constant 7 . By inspecting the 
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figure, show that the value of y is between y and 1, and 
is only slightly larger than y. 

12 Use (12) to show that 

1 + y- y + i + j- f+ + - • ’ ' = In 3. 

13 If {jc„) is the sequence defined by 

r -i.i, | * 

n + 1 n + 2 n + n 

then x n —» In 2 because 


+ 1/ai + n 


1 + 2 In 


+ ■ 


1 

+ — • 
n 


P 


= In 2. 


14 


Jo 1 + x 

Establish this fact by using formula (12). 
The harmonic series 


1 


1 + n/n 




+ ■ • • 


diverges very slowly. To grasp how slowly, use (12) to 
show that in order to get s n to exceed 

(a) 10, we must add about 12,000 terms; 

(b) 20, we must add about 272,405,000 terms; 

(c) 184, we must add about 4.56 X 10 79 terms. 

The last number is somewhat larger than the estimated 
total number of elementary particles in the entire uni¬ 
verse, which is about 2.36 X 10 79 .* 


*The English astronomer Sir Arthur Eddington believed he had shown 
that the number of these particles is precisely { X 136 X 2 256 , whose 
value is about 2.36 X 10 79 . Most modern astronomers think Edding¬ 
ton’s number is nonsense; on the other hand, few of them (if any) 
seem able to criticize his ideas—so in the time-honored tradition, 
they dismiss him as irrelevant. More detail on these matters can be 
found in Sir Edmund Whittaker’s From Euclid to Eddington (Dover, 
1958). 


In the case of the geometric series with r > 0, the ratio a n +\/a n of the 
0 n + 1 )st term to the nth term has the constant value r, since 
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We know that this series converges if r< 1, essentially because for these r’ s, 
condition (1) guarantees that the terms decrease rapidly. Analogy leads us to ex¬ 
pect that any series 2 a n of positive terms will also converge if the ratio a n+ i/a n 
is small for large n , even though this ratio may not have a constant value. 

These ideas are made precise in the ratio test : 


If'Zcin is a series of positive terms such that 


then 


lim 


a n 


= L, 


(a) if L < 1, the series converges ; 

(b) ifL> 1, the series diverges ; 

(c) if L = 1, the test is inconclusive . 


( 2 ) 


To establish (a), we assume that L < 1 and choose any number r between L and 
1 so that L < r < 1. Then the meaning of (2) tells us that there exists an no such 
that a n +\!a n < r for all n > no , which is equivalent to 


Q/i+i 



or 


a n +1 < a n 

y.n+ 1 y.n 


for n > no . 


This says that the sequence {a n /r n } is decreasing for n>no\ in particular, 
a n /r n < a Ho /r n ° for n > no. Thus, if we put K = a no /r n ° , then we have 

a n < Kr n for n > n 0 . (3) 


But 'ZKr" converges because r < 1, and therefore, by the comparison test, (3) 
implies that ^La n converges. To prove (b), we simply observe that L > 1 implies 
that a n +\/a n > 1, or equivalently a n +1 > a n , from some point on, so a n cannot 
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approach zero, and by the nth term test we know that the series diverges. Part (c) 
says that if L = 1, then no conclusion can be drawn, that is, sometimes the se¬ 
ries converges and sometimes it diverges. To demonstrate this, we consider the 
p-series 21 ln p . It is clear that for all values of p we have 

fln+i = nP = / n \p 
(n+l)P \n + l) 

and yet this series converges if p > 1 and diverges if p < 1. 

The ratio test is especially useful for handling series whose nth term a n is given 
by a formula that involves various products, since even though a n itself may be 
complicated, the ratio a n +\/a n can often be simplified by cancellations. 


Example 1 We know that the series 


X- 

n\ 


converges by the argument given in Section 13.4. The ratio test yields the same 
conclusion much more easily, because 


L = li m A±L =Bn W» + l>! 

a n 1 in\ 


— lim 


i! 


= lim —= 0. 


(«+!)! n + 1 


Since L < 1, the series converges. 


Students should notice our use of the equation^ + 1)! = (n + l)rc! in this ex¬ 
ample, because this fact will often be needed in our future work. 


Example 2 In the case of the series 


A n \’ 

n=0 n - 


it is easy to see that 


_ a n+ 1 3 n+1 n\ 3 

L = lim-= lim -——tt • — = lim-- = 0. 


(«+D! 3 n 
Again we have L < 1, so the series converges. 


n + 1 


(4) 


Since the nth term of any convergent series approaches zero, and therefore the 
convergence of (4) tells us that 3 n /n\ —> 0, we know that n\ increases faster than 
3 n as n —» oo. Students should try to develop an intuitive feeling for the relative 
rates of growth of expressions like these as an aid in forming quick but reliable 
judgments about the probable behavior of series. In this connection we point out 
here that the numerator 3 n of the nth term of series (4) contributes the 3 to the 
numerator of the ratio a n +\/a n after simplification, and that the n\ in the de¬ 
nominator contributes the n + 1 to the denominator of this ratio. 


°° in 
T.n ’ 


Example 3 For the series 
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we have 


n Vi + 1 


10 


3” 

*,10 


—K)" 


3 


3' 


Again we have L < 1, so the series converges by the ratio test. 


In this example the convergence of the series tells us that 3 n grows faster than 
n ]0 , and we see from the calculation of L that the series behaves like the geo¬ 
metric series with r = j. We also observe that the polynomial factor n ]0 con¬ 
tributes the factor 1 to the calculation of L, so no such polynomial factor ever 
has any effect on the outcome of the ratio test. 


Example 4 The remark preceding Example 1 is illustrated with special clarity 
by the series 

1 1-3 1 - 3-5 1 - 3 - 5 - - - {2n - 1 ) 

2 + 2’5 + 2- 5- 8 + “' + 2 • 5 • 8 • • • (3w — 1) + " '' 


Here the cancellation of factors yields 


L = lim 


a n 


1 - 3 - - - (2 n- \){2n + 1) 
im 2 • 5 • • • (3n - l)(3n + 2) 


2 • 5 • ■ • (3n - 1) 
1 - 3-■- (2/1-1) 


= lim 


2n + 1 
3/i -E 2 


2 

3’ 


and the series converges because L < 1. 


We now discuss the so-called root test, which is another convenient tool for 
studying the convergence behavior of series. 

Suppose that Za n is a series of nonnegative terms with the property that from 
some point on we have 

a n < r n , where 0 < r < 1. (5) 

The geometric series 'Zr' * 1 2 clearly converges, so Za n also converges by the com¬ 
parison test. The fact that the inequalities (5) can be written in the form 

Va~ n < r < 1 (6) 

brings us to a convenient statement of the root test : 

lfZa n is a series of nonnegative terms such that 

lim Va n = L, (7) 


then 

(a) if L < 1, the series converges ; 

(b) if L > 1, the series diverges ; 

(c) if L = 1, the test is inconclusive. 

The proof rests on the preceding remarks. For (a), if L < 1 and r is any number 
such that L< r < 1, then the meaning of (7) tells us that ( 6 ) holds for all suffi- 
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ciently large n ’s, so ^Za n converges. For (b), if L > 1, then \ x fa n > 1 from some 
point on, so a n > 1 for all sufficiently large n’ s, and the series diverges because 
a n does not approach zero. Finally, we establish (c) by observing that L = 1 for 
both the divergent series 2 Mn and the convergent series 21 In 2 , since X/n —> 1 as 
n —> . 


Example 5 In the case of the series 



we have 


L = lim Xfa n = lim = 0. 

In n 


Since L < 1, the series converges. 


In general, it is clear that the root test is most likely to be useful for treating 
series in which a n is complicated but X/~a n is simple, so that lim Xfa n is easy to 
compute. However, the practical value of the root test is outweighed by its the¬ 
oretical significance, and this appears mainly in the advanced theory of power 
series. 

Remark 1 The ratio test and the root test were first stated and correctly proved 
by Cauchy in 1821, as part of the earliest satisfactory exposition of the basic con¬ 
cepts of the theory of series. 

Remark 2 We have seen that the ratio test is inconclusive when lim a„+ \/a n = 1, 
but this is far from the end of the story. If a n +\/a n —> 1 from above, then we have 
a n +\/a n > 1 or a n +\ > a n , and certainly diverges, because a n does not ap¬ 
proach zero. But if a n + \/a n -a 1 from below, then there are several more delicate 
tests that are capable of yielding additional information. The curious reader will 
find some of these tests discussed in Appendix A. 12. 


PROBLEMS 


Use the ratio test to determine the behavior of the following 
series. 


1 

I 

n 

2 n ' 

2 

X 

n 2 _ 

2 n ‘ 

3 

X 

n n 

2 n ‘ 

4 

X 

n\ 

2 n 

5 

X 

n\_ 

n n ' 

6 

X 

nl 

(2/i)!' 

7 

X 

in'.) 2 

( 2 n)!' 

8 

X 

(«!) 3 

(2k)! 

Q 

v 

3 ■ 5 • • ■ (2« + 1) 

1 it 

v 

2 2n 

y 

zL 

n\ 

IU 


(2 n + 1 )!' 

11 

X 

3^” 

12 

X 

(2 n + 2 ) 
3 n (rt !) 2 ' 


13 


14 


1-3 1-3-5 

1 + + ——— + ■ ■ ■ 


2 ! 


3! 


1-3-5 


(2 n- 1) 


y (w!) 2 3” 

^ (2n)! ' 


15 Let be a series of positive terms with the follow¬ 
ing property: There exists a number r < 1 and a posi¬ 
tive integer n 0 such that a n +\/a n ^ r for all n>riQ. 
Show that 'Zcin converges even though lim a n +i/a tl may 
not exist. 


Use the root test to determine the behavior of the following 


series. 
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21 



Most of our attention so far has been directed at series of positive terms. We now 
wish to consider series with both positive and negative terms. The simplest are 
those whose terms are alternately positive and negative. These are called alter¬ 
nating series, and can be written in the form 


where the a„ s are all positive numbers. As examples that are already familiar 
from our previous work we mention the In 2 series 



and also the series 



whose sum was discovered by Leibniz in 1673 (Appendix 3, Chapter 10). 

It is easy to see that both of the alternating series (2) and (3) have the prop¬ 
erty that the a n 's form a decreasing sequence that approaches zero: 

(i) — ^3 — ‘ * 

(ii) a n —> 0 . 

In 1705 Leibniz noticed that these two simple conditions are enough to guaran¬ 
tee that any alternating series (1) converges. This fact is called the alternating 
series test. 

The essence of the situation lies in the back-and-forth movement of the par¬ 
tial sums of the series (1) under the stated hypotheses, as illustrated in Fig. 13.6. 
To locate the partial sums s\, S 2 , 53 , . . ., we start at the origin and go to the right 
a distance a\ to reach s 1 , then go left a smaller distance a 2 to reach S 2 , then go 
right the still smaller distance a 3 to reach 53 , and so on. The behavior of these 


n + 1 


13.8 

THE ALTERNATING 
SERIES TEST. 
ABSOLUTE 
CONVERGENCE 


Figure 13.6 The alternating series test. 
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partial sums is similar to that of a swinging pendulum that oscillates back and 
forth and slowly approaches an equilibrium position which represents the sum s 
of the series. We suggest that students keep this figure in mind while reading the 
proof in the following paragraph. 

Now for the details of the argument. A typical even partial sum s 2n can be writ¬ 
ten in two ways, as 

s 2n = («i “ a 2 ) + («3 “ a 4 ) + • * * + (a 2n -\ ~ a 2n ) 

= a { - (a 2 ~ a 3 ) - (a 4 - a 5 ) - (a 2n - 2 ~ a 2n - 1 ) - a 2n , 

where each expression in parentheses is nonnegative because the a„s form a 
decreasing sequence. The first way of writing s 2n displays it as the sum of n 
nonnegative terms, so s 2n — s 2 n+ 2 and the even partial sums form an increasing 
sequence, as shown in the figure. The second way of writing s 2n shows that 
s 2n < a\ y so the s 2n ’s have an upper bound. Since every bounded increasing se¬ 
quence converges, there exists a number s such that 

lim s 2n = s. 


But the odd partial sums approach the same limit, because 

s 2 n+\ - a\ - a 2 + <33 — <24 + • • • - a 2n 4- a 2n +\ 


S 2 n a 2n+U 


and therefore 


lim 52 / 1+1 = lim s 2n + 

h—>00 n —>00 


lim a 2n +\ =5 + 0 = 5, 

n —>00 


since a 2n +\ —> 0 as n —» «). This tells us that the sequence {5 W } of all the partial 
sums converges to the limit s, and therefore the alternating series (1) converges 
to the sum s under the stated conditions. 


Example 1 The alternating series test clearly implies the convergence of series 
(2) and (3), 

,_i + i_i + ." = |n 2 md 

because 1 In and 1/(2 n — 1) both decrease to zero. However, this test gives us no 
information at all about the sums of these series. Students will recall that the first 
of these indicated sums was established earlier by two very different methods, 
one conjectural and the other solidly rigorous, whereas the second sum is still 
only a conjecture.* 


Example 2 Determine the convergence behavior of the alternating series 




( a > X 


n = I 


(~l) n+1 n 
1000 + 5 n ’ 


(b) x (-D" +1 

n=2 


In n 
n 


Solution (a) Even though this series is alternating, we nevertheless have a n = 
/i/(1000 + 5n) —» j as n —> so the series diverges by the nth term test. 


*The method of Leibniz in Appendix 3 of Chapter 10 assumes the validity of term-by-term integra¬ 
tion of geometric series. This procedure is valid, but we don’t yet know it with certainty. 
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(b) To prove that this series converges by using the alternating series test, we 
must show that a n = (In n)/n decreases to zero. We know that (In n)/n —> 0 by 
Problem 3(a) in Section 13.2. To demonstrate that the a n J s are decreasing, we 
note that the function 

. In x , , . . . 1 - In x 

f{x) =- has derivative f m =- 5 -. 

x x 

This derivative is negative for x> e, so f(x) is a decreasing function for x > e, 
and therefore a n > a n +\ for n > 3. (As usual in considering the matter of con¬ 
vergence, we can disregard the first few terms of the series.) In this case—and 
in others—we may be convinced that a n decreases to zero without feeling any 
need for a detailed verification. However, if there is any doubt at all, we should 
be prepared to supply such a verification. 

ABSOLUTE CONVERGENCE 

Why is it that the alternating harmonic series 


converges, even though the harmonic series 

1 + t + y + { + ‘ * 

diverges? The essential reason for the divergence of the harmonic series is that 
its terms don’t decrease quite fast enough, as do the terms of the convergent se¬ 
ries 21/2", for example. The partial sums of 21 In consist of many small terms 
that add up to a large total, whereas the terms of 21/2" decrease so fast that no 
sum of any large number of them can even reach 2 . 

In contrast to the relatively simple behavior of these series, the alternating har¬ 
monic series converges, not only because its terms get small, but also because 
the well-placed minus signs prevent the partial sums from growing too large and 
permit them to approach a finite limit. 

Some series with terms of mixed signs do not need the assistance of minus 
signs for convergence, but converge because of the smallness of their terms alone; 
they would still converge even if all the minus signs were replaced by plus signs. 
Series of this kind are especially important and are called absolutely convergent; 
that is, a series 2 a n is said to be absolutely convergent if 2 |a n | converges. 

These remarks suggest that absolute convergence is a stronger property than 
ordinary convergence, in the sense that absolute convergence implies conver¬ 
gence. This is true and is easy to prove, as follows. Suppose that 2 a n is an ab¬ 
solutely convergent series, so that 2|«„| converges. The inequalities 0 < a n + 
\a n \ < 2\a n \ are clearly valid, because a n T \a n \ equals 0 or 2\a n \ according as 
a n < 0 or a n > 0 ; and since 22 |<z w | converges, we know that 2 (a n + \a n \) also 
converges by the comparison test. Since 2 (a n + \a n \) and 2|a w | both converge, so 
does their difference, which is 2 a n . 

When we try to establish the convergence of a series whose terms have mixed 
signs, testing for absolute convergence is a good first step, because (as we have 
just seen) this implies convergence. To do this, we merely change all minus signs 
to plus signs and test the resulting series of nonnegative terms. We remind stu¬ 
dents that all our previous tests—the comparison tests, integral test, ratio test, 
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and root test—apply only to series of positive (or nonnegative) terms, and are 
therefore essentially tests for absolute convergence. 


Example 3 Test the following series for absolute convergence, and also for con¬ 
vergence: 


(a) 1 2 2 + 3 2 4 2 + 

(b) i ± ji ± ji ± ji ± • ■ ■; 


(c) 


V2 V3 V4 


Solution (a) Here the series 2|a„| of absolute values is 21 In 2 , which converges. 
This tells us that series (a) is absolutely convergent, and therefore convergent. 
This series also converges by the alternating series test. 

(b) The intent here is that the plus or minus signs are to be inserted in any 
manner, either at random or according to some systematic pattern. In either case, 
it is clear that, just as in part (a), the series is absolutely convergent, and there¬ 
fore convergent. However, without the concept of absolute convergence we would 
have no means of determining the convergence behavior of this series. 

(c) In this case the series of absolute values is 21/Vn, which is a divergent 
p-series. Series (c) is therefore not absolutely convergent. Nevertheless, this se¬ 
ries is clearly convergent by the alternating series test. 


Remark 1 A convergent series that is not absolutely convergent is said to be 
conditionally convergent. As examples we mention series (2) and (3), and also 
the series in Example 3(c). All series of this kind are capable of startling but fas¬ 
cinating misbehavior, and should be labeled “handle with care.” For instance, any 
such series can be made to converge to any given number as its sum, or even to 
diverge, by suitably changing the order of its terms without changing the terms 
themselves. On the other hand, any absolutely convergent series can be rearranged 
in any manner without changing either its convergence behavior or its sum. In 
an earlier section we said, “The effective use of infinite series rests on our free¬ 
dom to manipulate them by the various processes of algebra.” Generally speak¬ 
ing, this freedom is available only when we are working with absolutely con¬ 
vergent series. These issues and others that are not normally part of a first course 
are discussed in detail in Appendix A. 13. 

Remark 2 The only test we have that establishes convergence rather than ab¬ 
solute convergence is the alternating series test. Several other tests of this kind 
are presented in Appendix A. 14. 

Remark 3 Students may have noticed that almost all of our work in the pre¬ 
ceding sections of this chapter has been devoted to showing whether a series con¬ 
verges or diverges, and very little to finding the sum of a convergent series. A 
good reason for this is that the second problem is usually much harder than the 
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first. Thus, the convergence of the series of the reciprocals of the squares is quite 
easy to establish, but the exact value of its sum, namely, the fact that 


I 


n = \ 



7 T 


2 


6 ’ 


is far from obvious and can only be discovered by great ingenuity. Further, the 
convergence of the series of the reciprocals of the cubes is equally easy to es¬ 
tablish, but its sum has never been discovered and remains to this day one of the 
more tantalizing unsolved problems of mathematics: 


v- i 1 

X^- |+ ^T + TT + 


A second, and perhaps even more important reason for not pressing harder on 
the problem of finding the sum of a convergent series is that this problem will 
be constantly before us in the next chapter. That is, we shall be “representing” a 
given function by a certain kind of series, and we will always give careful at¬ 
tention to the matter of proving that this series actually converges to the given 
function as its sum. In this way, the sums of many convergent series of constants 
will be known to us as a minor consequence of our work on convergent series 
of functions. 


PROBLEMS 


Classify each of the following series as absolutely convergent, 
conditionally convergent, or divergent. 


i £(-»" + i— 2 
Vn + 10 


X (-D ' ,+1 



19 

21 


X 

X 


(- 1 )" 


„9/2 


(-D " +1 



20 X( _| )” +,|n ^- 

22 X(-l)" +l ^. 

^ n\ 


3 X 

5 XM)"" 

i Xm)" 4 

9 X ^ 1 )" 4 


n 


n + r 

n\‘ 
n 3 

1 + n 5 ' 

Vn_ 

In n 


II £(_!)*+1 1521 . 

\Tn 


13 X (-D" + ' 
15 XM )^ 1 

17 X(-*)' ,+ ' 


Vn 

n + 3 ’ 

2 

3 n 2 ’ 
sin 2 n 


4 y (-i) n+i 4X. 

6 X(- 1 )" + 1 ^r- 
8 XM)" 


In (n + 2 )' 
10 y (-1)" +1 In--. 

^—' n 


12 y ( -iy 


, 2«+1 




14 

16 (— 1 ) ,, + 1 sin nir. 

18 X(-»" + 1 T 


23 

24 

25 



26 We know that if { a n } is a decreasing sequence of posi¬ 
tive numbers such that a n —> 0 , then the alternating se¬ 
ries (1) converges to some number s as its sum. Show 
that s lies between consecutive partial sums s n and 5^+1 
and that \s n — s\ < a n +\. 

27 State whether each of the following is true or false: 

(a) every convergent alternating series is conditionally 
convergent; 

(b) every absolutely convergent series is convergent; 

(c) every convergent series is absolutely convergent; 

(d) every alternating series converges; 

(e) if is conditionally convergent, then ^\a n \ di¬ 
verges; 
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(f) if diverges, then Y.a n is conditionally conver¬ 
gent. 

28 If the a n 's are all positive numbers, show that the series 

— a\ + — <33 + — * * * converges if and only if the 

series a] - a 2 + — <74 + * * * converges. [This shows 

that starting an alternating series with a positive term, as 
in (1), is merely a convenience, not a necessity.] 

29 If 2 a n and ^Zb n are absolutely convergent, show that 

(a) 2( a n + b n ) is absolutely convergent; 

(b) 2ca n is absolutely convergent for any constant c. 

30 If 2a n 2 and 2b n 2 converge, show that 2 a n b n is absolutely 
convergent. Hint: (a — b) 2 > 0, so lab < a 2 + b 2 . 

31 Use Problem 30 to show that if 2 a 2 converges, then 
'Zajn is absolutely convergent. 

32 In using the alternating series test, it is a common error 
to check only that a n —> 0; but this is not enough, and 


convergence cannot be deduced unless both of conditions 
(i) and (ii) are verified. Demonstrate this fact in the case 
of the alternating series 

1 — Xj_ — — — 

1 1 ' 2 2 ' 3 3 ' 

by showing that 

(a) a n -> 0; 

(b) the a n 's do not form a decreasing sequence; 

(c) the series diverges (hint: consider s 2n )• 

33 If 5 is any given number, show that the alternating har¬ 
monic series 1 — y + y — j + • • • can be rearranged 
(that is, its terms can be written in a different order) in 
such a way that the resulting series converges to 5. Hint: 
Take just enough positive terms to get above 5, then just 
enough negative terms to get below 5 , etc. 


CHAPTER 13 REVIEW: DEFINITIONS, CONCEPTS, TESTS 

Think through the following . 


1 

Limit of a sequence. 

10 

The limit comparison test. 

2 

Convergent sequence. 

11 

The p-series. 

3 

An increasing sequence converges if and only if it is 

12 

The integral test. 


bounded. 

13 

Euler’s constant. 

4 

c 

Convergence and sum for infinite series. 

14 

1C 

The ratio test. 

6 

Geometric series. 

Harmonic series. 

I D 

16 

1 ne root test. 

The alternating series test. 

7 

The ftth term test for divergence. 

17 

Absolute convergence. 

8 

Repeating and nonrepeating decimals. 

18 

Absolute convergence implies convergence. 

9 

The series for e . 

19 

Conditional convergence. 


ADDITIONAL PROBLEMS FOR CHAPTER 13 


SECTION 13.2 
1 Find lim x n if 


(a) = 


(b) x n = \\ - 


(c) = 


(d) x n = 


1 - 




Vn 2 + 1 Vn 2 + 2 


Vn 2 + n 


2 Iff(x) = lim^^oo [lim m _^oo (2/tt) tan -1 (m sin 2 [n\irx])], 
show that f(x) = 0 when x is rational, and 1 when jc is 
irrational. 


If f(x) = lim^.^oo [lim^oo cos m («!77^:)], show that 
f(x) = 1 when x is rational, and 0 when x is irrational. 
Find the value of each of the following limits: 


(a) lim 


1 


— (*>0); 


(b) lim 


Jt* +1 + n 


X n + X~ n ' co Jt” + 2ft 

For each sequence {x„} whose ftth term is given, ver- 
if y that the first three terms are 1, y and find the fourth 





term: 

_ 1 1 

1 , 

I ^ 


n 2 + 1 

n 2 + 2 

n 2 + n ’ 

(a) x, 

1 

+ 1 

, + 1 . 

(c) X, 


(b) = 


1 


2ft 3 - 12ft 2 + 23ft - 12 ’ 


1 


ft 2 ( "' 


-l)(n-2)(rt-3) * 


*6 If a is any given number, define a sequence (by 
constructing a suitable formula for x n in terms of n) 
which has the property that x\ = 1 , x 2 = y, JC 3 = y, and 
X 4 = a. 
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*7 The so-called Fibonacci sequence 1, 1, 2, 3, 5, 8, 
13,... is defined recursively by putting x x = 1, x 2 = 1, 
and x n = x n -2 + x n ~ \ for n > 2. f Find a formula for x n 
in terms of n . Hint: Make the ingenious guess that x n 
has the form aA n + fiB n for suitable values of a, /3, A, 
B; then determine A and B so that the recursion formula 
is true for all cfs and /3’s; and finally, find a and /3 so 
that x\ — 1 and x 2 = 1. 

8 If {jt„} is the Fibonacci sequence defined in Problem 7, 
show that lim x n +\/x n = (1 + V5)/2. 

9 The sequence V2, V 2 V 2 , V2V2\/^,. . . can be 
defined recursively by putting *1 = \fl and x n +\ = 
\Z2x n for n > 1. 

(a) Use mathematical induction to prove that x n < 
x n +i <2 for every n} This shows that the sequence 
is increasing and has 2 as an upper bound, and there¬ 
fore converges to a limit x < 2. 

(b) Show that x = 2 by using the recursion formula. 

(c) Show that x = 2 by finding an explicit formula for 
x n in terms of n. 

10 The sequence V2, V 2 + V2, V 2 + VT+ \2, 
.. . can be defined recursively by putting x\ = V2 and 
x n +\ = V2 + x n for n> 1. Show that it is increasing 
with 2 as an upper bound, and find its limit. 

*11 If a > 0, then the sequence Va, va + Va, 

Vfl + Vfl+^,. .. can be defined recursively as in 
Problem 10. Show that it converges and find its limit. 
12 Let f(x) be an increasing continuous function on the in¬ 
terval 0 < x < 1. Define two sequences { a n } and { b n } 
by 



(a) Show that 


Fibonacci, or Leonardo of Pisa (ca. 1170-1230), was an Italian busi¬ 
nessman who traveled extensively in the Middle East and was chiefly 

responsible for introducing the Hindu-Arabic numerals (i.e., I, 2, 

3, .. .) into Europe. He encountered his sequence in a problem about 
the progeny of rabbits. It has since been applied extensively (and ec¬ 
centrically) to religion, art, the shapes of seashells, etc., etc. Fi¬ 
bonacci’s problem was this: Start a rabbit colony with a pair of new¬ 
born rabbits, one male and one female. Suppose it takes a newborn 
pair 1 month to grow to sexual maturity and 1 more month to pro¬ 
duce a litter. Assuming that no rabbits die and that each litter consists 
of one male and one female, find the number of pairs in the colony 
after n months. Answer: the «th term of the Fibonacci sequence— 
think about it. 

^Recall that the principle of mathematical induction asserts the fol¬ 
lowing: A statement S(n ) which is meaningful (in the sense of being 
either true or false) for each positive integer n is true for all n if (i) 
5(1) is true; and (ii) S(n ) implies S(n +1). This principle is discussed 
in detail in Appendix B.2. 


dx<b n 


and 


V"- l fix )' 

Jo n 


(b) Show that both sequences converge to the limit 

Jo/to 

(c) State a corresponding fact for the interval a<x< 
b. 

*13 Use Problem 12 to obtain the following limits: 

, , 1 , 2 , , n 1 

(a) ~ + ~ + ' ' ' + 2 ’ 


n ‘ 


n* 


22 

n 3 n 3 


(b) — + — +••• H—t 

' n' vi~ > 


2 I- 

3 ’ 


(c) 

(d) 

(e) 

(f) 


n + 1 

+ — 
n 

+ 2 + 

1 

i 

1 ■+... 

n + 1 

i 

n 

+ 2 + 

n 


n i 

n 2 + l 2 ' 

n 2 + 2 2 

n 


n 

(n + I 

d 2 + 

(n + 2) 2 


-A In 2; 


n + n 


+ --> In k\ 

kn 


n 


n z + n z 
n 


4 ’ 


(n + n) 2 


(g) “I 

(h) 


, • 27T , . n„ . 

+ sin-h ■ ■ ■ + sin- -»—; 

n n 7T 


+ sin 2 


1 / . 7T 

I sin — 
n \ n 

1 / . 2 7T 

— sim — 
n \ n 

(i) 2. (Ve + Ve* + ■ ■ ■ + Ve") e - 1; 

1 1 I 


, - 2 2ir _L 

+ sin 2 -h 

n 


1 

^ 2 ; 


U) 


V n 2 + l 2 Vn 2 + 2 2 


Vn 2 + n 2 
-4 In (1 + V2); 


(k) - (in - + In - + 
n \ n n 


+ In 


-> - 1 . 


i 

14 Use part (k) of Problem 13 to show that-— 

n e 

*15 Use Problem 14 to show that 


(a) - VCn + l)(n + 2) •••(« + n) 

n e 


(b) - V (2 n + 1)(2 n + 2 ) • • • (2 n + n) -> 
n Ae 

*16 If x n ~^>x, then the sequence of the arithmetic means of 
the Vs also converges to ;c; that is, 


27 
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= 


X\ + X2 + ' ' ' + X n 
n 


x. 


lim 

n—>°° 


fin) ~ n\ 
n\ 


lim 

ft—> oo 


fin) 

n ! 


= 0 . 


Prove this in two steps, as follows. 

(a) Begin by assuming that x = 0, find a positive inte¬ 
ger riQ such that \x n \ < e /2 for all n > n 0 , and use 
the fact that for these ri s we have 

jfl + X 2 + • • • + Xn 0 -]\ + kj + • • • + kl 
n n 




where a — |*i + * 2 + * * * + jc„ 0 — i I is a constant, 

(b) In the general case, where x = 0 is not assumed, 
use the fact that since x n — x 0 , we can infer from 
part (a) that 

ix\ - x) + ix 2 ~ x) + • • • + ix n - x) 
yn~ x = - 

-4 0 . 


17 Use Problem 16 to show that 

1 + j + ■ ■ ■ + l/n 

(a) -> 0 ; 

n 

1 + V2 + NiT + • • • + N/n 

(b) n '■ 

*18 If {jc„} is a sequence of positive numbers such that 
x n +\/x n r, then we also have ^/x~ n —> r. Prove this as 
follows: Put y n = In x n +\/x n \ show that 


y± +T2 + ' ' ' + Tft-i 


= In Vk _ In 


and apply Problem 16. 

19 Use Problem 18 to show that ^/fi\ln -a Me. 

*20 Wallis’s product, which can be expressed in the form 

2 2 4 4 6 6 2 n 2 n it 

13 3 5 5 7 2k— 12n + l > 2’ 

is proved in Appendix 2 at the end of Chapter 10. Since 
2n/i2n + 1 ) 1, this can also be written as 

2 2 4 2 6 2 (2 n — 2) 2 ^ it 

F ' 5^' 72 " ' (2« - i ) 2 ' Ln ^ T 

By taking square roots and multiplying numerator and 
denominator by 2 • 4 • 6 • • • (2w — 2), establish the 
formula 

lim Sal^L = V^, 
n-> ~ (2 k)!Vk 

which is needed in Problem 21. 

*21 It is a remarkable fact that the function fin) = 
V 27777 is a good approximation to ai! for large n, 
in the sense that the relative error approaches zero: 


This is equivalent to the statement that 


V 2 mi n n e~ n 

which is known as Stirling's formula? In addition to its 
intrinsic interest, this formula is a useful tool (in statis¬ 
tics and the theory of probability) for the approximate 
numerical calculation of n\ when n is large. Prove Stir¬ 
ling’s formula by verifying the following statements: 
(a) 2/(2 n + 1 ) < ln (1 + Mn) (hint: compare the area 
under the curve y = Mx from x = n\ox = n+ 1 
with the area of the trapezoid whose top is tangent 
to the curve at x = n + {)\ 


(b) e < 



.zi+l/2 


hint: (n + i) in (l + ^ ln (l + i)" +,/2 ]; 

(c) the area A under y = ln jc from jc = 1 tojc = zi is 

fn I n\ n 

J i lnxdx = nlnn-n+l=ln I—j + 1; 

(d) the number x n defined by 

jn/e) n Vn 


is < 1 [hint: compare the area A i n part (c) with the 
area B = 1 + ln nl - ln Vn of the following fig¬ 
ure: Divide the interval from x = 1 to x = n into 
subintervals by the points f, w — on 

the first and last subintervals construct rectangles 
with heights 2 and ln n\ and on the remaining 
subintervals construct trapezoids whose tops are 
tangent to the curve y = ln x at x = 2, 3, . . . , 
n — 1 ]; 

(e) {jt,*} is an increasing sequence which is bounded by 
part (d), so lim x n exists; 

v 2 1 

(f) lim jc„ = lim - 2 - = —— 

X 2n V2t T 

[hint: use the formula established in Problem 20]; 

(g) part (f) implies Stirling’s formula. 


SECTIONS 13.3 AND 13.4 

22 If 2~ = i a n = s, what is the sum of the series 
27=i(tfft + a n+ 1 )? 


f James Stirling (1692-1770) began his career by being expelled from 
Oxford for supporting the defunct Stuart dynasty, and ended it as the 
successful manager of a mining company. In his salad days he was a 
friend of Newton, and wrote an essay on infinite series in which he 
almost discovered the formula that bears his name. 
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23 For what values of jc is 


^ + a + Jl + 

jc jc 3 .r 3 


x 2 - 2 


valid? 

24 Find the values of jc for which 


* l x V + 1 x 


1 + X \1 + */ \1 +■* 

converges. What is its sum? 

*25 By finding a closed formula for the nth partial sum s fl , 
show that the series Y^ = \nx n converges to jc/(1 - x) 2 
when |jc| < l and diverges otherwise. 

26 Find the sum of the series y + f + y? + ■ ■ ■ . 

*27 Use the fact that V = — to show that 

n=\ n 6 

,,111 7 T 2 

(3) 72+^2 +32+-"=T ; 

rM J_L . J_L . 

[ > l 2 2 2 3 2 4 2 12’ 

,,1111 1 1 7 r 2 

(c) 7t + 7t + 7I + 7tt + 772+772+"- = -9-- 
28 Show that 

f~ xdx = y J_ 

Jo e x ~ 1 “’j « 2 

by expressing the integrand as a geometric series and 
integrating term by term. 

*29 Show that 

1 1 


(a) 


1*2 3*4 5-6 


+ ■ ■ ■ = In 2; 


(b) TT + T 1 ! + 6 ^ + " ' = 1 ” ln 2; 

1 


1*2 2*3 3 • 4 4*5 + 


(c) 

30 Show that 

f ' 1 

“1 n{n + 1 ){n + 2) 4 ’ 

( c) v-i-L.. 

^ n(n + \) ■ ■ ■ (n+ k) k ■ k\ 

31 Find the sum of 


= 2 In 2 - 1. 


mi - ^ — -= 

n = 1 ( n + 1 )vn + nVn 4* 1 
If f{n) —> L, show that 


S [/(«) “/(« + 2)] =/(l) +/(2) - 2L 


and use this to establish the following statements: 

(a) 


- + -L + -U + -L + ....1 

1*3 2*4 3*5 4*6 4’ 


_ 1 1,1 1 1 

(b) 1-3 2 • 4 + 3 • 5 4 • 6 + _ 4’ 

*33 If a\, ci 2 , « 3 , ■ ■ ■ are the positive integers whose decimal 
representations do not contain the digit 5, show that 
21 la n converges and has sum < 90. 

*34 Figure 13.7 shows the region bounded by two circles 



of radius 1 that are tangent to each other and by a 
straight line tangent to both. A sequence of smaller cir¬ 
cles, each having the largest possible radius, is inscribed 
in the region in the manner shown in the figure. It is 
clear from the geometry of the situation that the lengths 
of the diameters of these smaller circles are the terms 
of a convergent series whose sum is 1. Show that this 
series is 


1 


1 


1 + ^- + 


n(n + 1) 1 - 2 ' 2-3 ' 3-4 

SECTION 13.5 

35 Determine whether each of the following series con¬ 
verges or diverges: 


(a) X 
(c) X 

(e) X 

(g) X 
« X 


2 

(b) 

X 

n 2 + 31 . 

n 2 + n ’ 

10,000n 3 ’ 

1 

(d) 

X 

n 

(n + 3) 2 ’ 

Vn 2 + 2 ’ 

1 

(f) 

X 

\4 . 

[1 + (n - l)/n] n ' 

n + 5 ’ 

tan -1 n 

(h) 

X 

3 


2 + \4’ 

(3 n + l) 3 . 

(j) 

X 

1 

(n 3 + 2) 2 ’ 

^3 n 2 + 1 


*32 
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(k) 


I 


_ 1 

Vn(n + l)(n + 2)’ 


( 1 ) 

(n) 


> 2n + 3 
^ n • 3 n ’ 



(m) X 
(o) X 


5n — 7 
(n + 5)n! ’ 

( n + 1 V» 

l 2n )' 


(P) X 

« X 
(t) X 

(V) X 


/n 2 - i y« 
\ n 3 + 3 / ; 

2” + 3" 

3" + 4" ’ 


(n + 1)" 
n" +l ’ 

[In (n + 1 )]" 


(q) 

(s) 

(u) 


X 


Vn + 1 

y,n+ 1/2 


X (1 - e _1/ T; 

X sin 2 7r(/i + - 


36 If a > 1, show that SI /a ]nn diverges if a < e and con¬ 
verges if a > e. 

37 Prove that any convergent series of positive terms can 
be rearranged so that its terms form a decreasing se¬ 
quence. 

38 If p is any positive constant, show that SI /(In n) p di¬ 
verges. 

39 Show that the series 


X 


1 


44 


45 


^3 n In n (In In n) p 

converges if p > 1 and diverges if p < 1 . 
If k is any integer > 1, show that 


1 


n + 


n + 2 


+ f- -> In k. 
kn) 


The sum of the convergent series S“=i1/ai 3 is not 
known. However, if this sum is denoted by s, show that 

7 


(b) 


8 


■L + J---L + 

2 3 3 3 4 3 


*46 


l 3 

J_ 

1 3 2 3 3 3 4 3 4 

For p > 1 , the sum of the p-series S“=il ln p is a func¬ 
tion of p called the zeta function (the symbol £ is the 
Greek letter zeta) and denoted by £(p); that is, 




-L J_ H 

2 p + 3 p + 


Euler discovered that £(2) = tt 2 /6, £(4) = tt 4 /90, and 
£( 6 ) = 7 r 6 /945 (see the Appendix at the end of Chapter 
14), but the value of £(p) is not known when p is odd. 

(a) Use the inequalities (5) in Section 13.6 to show that 
the zeta function satisfies the inequalities 


1 


P-1 



X 


_J_ 

(In n)' nn 


and 


X 


1 

(In In rc) ln n 


are both convergent. Hint: Express (In rc) ln n as a power 
of n. 

40 Show that 


ZX (ln n yn In n 

diverges. Hint: (In In n) 2 < In n for large n (why?). 

41 If a n > 0 and 1a n converges, and if { b n } is a bounded 
sequence of nonnegative numbers, prove that 1a n b n also 
converges. Use the series 1 — \ + y — j + ■ ■ ■ to show 
that this is false if the assumptions a n > 0 and b n > 0 
are dropped. 

42 If la n and 1b n are series of nonnegative terms such that 
1 a n 2 and "lb 2 both converge, show that la n b n also con¬ 
verges. 


and 


■=«<”* a- 


(b) Show that lim^] £(p) = °o and lim^ ..>«> £(p) = 1. 

^ 1 

(c) Show that lim a > , . = 1. 

a—>0+ n ]+a 

n— I 

*47 Let k be an integer > 1 and show that 


X 


n 


= In k, 


where a n {k) is defined by 

f 1 if n is not a multiple of k, 

an —(k — \) if n is a multiple of k. 

*48 The Cauchy condensation test states that if a\, a 2 ,. . ., 
a n ,. . . is a decreasing sequence of positive numbers, 
then the two series 


SECTION 13.6 
43 Show that 


X 


1 

n In n In In n 


diverges, and also that if p is a positive constant, then 


X a n - a 1 + £*2 + ^3 + ^4 + ' ' ' 

/i= 1 


X 2 n a2 n — a\ + 2^2 + 4^4 + 8 a% + ■ ■ ■ 

n = 0 


and 
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converge or diverge together. (This statement is called 
the condensation test because it says that a rather small 
proportion of the terms of the first series determines its 
convergence behavior.) 

(a) Prove the condensation test. Hint: If s n and t n are 
the partial sums, group the terms of the first series 
into blocks to show that s n < t m if n < 2 m , and t m 
< 2 s n if 2 m < n. 

(b) Use the condensation test to show that the series 


X 1 and X -r- 

n ^ nvan 

diverge, and that the series 

X and X 

^ n p ^ rt(ln n) p 

converge if p > 1 . 

49 Prove that 





+ In 2 + — y + o(l). 


Show that 




(a) 

I 

n= 1 

1 

n(2n + 

1) ” 

2 - 2 In 

2; 

(b) 

z 

n~\ 

1 

n(4n 2 - 

- 1) " 

= 2 In 2 - 

- 1; 

(c) 

X 

n= 1 

1 

n(9n 2 - 

- 1) " 

■fan 3 

- 1); 

(d) 

X 

n= 1 

1 

n(\6n 2 

- 1) 

= 3 In 2 

- 2; 

(e) 

X 

n= 1 

1 

n(36n 2 

- 1) 

= fln3 

+ 2 In 2 - 

(f) 

X 

n — 1 

n 

(4 n 2 - 

l) 2 ' 

l 

: 8’ 


(g) 

X 

n = l 

1 

n(4n 2 - 

- I) 2 

-f- 2 

In 2. 


SECTION 13.7 

Use the ratio test to determine the behavior of the following 
series. 

51 ££ 52 x " 1000 (!)“- 

v 1 - 3 - 5 ■ ■ ■ (2/i - 1) 

^ 2 • 4 • 6 • • • (2 n) ' 

v 1 -3-5---(2 k- 1) 

^ 1 • 4 • 7 • • • (3n - 2) ' 


55 

56 

57 

58 

59 


60 


61 


I 

I 


1 • 6 - 11 • • • (5/2 - 4) 

2 • 6 ■ 10 • • ■ (4n - 2) • 
1000 ” 

All ' 


I 

I 


(n + 3)! 
n\ 3” ' 

2^ n 

2-4-6 - • • (2rc) 


2 2-4 2 


1-4-7 


4 • 6 


1 - 4 • 7 - ■ • (3w - 2) 
+ 2 • 4 • 6 • • • (2w) 

(a) Show that the ratio test fails for the series 


^ 2«+(-i) w * 


(b) Show that the root test succeeds for the series in 
part (a) and tells us that this series converges. (Thus, 
the root test works in some cases where the ratio 
test fails. Even more can be said, for Additional 
Problem 18 asserts that if { a n } is any sequence of 
positive numbers, then 


” +1 —> L implies V ~a n -> L. 

a n 

In principle, therefore, the root test is more power¬ 
ful than the ratio test.) 

Consider the series 


X a n ~ cl + b + a 2 + b 2 + a 3 + b 3 + • • • , 

M=1 

where 0 < a < b < 1. Show that the ratio test fails, and 
establish convergence by using the root test. 


SECTION 13.8 
62 Show that the series 


V2-1 V2 + 1 + V3 - 1 V3 + l + 

diverges. Does this contradict the alternating series test? 
63 Use the alternating series test to prove the existence of 
Euler’s constant y as follows. 

(a) Show that the series 

1 _ f 2 dx_ J f 3 dx l _ f 4 dx 

Ji x 2 h x + T " Ja jc 


+ 


p dx_ 
J4 x 


+ 


converges. 

(b) If the sum of the series in (a) is denoted by 7 , show 
that 
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* 2 -' " 1 + 2 + 3 + 


H-In n, 

n 


so that 


lim (1 + -J- + 


H-In n = y. 

n 1 


64 Use the alternating series test to show that the improper 
integral 


f°° sir 

Jo J 


dx 


converges. Hint: Sketch the graph of y = (sin x)/x for 
x > 0 and observe that it consists of an infinite number 


of parts, each covering an interval of length 7r and ly¬ 
ing alternately above and below the jc-axis; and then ex¬ 
press the integral as an alternating series, 


■ dx = 


77 sin x 


dx + 


f 2w _sir 

Jtt J 


■ dx + 


= a \ — + <23 — a <\ + * ' ' • 

*65 Show that 1 — { + 7 ~i^+*' ,= yln 2 + 7r/3 V3. 
*66 Show that 

X (-1)" — = y In 2 - UIn 2) 2 . 

n = 2 11 Z 

Hint: See equation (9) in Section 13.6. 


APPENDIX 1: EULER’S In Section 13.4 we encountered Euler’s formula for the sum of the reciprocals of the 
DISCOVERY OF THE squares, 


oo ^ ^ 7 

FORMULA Y —=■ = 

V n 1 6 


111 7I 2 

+ 4 + _ 9 + l6 + ~6‘ 


(1) 


Our purpose in this appendix is to understand the heuristic reasoning that led Euler to this 
wonderful discovery. 

We begin with some simple algebra. If a and b are =£ 0, then it is clear that these num¬ 
bers are the roots of the equation 




This equation can also be written in the form 


1 - ( — + — 
v a b 


x H-- x 

ab 


2 _ 


o, 


( 2 ) 


(3) 


in which it is evident that the negative of the coefficient of x is the sum of the reciprocals 
of the roots. If we replace jc by jc 2 , and a and b by a 2 and b 2 , then (2) and (3) become 


£_ 

b 2 


= 0 


and 


_ ( _L + J_U + _L_ 

V b 2 a 2 b 2 


0 . 


(4) 


(5) 


The roots of (4) are plainly ±a and ±b, and (5) is the same equation in polynomial form, 
from which we see that the negative of the coefficient of jc 2 is the sum of the reciprocals 
of the squares of the positive roots. This pattern persists as we move to equations of higher 
degree, for 


x 


= 0 


(whose roots are obviously ±a, ±b , and ±c) can be written as 

,_a + -u-u 


b 2 


\ a 2 b 2 


b 2 c 2 


a 2 b 2 c 2 


r 6 = 0, 


and so on. 









o° J 

APPENDIX I: EULER'S DISCOVERY OF THE FORMULA v -y 

^ n 1 


TT~ 

~6~ 


All 


Let us now consider the transcendental equation 

sin x = 0 


or 


x — 




= 0 . 


This can be thought of as “a polynomial equation of infinite degree” with an infinite num¬ 
ber of roots 0, ± 7 r, ±27r, ±37 t, .... The root 0 can be removed by dividing by x, which 
gives the equation 


x 


or 


1 - 




• = 0, 


with roots ±7r, ±27 t, ±3 7 r,.... In the light of our knowledge of the roots of this equa¬ 
tion, the situation in the previous paragraph suggests that the infinite series 

sin x _ _ x 2 x 4 x 6 

+ IT"" 7! 


can 


be written as an “infinite product” 


sin x 


7 r 


Further, our analogy also suggests that 




i 

47T 2 


47T 2 / 


+ 9tt 2 + 


.jeLL.. 

9t t 2 ) 


( 6 ) 


3!’ 


from which Euler’s formula (1) follows at once. As an additional observation, it is inter¬ 
esting to note that if we put x = irfl in (6), we find that 


2 


7 7 




I AVI 1 

2*2/14*4 


A 1 
6 ‘ 6 


which is equivalent to 


7 r 
2 




m 



This is Wallis’s product , which was rigorously proved in Appendix 2 at the end of Chap¬ 
ter 10. 

These daring speculations are characteristic of Euler’s unique genius, but we hope that 
no students will suppose that they carry the force of rigorous proof. It will be seen that 
the crux of the matter is the question of the meaning and validity of (6), which is known 
as Euler’s infinite product for the sine. The shortcomings of this discussion invite the con¬ 
struction of a general theory of infinite products, within which formulas like (6) can take 
their place as firmly established facts. This aim is achieved in more advanced fields of 
mathematics. 
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APPENDIX 2: MORE 
ABOUT IRRATIONAL 
NUMBERS. 7 t IS 
IRRATIONAL 


Readers who have never thought about the matter before may wonder why we care about 
irrational numbers. In order to understand this, let us assume for a moment that the only 
numbers we have are the rationals—which, after all, are the only numbers ever used in 
making scientific measurements. Under these circumstances the symbol V2 has no mean¬ 
ing, since there is no rational number whose square is 2. One consequence of this is that 
the circle x 2 + y 2 = 4 and the straight line y = x through its center do not intersect; that 
is, in spite of appearances, there is no point that lies on both, because both curves are dis¬ 
continuous in the sense of having many missing points, and each threads its way through 
a gap in the other. This suggests that the system of rational numbers is an inadequate tool 
for representing the continuous objects of geometry and the continuous motions of physics. 
In addition, without irrational numbers most sequences and series would not converge and 
most integrals would not exist; and since it is also true that e and tt would be meaning¬ 
less (we prove below that i t is irrational), the enormous and intricate structure of math¬ 
ematical analysis would collapse into a heap of rubbish so insignificant as to be hardly 
worth sweeping up. As a practical matter, it is clear that if the irrationals did not exist, it 
would be necessary to invent them. It was the ancient Greeks who discovered that irra¬ 
tional numbers are indispensable in geometry, and this was one of their more important 
contributions to civilization. 

In Section 13.4 we proved that e is irrational by assuming the contrary and construct¬ 
ing a number a which was then shown to be a positive integer < 1 —an obvious impos¬ 
sibility. This strategy is also the key to the proofs of the following two theorems, but the 
details are somewhat more complicated. 

We shall need a few properties of the function f(x) defined by 


/(*) = 


x n (\ ~x) n 


1 V k 
= <***■ 

U ‘ k — n 


(1) 


where the c*’s are certain integers and n is a positive integer to be specified later. First, it 
is clear that if 0 < jc < 1, then we have 


0 <f(x) < 4- (2) 

n\ 

Next,/(0) = 0 and f (m) ( 0) = 0 if m < n or m > 2n\ also if n < m < 2n, then 

/ (m) (0) = ^l Cm , 

and this number is an integer. Thus, f(x ) and all its derivatives have integral values at 
x = 0. Since /(I - x) = f(x\ the same is true at x = 1. 


Theorem 1 e r is irrational for every rational number r ± 0. 


Proof If r = p/q and e r is rational, then so is ( e r ) q = e p . Also, if e~ p is rational, so is 
e p . It therefore suffices to prove that e p is irrational for every positive integer p. 

Assume that e p = a/b for certain positive integers a and b. We define/(x) by (1) and 
F(x) by 

Fix) = p 2n f{x) - p 2n - x f\x) + p 2n ~ 2 f'\x) - pf^-'Xx) +fi 2n Xx\ (3) 

and we observe that F(0) and FXl) are integers. Next, 

[e px F(x)\ = e px [F\x) + pF(x)] = p 2n+l e px f(x), (4) 

where the last equality is obtained from a detailed examination of F\x) + pF(x) based on 
(3). Equation (4) shows that 
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b J Q l p 2n+ 'e px f(x) dx = b[eP x F(x)]l = aF( 1) - bF( 0), 


which is an integer. However, (2) implies that 



and since the expression on the right —> 0 as n —> °° (by Example 3 in Section 13.2), it 
follows that the integer aF( 1) — bF( 0) has the property that 


0 < aF( 1) - bF( 0) < 1 


if n is large enough. Since there is no positive integer < 1, this contradiction completes 
the proof. 

If we say that a point (*, y) in the plane is a rational point whenever both x and y are 
rational numbers, then this theorem asserts that the curve y = e x traverses the plane in 
such a way that it misses all rational points except (0, 1). An equivalent statement is that 
y = In x misses all rational points except (1,0), so In 2, In 3, . . . are all irrational. It can 
also be proved that y = sin x misses all rational points except (0, 0), and that y = cos x 
misses all rational points except (0, 1).* Each of these theorems implies that tt is irra¬ 
tional, since sin tt = 0 and cos tt = -1. However, we prefer to prove the irrationality of 
77 by the following more direct argument. 


Theorem 2 77 is irrational 


Proof It is clearly sufficient to prove that 77 2 is irrational, so we assume the contrary, 
that 77 2 = a/b for certain positive integers a and b. We again define/(x) by (1), but this 
time we put 

F(x) = b n [v 2n f(x) - 7T 2n ~ 2 f'\x) + 7T 2n ~Y 4) M -+ (-l ) n f i2n) (x)l (5) 

and again we observe that F(0) and F(l) are integers. A calculation based on (5) shows 
that 

- 7 - [F\x) sin 77X - 7 rF(x) cos ttx\ = [F"(x) + tt 2 F(x)] sin 7 tx 
dx 


= b n 7r 2n+2 f(x) sin Tjx = TT 2 a n f(x) sin 7 tx, 


so 



which is an integer. But (2) implies that 



if n is large enough; and this contradiction—that F( 1) + F(0) is a positive integer < 1 — 
concludes the proof. 

The underlying method of proof in Theorems 1 and 2 was devised by the French math¬ 
ematician Hermite in 1873, but the details of the latter argument were first published by 
Niven in 1947. 


*The details can be found in Chapter II of I. Niven’s excellent book, Irrational Numbers (Wiley, 
1956). 
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INFINITE SERIES OF CONSTANTS 


APPENDIX 3: THE 
SERIES 21 !p n OF THE 
RECIPROCALS OF 
THE PRIMES 


About 300 B.C. Euclid gave the classical proof of the fact that there exist infinitely many 
prime numbers. About 2000 years after the time of Euclid, in 1737, Euler discovered two 
fundamentally different new proofs, and the methods he used laid the foundations of a 
new branch of mathematics that is now called analytic number theory . 

In order to understand Euler’s ideas, we begin by recalling that the harmonic series 


1 



• + 


i + 


n 


diverges. On the other hand, we know that for any exponent 5 > 1, the series 



converges, and the so-called zeta function is defined to be the sum of this series, 

considered as a function of the variable 5.* 

Euler’s basic discovery was a remarkable identity connecting the zeta function with the 
prime numbers, 

» = n-r=V (l) 

where the expression on the right denotes the product of the numbers 1/(1 - p~ s ) for all 
primes p = 2, 3, 5, 7, 11,..., that is, where 


n 


l 

l - \/p s 


_i_l_l_i_ 

1 - 1/2* ‘ 1 - 1/3* ’ 1 - 1/5* ‘ 1 - 1/7* 


To see how the identity (1) arises, we recall that the geometric series 1/(1 — x) = 1 + 

x + x 2 + • • • is valid for \x\ < 1, so for each prime p we have 

1 — \lp s ^ p s + p 2s + p 3s ~l” 

Without stopping to justify the process, we now multiply these series together for all 

primes p, remembering that each integer n > 1 is uniquely expressible as a product of 

powers of different primes. This yields 



which is the identity (1). 

One of Euler’s arguments is based on (1) and goes this way. We begin by observing 
that if there were only a finite number of primes, then the product on the right side of (1) 
would be an ordinary finite product and would clearly have a finite value for every 
s > 0, even for 5 = 1. However, the value of the left side of (1) for s = 1 is the harmonic 
series, 


*We denote the independent variable by s (instead of p, as in Section 13.5) in order to retain the no¬ 
tation that is customary in the theory of numbers. 
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«D = 1+y + T+■ ■ ■, 

which diverges to infinity. This argument by contradiction, which can be made into a rig¬ 
orous proof, shows that there must be an infinite number of primes. Euler’s second argu¬ 
ment rests on his discovery that the series of the reciprocals of the primes diverges , 


v± = l + I + I + l + -L + ... = »- 

^ p n 2 3 5 7 11 

for if there were only a finite number of primes, it is obvious that this series couldn’t 
sibly diverge. 

The proof of (2) that we give here starts with the geometric series 


1 - 


1 + + -k- + 
2 2 2 


+ — + "ry + 

3 3 2 


+ 5 + 5^ + 


( 2 ) 

pos- 


_J_ 

1 - 1 IPn 



If we multiply these series together by forming a new series whose terms are all possible 
products of one term selected from each of the series on the right, then this new series 
converges in any order to the product of the numbers on the left.* Since every integer 
greater than 1 is uniquely expressible as a product of powers of different primes, the prod¬ 
uct of these series is the series of the reciprocals of all positive integers whose prime fac¬ 
tors are < p n . In particular, all positive integers < p n have this property, so 

_i_^i 

1 — 2 1 - J 1 “ l, Pn *= , k 


fp „+1 dx 

> J i — = In (p n + 1) > In Pn . 

(It is in the transition here from the sum to the integral that we use the ideas of Section 
13.6.) It follows that 


- iVi - 2 


1 


Pn) In Pn’ 


and taking logarithms of both sides yields 


^ In (1 — < - In In p n . 


We next show that 


—— < In 
Pk 


Pk 


for when this is applied to (3), we get 


(3) 


( 4 ) 


'This statement follows from one of the theorems proved in Appendix A.13. 
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_2 I ~ ln ln Pn 

k= 1 P k 
or 

n i j 

S - > T ln 
*= 1 ^ Z 

and our conclusion that 2l/p n diverges will follow from the fact that In In p n —» °o To es¬ 
tablish (4) and complete the argument, it suffices to observe that the line y = 2x lies be¬ 
low the curve y = In (1 4- x) on the interval - { < x < 0 and that every prime is > 2* 

*For other proofs of (2), see I. Niven, Amer. Math. Monthly, 78 (1971), pp. 272-273; and C. V. Eyn- 
den, Amer. Math. Monthly, 87 (1980), pp. 394-397. 



POWER 

SERIES 


We have remarked before that polynomials are the simplest functions of all, and 
power series can be thought of as polynomials of infinite degree. An expansion 
of a function/(x) in a power series, 

f(x) = a 0 + a\x + a 2 X 2 + • • • + CLnX" + • • ■ , 


14.1 

INTRODUCTION 


is therefore a way of expressing/(x) by means of functions of a particularly sim¬ 
ple kind. 

Our introductory investigations in Section 13.1 led us to the following impor¬ 
tant power series expansions: 


= 1 X + X 2 X 3 + • • • , 

1 + X 

valid for - 1 < x < 1; 

(i) 

X 2 X 3 

In (1 + *) = *- y + --, 

validity unknown for the moment; 

(2) 

, X 3 X s 

tan 1 x = x —— + —-- ■ ■ , 

validity unknown for the moment; 

(3) 

X 2 X 3 

e * =1+ * + ¥ + ¥ + '" 

, validity unknown for the moment. 

(4) 


We have rigorously proved formula (1), but at this stage of our work the others 
are supported only by the plausibility arguments given in Section 13.1, which are 
suggestive but not conclusive. What is needed for converting plausibility into cer¬ 
tainty is a method for approximating a function by polynomials, together with a 
useful expression for the error committed by making such an approximation. This 
method is provided below by Taylor’s formula with remainder. The remainder— 
or expression for the error—will enable us to obtain definite information about 
the x’s for which expansions like (2), (3), and (4) are valid. We will also be able 
to use polynomial approximations for computing numerical values of functions 
to any previously specified degree of accuracy. 

Students of calculus do not always understand that infinite series are primar¬ 
ily tools for the study of functions. For instance, in Problem 11 of Section 13.4 
we proved the validity of the power series expansions of the sine and cosine, 

X ^ x 2 X 4 

sinx = x~ — + —-■■■ and cos * = 1 - — + — , (5) 


483 




484 


POWER SERIES 


for all values of x; but since these functions were presumably well known be¬ 
forehand, it may not be clear what purpose is served by expressing them in this 
form. Expansions of known functions have their own importance, especially in 
the computation of numerical values for these functions. However, in advanced 
work it often happens that an unknown power series arises from some other 
source, perhaps as a solution of a differential equation. In such a case the series 
is used to define the otherwise unknown function which is its sum, so the series 
itself is the only tool we have for investigating the properties of this function. 
This situation can best be understood by supposing that the basic facts about 
sin x and cos jc —their continuity, the identities they satisfy, their properties with 
respect to differentiation and integration, etc.—could only be discovered by 
examining the series given above. For these familiar functions such a tortuous 
process is of course unnecessary, but for many important functions of higher 
mathematics there is no practical alternative. 

Thus, we discussed series of constants in Chapter 13 as a prelude to studying 
power series in this chapter. And our primary motive in studying power series is 
to learn what we can about the sums of such series. Also, once the expansions 
(2), (3), and (4) are firmly established, it will be pleasant to know that so many 
familiar but very different functions share the property of being expressible in 
the single pattern provided by power series. These functions are therefore—in a 
manner of speaking—“brothers under the skin.” This kinship is the central theme 
of the branch of advanced mathematics called complex analysis. 

Meanwhile, we emphasize that the power series expansions (1) to (5) are among 
the most important formulas in all of mathematics. In addition to fully under¬ 
standing them—and this is one of the main purposes of the present chapter— 
students should also memorize them. 


14.2 

THE INTERVAL OF 
CONVERGENCE 


To start at the beginning, a power series is a series of the form 

^ a n x n = cio + a { x + ci 2 X 2 + • • • + a n x n + • ■ • , (1) 

n= o 

where the coefficients a n are constants and x is a variable. In view of what we 
have said in the preceding section, it should be clearly understood here that this 
series stands on its own and is not assumed to be associated with any given func¬ 
tion. Since power series are almost always indexed from n — 0 to n — we 
often simplify the notation by writing (1) as 'Za n x n . 

The geometric series 


X x n = 1 + jc + x 2 + • • • (2) 

is evidently the simplest power series. We know that this series converges for 
|jc| < 1 and diverges for |jc| > 1. In general, we expect a power series to converge 
for some values of x and to diverge for others. 

We will clearly be very interested in knowing the Vs for which a given power 
series ^a n x n converges. For any such jc, the sum of the series is a number whose 
value depends on jc and is therefore a function of x. If we denote this function 
by/(;c), then /(jc) can be thought of as defined by the equation 


fix) = X a »x n - 


(3) 
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Sometimes a power series has a known function as its sum. For example, if 
\x\ < 1, we know that the series (2) has 1/(1 — x) as its sum. However, in gen¬ 
eral there is no reason to expect that the sum of a convergent power series will 
turn out to be a function that we can recognize from our previous experience. 

We will be concerned with two major groups of questions. First, what prop¬ 
erties does the function /(jc) defined by (3) have? Is it continuous? Is it differen¬ 
tiable? If it is differentiable, can its derivative be calculated by differentiating (3) 
term by term? And second—turning the whole situation around—if a function 
f(x) is given beforehand, under what circumstances does it have a power series 
expansion of the form (3)? How can this expansion be calculated, and what can 
be said about the jc’s for which the expansion is valid? These are some of the 
issues we study in the next few sections. 

We begin by determining the nature of the set of points at which an arbitrary 
power series converges. 

First, a few examples. It is clear that every power series converges for x = 0. 
Some series converge only for this value of jc, for instance, 


^ n n x n = x + 2 2 x 2 + 3 3 * 3 4- 4 4 x 4 +■■■.* (4) 

n= 1 

To see this, it suffices to observe that for any jc ^ 0 we have \nx\ > 1 if n is large 
enough, and therefore the nth term (nx) n does not approach zero and the series 
cannot converge. At the opposite extreme are series like 




(5) 


which converges for all values of jc. We establish this by showing that (5) is ab¬ 
solutely convergent for every jc, and this is easy to prove by the ratio test: 

[jc n+] /(n + 1)!| ^ |jc |” +1 nl \x\ 

\x n ln\\ (n + 1)! " \x\ n n + 1 ' 

(When using the ratio test, it is necessary to test for absolute convergence, be¬ 
cause this test applies only to series of positive terms.) As a simple example that 
lies between these extremes, we have the geometric series (2), which converges 
on the interval |jc| < 1 and diverges everywhere else. 

Our task is to discover the structure of the set of all jc’s for which a given power 
series converges. The examples discussed above show that there are at least three 
possibilities: This set may consist of the single point jc = 0, or the entire real line, 
or a finite interval centered at the origin. We will prove that these are the only 
possibilities. 

In order to establish this, we need the following lemma: 


If a power series 2 a n x n converges at x \, x\ =£ 0, then it converges absolutely at all x 
with |;t| < |jci |; and if it diverges at x i, then it diverges at all x with \x\ > \x\\. 


The proof is quite easy. If Y,a n x\ n converges, then a n x\ n —» 0. In particular, if n 
is sufficiently large, we have |a n jci y, | < 1, and therefore 


< r" 


(6) 


*This series is indexed from n— 1 to n = °° because n n has no meaning when n = 0. 
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where r = \xfx\\. Now suppose that |x| < \x\\. Then we have 

r = — < 1, 

*1 

and the inequality (6) tells us that 2|a /1 x ,, | converges by comparison with the 
convergent geometric series Zr n . This proves the first statement. To prove the 
second statement, we assume that Za n x\ n diverges. Then Za n x n cannot converge 
for any x with \x\ > |jcj|, for if it does, then what has just been proved implies 
the absolute convergence—and therefore the convergence— ofZa n x\ n , and this 
contradicts our assumption. 

We are now in a position to state and prove the main facts about the conver¬ 
gence behavior of an arbitrary power series. 

Given a power series Za n x n , precisely one of the following is true : 

(i) The series converges only for x = 0. 

(ii) The series is absolutely convergent for all x. 

(iii) There exists a positive real number R such that the series is absolutely convergent 
for |x| < R and divergent for |x| > R. 

The argument goes this way. If (i) is not true, then the series converges for some 
x\ =£ 0, and by the lemma we know that the positive number r = |xi| has the 
property that Za n x n is absolutely convergent on the interval — r < x < r. Let S 
be the set of all positive numbers r with this property. If S is unbounded, then 
(ii) is true. Now suppose that neither (i) nor (ii) is true, so that S is a nonempty 
set of positive numbers which has an upper bound. By the basic completeness 
property of the real number system, S has a least upper bound. This means that 
there is a smallest number R such that r ^ R for all r’s in 5.* It is now easy to 
see that R has the properties stated in (iii). 

The positive real number R in case (iii) is called the radius of convergence of 
the power series. We have seen that the series converges absolutely at every point 
of the open interval (~R, R ), and diverges outside the closed interval [-R, R]. 
No general statement can be made about the behavior of the series at the end¬ 
points R and —R. There are examples of series that converge at both endpoints, 
or diverge at both, or converge at one and diverge at the other; to find out what 
happens for any particular series, we must test each endpoint separately. The set 
of all x’s for which a power series converges is called its interval of convergence. 
These ideas are illustrated in Fig. 14.1. 

It is customary to put R = 0 when the series converges only for x = 0, and to 
put R — <x> when it converges for all x. This convention allows us to cover all 
possibilities (except endpoint behavior) in a single statement: 

Eveiy power series Za n x n has a radius of convergence R, where 0 < R < ©o, with the 

property that the series converges absolutely if |x| < R and diverges if |x| > R. 

It should be noticed that if R = 0, then no x satisfies the condition |x| < /?, and 
if R — then no x satisfies the condition |x| > R, so in both of these cases our 
general statement is true by default. 


*See Appendix A.l for a general discussion of least upper bounds. 
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Figure 14.1 The interval of conver¬ 
gence. 

-R 

0 
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If a power series 2a n x n is given, how do we find its interval of convergence? 
The first step is to find the radius of convergence R . There is a simple formula 
for R that works in many situations: 


R = lim 


a n 

Qn+ 1 


(7) 


provided this limit exists—and has °o as an allowed value. This follows directly 
from the ratio test, because the series converges absolutely, or diverges, accord¬ 
ing as the number 


lim 


\a n x n | 


= lim 


Qn+ 1 

a n 


_ \x_ 

lim \a„/a»+i\ 


is < 1 or > 1, that is, according as 


\x\ < lim 


a n 

a n +1 


or 


x\ > lim 


a n 

Qn+ 1 


This establishes (7). 

The second step is to test the behavior of the series at the endpoints. 


Example 1 


Find the interval of convergence of the series 



As indicated, this series is understood to be indexed from n = 1 to n — <*>, be¬ 
cause a n — lln 2 has no meaning for n = 0. 


Solution Here we have 

a n l//i 2 _ {n + l) 2 

a n +i l/(/i + l) 2 n 2 

so R = 1. In this example the series converges at both endpoints: At x = 1 the 
series is 21 In 2 , which is a convergent p-series, and at x = —1 it is 2(—1 ) n /n 2 , 
which converges by the alternating series test. The interval of convergence is 
therefore the entire closed interval [—1, 1], 


Example 2 Find the interval of convergence of the series 


^ 2 _ 3 4 9 

E^*" = 2+ 7* + 3i* 2 + - 


Solution This time we have 

_an_ _ n + 2 3” +1 n + 2 3 ^ 3 

a n+ 1 3 n n + 3 « + 3 
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so R = 3. In this case the series diverges at both endpoints: At x = 3 it becomes 
2 + 3 + 4 + • • • , and at x = -3 it becomes 2 - 3 4- 4 — • • • . The interval 
of convergence is therefore ( — 3, 3). 


Example 3 


Find the interval of convergence of the series 


I 


Hrr-l+f + f + 

n + 1 2 3 


Solution Here we have 

a n _ V{n + 1) _ n + 2 ^ ^ 

a n +1 1 1(n + 2) m -H 1 

so R = 1. At x = 1 the series is the harmonic series 1 +{ + {+••• , which di¬ 
verges, and at x = — 1 it is the alternating harmonic series 1 — { + {—••• , 
which converges. The interval of convergence is therefore [ — 1, 1). 


Example 4 


Find the interval of convergence of the series 


X(-D" 


(2 n)! 


x ^ 

= 1 - — + — - 
2! 4! 


( 8 ) 


Solution Formula (7) does not apply directly because half of the coefficients of 
this power series are zero. Nevertheless, the series can be written in the form 


1 - 



where y = x 2 . Our formula can be applied to this series and yields 


<*n 

&n + 1 


1/(2 ft )! 

\R2n + 2)! 


(2n + 2)! 
(2 n)\ 


= (2 n + 1)(2 n + 2) 


o°. 


so R = oo for the y-series. It follows that the x-series also converges for every x, 
so the desired interval of convergence for (8) is (— 00 , oo). 


If a is a real number, the series 

X a »( x ~ = a 0 + a\(x - a) + a 2 {x - a) 2 + • • • (9) 

n=0 

is called a power series in x - a. For the sake of emphasis, the special case (1), 
in which a = 0, is often called a power series in x. If we put z = x ~ a, then 
(9) becomes Xa n z n , which is a power series in z. If we can determine the inter¬ 
val of convergence of this latter series by the methods previously described, then 
this information can be used to find the interval of convergence of (9). For in¬ 
stance, if Xa n z n has [-R, R ) as its interval of convergence, then it converges for 
—R < z < R- This means that —R <x-a<R or a — R < x < a + R. We then 
say that [a — R, a + R) is the interval of convergence, and that R is the radius 
of convergence, of the power series (9). 

For the sake of simplicity of notation, we shall confine most of our detailed 
discussions to power series in x. 
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PROBLEMS 


Find the interval of convergence of each of the following 
power series. 

1 X^r*"- 2 H n ' x "■ 


3 y 
5 X 
7 X 
9 X 
n X 


13 X, 
^2 n + 


15 X 


(-1)” 

In + 1 

(- 2 )" 


._, y' 1 

17 X 7-• 

^ In n 

19 y ( ~ 1)n x". 

^ Ai(ln ri) 2 


v- 2 n 

4 Y— jc". 
^ nr 


14 y n 2 x n . 

X 


(~D" +I K n 

V7i2» 


18 X— 

^ n 

'in 

20 irf*"- 


21 X £<•«-«>’■ 22 


22 X jgf'(* - 2)" 24 y nJ2 . . 


n 2 + 1 
(x -3)” 


25 X ^ ~ 10) '’- 26 X ( ! )n+1 ( x - 3 )"- 

^ nl ^ n In n 


x n 

6 

, r n 

1 2 

n + 4 

27 Xy vix-er. 28 Xjr(x + 2r. 

x n 

8 

V 2" 2 n 

29 Find the radius of convergence of 
(a) the hypergeometric series 

tv' + \ 

^ (2n)! ' 

(3n)! x n 

10 

y x " 

, , ab , a(a + 1 )b{b + 1 ) 7 

1 + x + ^ / x L 

c 2 !c(c + 1) 

(2«)! • 

^ n(n + 1)‘ 

x 2n+1 

(-3)”' 

12 

y — . 

a(a+ \)(a + 2)b(b + \)(b + 2) , 

3!c(c + l)(c + 2) * 


(b) the Bessel function 


Ux) - 1 - ~2 + 


2 2 2 2 • 4 2 2 2 • 4 2 • 6 2 

30 Give an example of a power series with R = it. 

31 If infinitely many coefficients of a power series are 
nonzero integers, show that R < 1. 


Consider a power series Xa n x n with positive radius of convergence R. We saw 
in Section 14.2 that this series can be used to define a function/(x) whose do¬ 
main of definition is the interval of convergence of the series. Specifically, for 
each x in this interval we define f(x) to be the sum of the series, 

f(x) = ao + a\x + 02 * 2 + ’ * * + a n x n + • • • . (1) 


14.3 

DIFFERENTIATION AND 
INTEGRATION OF 
POWER SERIES 


This relation between the series and the function is often expressed by saying 
that Xa n x n is a power series expansion of f(x). For example, we know that if 
\x\ < 1, then 


1 

1 + x 


= 1 


- X + X 2 - JC 3 + • • • + (-1 ) n x n + • • • , 


( 2 ) 


because the geometric series 2(— \) n x n converges and has the sum 1/(1 + x). 
Accordingly, the function f(x) = 1/(1 + x) has the series on the right of (2) as a 
power series expansion. 

Polynomials, which are finite sums of terms of the form a n x n , are very simple 
functions. They are continuous everywhere, and can be differentiated and inte¬ 
grated term by term. Even though the sum of a power series can be a much more 
complicated function, it is still simple enough to share these three properties with 
polynomials inside the interval of convergence. 
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We give the following formal statement of these very important facts: 

(i) The function fix) defined by ( 1) is continuous on the open interval (-R, R). 

(ii) The function fix) is differentiable on ( —R , R), and its derivative is given by the 
formula 

fix) = a\ + 2a 2 x + 3ayx 2 + • • • + na n x n ~ l + • • • . (3) 

(iii) If x is any point in (-R, R), then 

[ fit) dt = a 0 x + \ a x x 2 + 2 a 2 x 3 + ■ ■ ■ + * a,,x" + ' + •••. (4) 

Jo 2 3 n -f 1 

The proofs of these statements depend on a special kind of convergence called 
uniform convergence. The details can be found in Appendix A. 15. 

Several comments are in order. 

First, we observe that if (ii) is applied to the function f(x) in (3), then it fol¬ 
lows that/'(;t) is itself differentiable. This in turn implies that/"(*) is differen¬ 
tiable, and so on. Thus, the original function fix) has derivatives of all orders. 
We can summarize the situation this way: In the interior of its interval of con¬ 
vergence, a power series defines an infinitely differentiable function whose de¬ 
rivatives can be calculated by differentiating the series term by term. The term- 
by-term differentiation can be emphasized by writing (3) as 

It is worth knowing that the term-by-term differentiability of a convergent series 
of functions is usually false; it is true here only because we are dealing with a 
special kind of series. As a simple example of the failure of this property, we 
mention the series 25T=i (sin nx)/n 2 , which is absolutely convergent for all x by 
comparison with the convergent series El In 2 , because |(sin nx)/n 2 \ < 1 In 2 . The 
difficulty arises with the term-by-term differentiated series E(cos nx)/n, because 
this series diverges for x = 0. 

In the case of (iii), if we prefer to avoid using the dummy variable t, then (4) 
can be written in the form 

J fix) dx = aox + y a x x 2 + j a 2 x 3 + * • * + a„x n+1 + • • •, ( 5 ) 

provided we find the indefinite integral on the left that equals zero when x = 0. 
The term-by-term integration of power series can be emphasized by writing (5) 
as 

J ^ a n x n ) dx = | ja n x n dx). 

We also point out that it is part of the meaning of (3) and (4) that the differ¬ 
entiated and integrated series on the right sides of these equations converge on 
the interval ( —R , R). We shall prove this at the end of this section. However, be¬ 
fore doing so we give several examples of the practical value of the procedures 
discussed here. 


Example 1 Find a power series expansion of In (1 + x). 
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Solution Our starting point is the fact that the derivative of In (1 + jc) is 
1/(1 + jc), and for \x\ < 1 this function has the power series expansion given by 
(2). Now, using (5) and the fact that In (1 + x) equals zero when jc = 0, and in¬ 
tegrating (2) term by term, we at once obtain 


In (1 + x) = x - 



+ (-i r 


y,n +1 


n + 1 


+ • • • 


= y ( -i) n+ i —. 

«=i n 

We know from the preceding discussion that this expansion is valid for \x\ < 1. 
Further, we know from Section 13.6 that it is also valid for x = 1, but our pre¬ 
sent methods give no information on this matter. 


Example 2 Find a power series expansion of tan 1 jc. 


Solution The derivative of this function is 1/(1 + jc 2 ), and we see by replacing 
jc by jc 2 in (2) that 


= 1 - ;c 2 + x 4 - 


+ (-l)"* 2 * + 


1 + x 2 

if |jc| < 1. Using (4) this time, we get 

1 v ^ 2 = f (1 - t 2 + t 4 - t 6 + • • •) dt 

J 0 1 + * *'0 


tan 




- y + ■ • • =X C-D” 


2/7 + 1 


for |jc| < 1. Again, our present methods give no information about what happens 
at the endpoints jc = ± 1 . 


Example 3 Find a power series expansion of e x . 


Solution In Section 8.3 we proved that e x is the only function that equals its 
own derivative everywhere and has the value 1 at jc = 0. To construct a power 
series equal to its own derivative, we use the fact that when such a series is dif¬ 
ferentiated, the degree of each term drops by 1. We therefore want each term to 
be the derivative of the one that follows it. Starting with 1 as the constant term, 


the next should be jc, then y- jc 2 , then * ■ 

2 2-3 

series 


and so on. This produces the 


r 2 r 3 Y n 

i + , + _ + ¥ + ... + _ + ... ( (6) 
which converges for all jc because 

/? = lim yyyy = lim( " + 1) = ~- 

We have constructed the series (6) so that its sum is unchanged by differentia¬ 
tion and has the value 1 at jc = 0. In view of the above remark, this establishes 
the validity of the expansion 
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for all x. 


e x 


+ x + l\ + 




Example 4 Find power series expansions of 1/(1 - x ) 2 and 1/(1 - x ) 3 . 

Solution We begin by noticing that 

■ L--iLf-U 

(1 — x) 2 dx \ 1 - X) 

The next steps are to expand 1/(1 — x) into the power series Zx n for |jc| < 1, and 
then to differentiate this series term by term: 

^±- = ± il+x+ x2 + ... +x n + ... ) 

= 1 + 2x + 3x 2 + • • • + nx”~ l + • • • 


= ^ nx n 1 = (n + \)x n . 

n = I « = 0 


Another differentiation yields 
2 d 


(1 - x) 3 dx 


(1 - x) 2 


= -r- (1 + 2x + 3x 2 +•••+, 
dx 


+ •■■) 


= 2 + 3 • lx + 4 • 3x 2 + 


+ n(n — \)x* 


— fvn-2 


SO 


3 = - [2 + 3 • 2x + 4 • 3x 2 + • • • + n(n - \)x' l ~ 2 + • • •] 


(I -x) 


= 1 


n(n - 1) 


n=2 


: n-2 = jf* (” + 2)(W + 1) ^ 


Example 5 Find the sum of the series 



Solution It is easy to see that R = 1, so the series converges to some function 
f(x) for \x\ < 1. Differentiating this series clearly simplifies it to 

f(x) = l+ ;c + ;t 2 + ■••+;<:" + ••• = — l — r 


Since /(0) = 0, integration now yields 

fix) = “In (I - x). 


Example 6 Find the sum of the series 
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x + 4x 2 + 9x 3 + ' ' ‘ = X n2jcn ■ 
i 


Solution Again we clearly have /? = 1, so the series converges to some func¬ 
tion f(x) for |jr| < 1. We can write 

/(*) = x + 4x 2 + 9x 3 + • • • + n 2 x n + • • • = xg(jt), 

where 

g(x) = 1 + 2 2 x + 3 2 jc 2 + • • • + n 2 x n ~ l + • • • . 

At this point we notice that 

g(x) = (jc + 2* 2 + 3x 3 + ■ • • + /ix" + • • •) 

= -j- [41 + 2 jc + 3jc 2 + • • • + h*”- 1 + • • •)]• 


By Example 4, 


1 + 2x + 3 * 2 + • • • + hx" 1 + • • • = —-- 7 , 

(l ~xr 


so 


and 



JC 

(1 -x) 2 


1 + x 
(1 -*) 3 


f(x) = 


x ~F x 2 
(1 -*) 3 ' 


In conclusion, we return to the unfinished business of showing that series (3) 
and (4) converge on the interval ( — R, R). 

The proof for (4) is easy: Since converges and 


the comparison test implies that ^ 


a n x n 
n + 1 


converges, and therefore 




a n x n _ y 

n + 1 " ^ 


1 

n + 1 


a n x 


n+ 1 


also converges. 

The proof for (3) is a bit more complicated. Let * be a point in the interval 
(-R, R) and choose e > 0 so that |*| + e < R . Since |*| + e is in the interval, 
2|fl n (|*| + e) n \ converges. In Problem 7 students are asked to show that the in¬ 
equality 

\nx n ~ l \ < (|*| + e) n 

is true for all sufficiently large ns. This implies that 

\na n x n ~ x \ ^ \a n (\x\ + e) n \, 

so the series 2 \na n x n ~ y \ converges by the comparison test. 
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PROBLEMS 


1 


Find power series expansions for the following functions, 
and determine the values of x for which these expansions 
are valid: 


(a) 


_1_ 

(1 +x) 2; 


(b) 


1 

(1 + x) 3 ‘ 


2 Show that 


by integrating another series. 

7 If 6 > 0, show that the inequality \nx n ~'\ ^ (|*| + ef is 
true for all sufficiently large rc’s. Hint: n ]/n |x| 1-(1/aj) —> \x\. 

8 On p. 41-7 in Vol. 1 of his Lectures On Physics (Addison- 
Wesley, 1964), Richard Feynman (Nobel Prize, 1965) 
writes: 


I 

>i = 0 


(n + 1 )(/i H- 2 )(n + 3) 


1 


0 -*) 4 


Find the sum of each of the following series: 

X 3 X^ Y 2n+\ 


«■>'+* + *+■ 

(c) X + 2x 2 + 3x 3 + 

(d) x + 2x 3 + 3x 5 + 
Show that 


2 n + 1 

r /i— I 

+ -- + 


+ nx n + • • • ; 
+ nx 2 "-' + • 


y- = - P 


* In (1 - t) 


dt. 


5 Show that the Bessel function 

x 2 x 4 

^=1-^+22742 


2 2 • 4 2 • 6 2 


+ • • 


satisfies the differential equation xy" + y + xy = 0. 
Obtain the series 

/ - 1 1 • 4 v*5 

in(* + VTT^) = *-- T + YTTy 

1-3-5 X 7 


2-4-6 7 


Thus the average energy is 

/ ^ v hco( 0 + x + 2x 2 + 3x 3 + • ■ •) 

i + x + x 2 + . . . • 

Now the two sums which appear here we shall leave for 
the reader to play with and have some fun with. When 
we are all finished summing and substituting for x in 
the sum, we should get—if we make no mistakes in the 
sum— 


{E) = 


hco 

ghu>/kT _ j * 


This, then, was the first quantum-mechanical formula 
ever known, or ever discussed, and it was the beautiful 
culmination of decades of puzzlement. 


Use Problem 3(c) to have the fun that Feynman recom¬ 
mends. (It is not necessary to know that h is Planck’s con¬ 
stant, but it is necessary to substitute x = e~ htolkT .) A few 
sentences on, Feynman writes: “This expression should, 
of course, approach kJ as 0. See if you can prove 
that it does—learn how to do the mathematics.” Prove that 
it does. 


We have solved the problem of determining the general nature of the function 
that is the sum of a convergent power series: Inside the interval of convergence, 
it is a continuous function with derivatives of all orders. We now investigate the 
TAYLOR SERIES AND converse problem of starting with a given infinitely differentiable function and 
TAYLOR’S FORMULA expanding it in a power series. In Section 14.3 we established several such ex¬ 
pansions for a few special functions with particularly simple derivatives. Our pur¬ 
pose here is to consider a method of much greater generality. 

It may seem that the coefficients of a convergent power series are not con¬ 
nected with one another in any necessary way. In fact, however, they are bound 
together by an invisible chain, which we now make visible. 

To this end, let us assume that a function /(x) is the sum of a power series 
with positive radius of convergence, 

fix) = ^ dnX n = a 0 + d\x + a 2 x 2 + • • • , R > 0. (1) 

n= o 


14.4 
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By the results of Section 14.3, repeated term-by-term differentiation is legitimate 
and yields 

f\x) = a\ + 2ci2X + 3 ayx 2 + • • ■ , 
f(x) = 1 * 2^2 + 2 • 3a^x + 3 ■ 4 a^x 2 + • • ■ , 
f"\x) = 1 • 2 • 3a 3 + 2 • 3 • 4 a# + 3 • 4 • 5 a 5 x 2 + • • • , 
and in general, 

f( n \x) = n\a n + terms containing x as a factor. 

We know that these series expansions of the derivatives are valid on the open 
interval \x\ < R . By putting x = 0 in these equations we obtain 

/(0)=flo, /'(0) = a\ 9 f"(0) = 1 »2fl 2 , 

/"'(0) = 1 ■ 2 ■ 3fl 3f ..., f<"X0) = n\a n , 


so 


_ /"( 0 ) 
- - j]-, 


flo=/(0), ai =/'( 0), 

_ no) _ / ( ^(Q) 

a 3 3 | . •••> a n n . 


( 2 ) 


These are very remarkable formulas, for they tell us that iff(x) has a power 
series expansion of the form (1), then its coefficients must be the numbers given 
by (2). The series (1) therefore becomes 

f(x) =/(0) +f'(0)x + + • • • + + . . . . ( 3 ) 


The power series on the right of (3) is called the Taylor series off(x) [at x— 0]. 
The following conclusion is implicit in this discussion: 


If a function is represented by a power series with positive radius of convergence, then 
there is only one such series and it must be the Taylor series of the function. 


Briefly, power series expansions are unique, because (2) tells us that the coeffi¬ 
cients are uniquely determined by the function itself. If we use the standard con¬ 
ventions mentioned earlier, that 0! = 1 and that the zeroth derivative of f(x) is 
f(x) itself [/ (0) U) =/(*)], then (3) can be written as 


m =£ 

n= 0 


f (n \ 0) 

n\ 


x n . 


(4) 


The numbers a n - f^ n \0)/n\ are called the Taylor coefficients of f(x).* 

Equation (3) was true in the preceding discussion because we started with a 
convergent series whose sum is denoted by f(x). We now start with a function 
f(x) that has derivatives of all orders throughout some open interval I containing 


■"Brook Taylor (1685-1731) was secretary of the Royal Society and an enthusiastic supporter of New¬ 
ton in his acrimonious controversy with Leibniz and the Bemoullis about the invention of calculus. 
Taylor published his power series expansion of a function in 1715, but only as a formula and with¬ 
out any consideration at all of the issue of convergence. This expansion had already been published 
by John Bernoulli in 1694. Taylor was fully aware of this fact, but chose to ignore it out of partisan 
malice. In those days science roused passions that today we see only in politics and religion. 
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the point x = 0. We can form the Taylor series on the right of (3) and ask the 
question, Is equation (3) a valid expansion of f(x) on the interval /? Lest there 
be any misunderstanding, we state as clearly as possible that this equation is not 
always valid, and whether it is or not depends entirely on the individual nature 
of the function f(x). In Remark 2 we will give an example of an infinitely dif¬ 
ferentiable function whose Taylor series converges everywhere, but not to the 
value of the function, so in this particular case equation (3) is false. 

In order to put our ideas on a firm foundation, we proceed as follows. Break 
off the Taylor series on the right side of (3) at the term containing x tl and define 
the remainder R tl (x) by the equation 

fix) = /(0) + f'(0)x + ffr-x 2 + ■ ■ ■ + - f - X n + R„(x). (5) 

Then the Taylor series on the right side of (3) converges to the function f(x) 
precisely when 

lim R n {x) = 0. ( 6 ) 


These equations don’t really solve anything, because R n (x) is defined to be what¬ 
ever it takes to make (5) true, and (6) is merely the meaning of the statement that 
the Taylor series converges to the function. This approach is useful only if we 
can show that R n (x) can be expressed in a form that makes it feasible to try to 
prove (6) in the case of particular functions. We emphasize that (6) is not always 
true, because (3) is not always true. The most convenient general formula for 
R n (x) is 


Rn(x) = 


(«+D! 


(7) 


where c is some number between 0 and x. When R n (x) is expressed this way, (5) 
is called Taylor s formula with derivative remainder* The proof of (7) that we 
give is fairly technical, and is placed in Remark 3 so that students who wish to 
skip it can easily do so. 

To summarize, Taylor's formula with derivative remainder states that under 
the above assumptions the function/(*) can be written in the form 


/M=/(0)+/'(0)* + 


/"( 0 ) 

2 ! 


x 2 + 


f {n \ 0 ) 


x” + 


f {n+l \c) 

(n + 1 )! * 


where c is some number between 0 and x. 

We now give several illustrations of the use of (6) and (7). First, however, we 
observe that 


lim ^- 7 - = 0 ( 8 ) 

n ! 


for every x, because the series Xjc'7 n\ is absolutely convergent everywhere. We 
shall need this fact in our first two examples. 


Example 1 Find the Taylor series for the function /(jc) = e x , and use (6) and (7) 
to prove that it converges to e x for every x. 

Solution We clearly have 


*There are other forms of the remainder that we do not discuss. 
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f(x) = e x . 

/(0) 

f’(x) = e\ 

/'(0) 

f'\x) = e\ 

/"(0) 


and so on. By substituting in (3) we obtain 

v 2 Y 


= 1 + x + Y\ + 


• + ■ 


+ • 


n\ 


1 ; 

1 ; 

1; 



(9) 


and to prove the validity of this expansion by our present methods, we examine 
the remainder R n (x). For any x, the maximum value of the exponential function 
on the interval from 0 to jc is easily seen to be its value M at the right endpoint. 
By (7) and (8) we therefore have 


\Rn(x)\ 


(n + 1)! * 


(n + 1)! 




< A/ 


(*+ D! 


0, 


so (9) is valid for all x. Of course, we established (9) in another way in Section 14.3. 


Example 2 Find the Taylor series for /(jc) = sin jc, and use (6) and (7) to prove 
that it converges to sin jc for every x. 


Solution We can arrange our work as follows: 

fix) = sin x, /(0) *= 0; 
fix) = cos x, f'i 0) = 1; 
f”ix) = -sin x, /"(0) = 0; 
fix) = -cos x, /"'(0) = -1; 

and the subsequent derivatives follow this same pattern. By substituting in (3), 
we obtain 


sin jc = jc — 



■ • • + 


(- 1 )" 


x 2 “ +1 
( 2 n + 1)! 


+ • • • 


= 1 (-D" 

n= 0 


iln + 1)!' 


( 10 ) 


So far, all we know is that if sin x has a power series expansion, then that ex¬ 
pansion must be (10). To prove that (10) is actually true for every jc, we use (6) 
and (7). Since either 

|/ (w+1) (jc)| = |sin x\ or |/ ( " +1 ) (jc)| = |cos jc|, 

it is clear that |/ ( " +1) (c)| < 1 for every number c. Therefore by (7) and (8) we 
have 


\Rn(x)\ = |/ ( ” +1) (c)| 


An+X 


in + 1)! 


|jc |“ +1 
in + 1)! 


so (10) is true for all jc. Students will remember that we proved (10) in Prob¬ 
lem 11 of Section 13.4 by a very different method, one requiring considerable 
ingenuity. Our present method has the advantage of being straightforward and 
systematic. 
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Example 3 Find the Taylor series for cos x, and prove that it converges to 
cos x for every x. 

Solution We could proceed directly, as in Example 2. Instead, this is left for 
students to carry out in the problems, and we obtain the desired series by dif¬ 
ferentiating (10), 


cos Jt = 



■ • • + (-1)" 


(2/i)! 


+ • • • 


= X (-D” 


n = 0 



(ID 


The validity of this expansion is guaranteed by the results of Section 14.3. Since 
we have found a power series expansion for cos x, we know by the discussion 
of uniqueness given above that this series must be the Taylor series. 


The three series established in these examples can be used to find power se¬ 
ries expansions for many other functions. Thus, since (9) is true for every x, a 
power series representation for e~ x2 can be found by substituting -x 2 for x. This 
yields 

y4 y6 y2n 

e-* 2 =1-Jt 2 + — - — + ■ ■ ■ + (-1)” — +••■. (12) 

2! 3! n\ 

We can apply this formula to evaluate the definite integral 

( e~ xl dx, 

Jo 


even though the corresponding indefinite integral cannot be calculated in terms 
of elementary functions. Term-by-term integration of (12) gives 


x 3 x 5 

’T + 5 • 21 


3 + 5 ■ 


3! 


2 ! 


7 • 3! 


This expression for the value of the integral as a series of constants is exact, 
and can be used to obtain this value in decimal form to any desired degree of 
accuracy. 


Example 4 
of Section 


The inverse tangent series and the computation of tt. In Example 2 
14.3 we gave a rigorous proof that the expansion 

v 3 


tan 


x 5 

+- 

5 


A v—> 

y + • • • = 1 ( 

1 n =0 


-i r 


In + 1 


(13) 


is valid for — 1 < x < 1, but this proof gave no information about what happens 
at the endpoints x = ±1. Taylor’s formula is not much help here either, because 
it is very laborious to calculate the successive derivatives of the function/(x) = 
tan -1 x. To give a rigorous proof that (13) is actually valid on the closed inter¬ 
val -1 < x < 1, we adopt a completely different—and much more elementary 
—approach. 

We begin with the algebraic identity 
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-j “ = 1 + X + X 2 + • • • + *" 1 + —^ , X tM, 

\ - X \ - X 

which can easily be verified by multiplying both sides by \ — x. Replacing x 
by -t gives 


f-n n t n 


TT7 = \ - t + t 2 + ( - 1) n “ 1 r“ l +-j +r . 

with the restriction that t =£ — 1. When t is now replaced by f 2 , this becomes 


1 + t 2 


= 1 - t 2 + r 4 - 


+ (- 1)"- 1 , 2 "- 2 


+ - 


r-m 2/i 


l + r 2 


without restriction. By integrating both sides of this from t = 0 to t = x, we 
obtain 


tan 1 x = 


f x dt 

I o 1 + t 2 



. . • +(-l)»-i 


2 n - 1 


+ R n (x), 


(14) 


where 


Rn(x) = (-\r 


r* t 2n 

lo 1 + t 2 


dt. 


We now prove that R n (x) —> 0 as n —» ©o, provided only that -1 < x < 1. But 
this is easy, because the inequality 1 + t 2 > 1 allows us to write 


\RnW\ = 


fx t 2n 


fx 

-—~dt 

< 

t 2n dt 

Jo 1 + t 2 


Jo 


r \2n +1 


2A! + 


2 n + 1 


The fact that R n (x) —> 0 for the stated values of x enables us to conclude from 
(14) that 

tan -1 x = x - ^— • ■ •, — (15) 


Leibniz’s famous formula 



now follows at once from (15) on putting x = 1. In principle, Leibniz’s formula 
can be used for computing the numerical value of tt , but as a practical matter, 
the series converges so slowly that this method is of little value. A more efficient 
procedure is to use the formula 

-j = tan -1 j + tan -1 y, (16) 


and then to compute the two terms on the right by means of (15). [To establish 
(16), notice that if A = tan -1 7 and B = tan -1 y, then 


tan (A + B) = 


tan A + tan B 
1 - tan A tan B 



= L] 


However, most of the extended computations of tt have been based on the 
formula 


TT , 1 

— = 4 tan 1 — - tan 
4 5 



(IV) 
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which was discovered in 1706 by John Machin, a Scottish astronomer. We shall 
not pursue the details of these matters any further, but instead simply mention 
that 7 t has been computed by these methods to more than 500,000 decimal places, 
of which the first twenty are 

77 = 3.14159 26535 89793 23846 .... 

Further information can be found in P. Beckmann, A History of tt (Golem Press, 
1971), especially pp. 140-141 and 180-181.* 


Remark 1 Given a function f(x) that is infinitely differentiable in some interval 
containing the point x — 0, we have examined the possibility of expanding this 
function as a power series in x. More generally, if f(x) is infinitely differentiable 
in some interval containing the point x = a, we can ask about its possible ex¬ 
pansion as a power series in x — a. 


/« = X 


n =0 


a n (x — a) n = a 0 + a\(x — a) + a 2 (* ~ a) 2 -(-•••. 


(18) 


This problem is equivalent to the first, because (18) is the same as the series 
g(w) = cio + a\w + CI 2 W 2 + • • • , where w = x — a and g(w) = f{x ), and 
therefore no separate discussion is required Since g (n) (0) = f^ n \a), the Taylor 
series of f(x) in powers of x — a (or at x = a) is 


fix) = X 

n = 0 


f in \a) 

n\ 


(x - a) n 


= f(a) +f'{a)(x - a) + ^^ (x - a) 2 + • ■ ■ 


(19) 


Some writers refer to (3), which is the special case of (19) corresponding to 
a = 0, as Maclaurin’s series. However, this custom has no historical justification 
and is rapidly being abandoned. Whenever we use the phrase “Taylor series” 
without qualification, we always mean “Taylor series in powers of xf or at 
x - 0. 

The fundamental tool for proving the validity of the Taylor series expansion 
(19) for a specific function f(x) is a slightly expanded version of Taylor's for¬ 
mula with remainder as discussed above. This expanded version states that f(x) 
can be written in the form 

fix) = fid) + f'(a)(x - a) + (x - a) 2 + {x - a )3 

+ • • ■ + ix ~ a) n + R n ix), (20) 

n\ 


where the remainder R n (x) is given by the formula 

f{n+\ V \ 

Ux) = (n + 1}! U - a)*+l (21) 

and c is some number between a and x. And just as in the simpler but equiva¬ 
lent case expressed by (5), (6), and (7), the series on the right of (19) converges 


*More recently, tt has been computed to more than 100 million decimal places. For an account of 
this see the article on the Indian mathematical genius Srinivasa Ramanujan (1887-1920) in the Feb¬ 
ruary 1988 issue of Scientific American. 
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to/(;c) precisely when 

lim R n (x) = 0. (22) 

n —>» 

The polynomial part of (20) is called the nth-degree Taylor polynomial at x — a. 
We notice that the remainder term (21) is very similar in form to the terms in the 
Taylor polynomial, except that f^ n+l \x) is evaluated at c instead of a. All we can 
say about c is that it lies somewhere between a and x. The expression (21) for 
the remainder is often called Lagrange’s remainder formula* 

Example 5 Use (21) and (22) to prove the validity of the following Taylor se¬ 
ries expansion of f(x) = sin x about the point a — 7t/2: 



Solution We start by calculating the successive derivatives of f(x) = sin x and 
finding their values at a = tt!2 \ 


f(x) = sin x, 


f(x) = cos ;t, 
f'\x) = — sin x, 


f“'(x) = -COS X, 


/ (4) U) = sin X, 



sin — = 1; 
2 

77 n 

cos — = 0; 



II , 

-sin — = — 1; 

17 n 

— cos — = 0; 

• 77 i 

sin — = 1; 

2 


and so on. These results show that in this case the Taylor series (19) is just the 
series (23). We must now prove that this series converges to sin x for all x. But 
Lagrange’s formula gives 


|^U)| = 


/ (n+l) (c) / 
(n + 1)! 1* 



\x — tt/2|' 1+i 
(n + 1)! ’ 


because/ ( " +1> (c) is either ±sin c or ±cos c, and therefore |/ ( ” +1) (c)| ^ 1. By 
using (8) we now conclude that 


lim R n (x) = 0, 

n — 

and this proves the validity of (23). 


^Joseph Louis Lagrange (1736-1813) was the foremost French mathematician of the late eighteenth 
and early nineteenth centuries. In his early life he made outstanding discoveries in the calculus of 
variations, number theory, and analytical mechanics. His genius for generalization and analysis was 
most fully revealed in his great treatise Mecanique Analytique (1788). In this masterpiece he unified 
general mechanics and made of it, as the Irish scientist Sir William Hamilton later said, “a kind of 
scientific poem.” Among the enduring legacies of this work for the mathematical physics of our own 
time are Lagrange’s generalized coordinates, Lagrange’s equations of motion, and the concept of po¬ 
tential energy. [For some details about these matters, see pp. 526-532 of George F. Simmons, Dif¬ 
ferential Equations, 2nd ed. (McGraw-Hill, 1991).] His later life was shadowed by a profound melan¬ 
choly in which he lost his taste for mathematics and science and was sadly skeptical of the grandiose 
bureaucratic schemes of politicians for reforming human nature and relieving human misery. “If you 
wish to see the human mind truly great,” he said, “enter Newton’s study when he is decomposing 
sunlight with his prism or unveiling the system of the world with his mathematics.” 
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Remark 2 In Problem 42 of Section 12.3 we asked students to consider the func¬ 
tion fix) defined for all real numbers x by 


/(*) = 



jc* 0, 
x = 0. 


This function is continuous and has derivatives of all orders for all values of x. 
Furthermore, every derivative vanishes at x = 0, that is,/ (w) (0) = 0 for every pos¬ 
itive integer n. This means that the graph of the function (Fig. 14.2) is extremely 
flat at the origin—we might even say “infinitely flat.” For this function, Taylor’s 
formula (5) becomes 


f(x) = 0 + 0+ -- -+ 0 + R n (x). 

The Taylor series of the function is therefore the series 

0 + 0 + -- -+ 0 + -- -, 

which converges for every x but converges to fix) only for x = 0. Thus, even 
though a function has derivatives of all orders everywhere, it still is not neces¬ 
sarily represented by its Taylor series; and if we wish to establish the validity of 
such a representation, we must invoke solid additional arguments of some kind, 
as illustrated in the examples discussed above. 


Remark 3 We now give a proof of formula (7) for the remainder R n (x). First, 
we define a function S n (x) by writing 

R n {x) = S n (x)x n+] 


for x ^ 0. Next, we hold jc fixed and define a function Fit) for 0 < t < x (or 
jc < t < 0) by writing 



Fit) = fix) -fit) -f\t)ix 


(*-<>>- 


/ ( "V) 


(x - t) n - s„ux.t - ;) 


— A n + ' 


Equation (5) shows that F(0) = 0. Also, it is obvious that Fix) = 0. It follows 
that F\c) = 0 for some number c between 0 and jc.* By differentiating Fit) with 
respect to t, canceling, and replacing t by c, we get 


f(n-\ 

F\c) = - J — 


n (c) 


(x — c) n + S n (x)(n + 1)(jc — c) n = 0, 


SO 


S n (x) = 


/ (n + l) (c) 
(n+ 1)! 


and the proof of (7) is complete. 


*In words, if the graph of our function F{t) touches the f-axis at two points, then it must have a hor¬ 
izontal tangent somewhere in between. This inference rests on the Mean Value Theorem. 
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PROBLEMS 


1 Use (3) to find the Taylor series of each of the follow¬ 
ing functions, and then use (6) and (7) to prove that each 
of these expansions is valid for all x: 

(a) cos x; (b) e~ x \ (c) e 3x . 

2 Obtain the series (11) for cos x from the series (10) for 
sin x by integrating term by term. Hint: Remember that 
the indefinite integral on the left must equal zero when 
x = 0. 

3 Obtain the series (10) for sin x by differentiating the 
series (11) for cos x. 

4 Find the Taylor series expansion of each of the follow¬ 
ing functions (hint: any power series that converges to a 
function in an interval about x = 0 must be the Taylor 
series of that function): 

(a) x 2 e x \ (b) xe~ 3x ; 

(c) cos Vx; (d) x sin 5x; 

(e) sin x 2 ; 

(f) cos 2 x [hint: cos 2 x = y(l + cos 2x)]; 

(g) sin 2 x; (h) I - 7 sin 2 x; 

0) fW = for x =£ 0,/(0) = 1. 
x 

In Problems 5-9, use any method to obtain each of the given 
Taylor series expansions as far as indicated, without worry¬ 
ing about convergence. 

5 tan x = x + yx 3 + -jyx 5 -f- • • • . 

6 sec 2 x = 1 + x 2 + yx 4 + * * * . 

x 2 x 4 x^ 

7 In (cos x) = - — — — — • • • . 

8 In (1 + sin x) = x — yr 2 + \x 3 - -jyx 4 + • • • . 

9 In (1 + e x ) = In 2 + {x + yx 2 — ~^x 4 + • • • . 

10 Show that a polynomial P(x) = a 0 + a\x + a 2 * 2 + • * • + 
a n x n is its own Taylor series. 

11 If P(x) is a polynomial of degree n and a is any number, 
show that 


PM = P(a ) + ( x -a) + 


, P {n \a) 

+ -;-(x - a) n . 


(b) Observe that this series terminates and is a polyno¬ 
mial whenever p is a nonnegative integer, and only 
in this case. In all other cases, show that this series 
has radius of convergence R = 1. The fact that the 
expansion in (a) is valid for |x| < 1 is not easy to es¬ 
tablish by the methods of this section; a different 
type of proof is outlined in Additional Problem 9 at 
the end of this chapter. 

14 Use Problem 13 to write the Taylor series expansion of 
(1 - x 2 ) -1/2 , and integrate this to get the Taylor series 
for sin -1 x. 

15 Find power series representations for 


12 Find the Taylor series expansion of 

(a) 3x 2 - 5x + 7 in powers of x - 1; 

(b) x 3 in powers of x + 2. 

Check the expansions in (a) and (b) by using algebra. 

13 (a) Let p be an arbitrary constant and use (3) to obtain 

the binomial series 


(a) j 
(c) | 


dx\ 


(b) / VT 


+ x 3 dx\ 


dx 


VT+ x 4 ' 

16 Prove formula (17) by putting A = tan -1 y, B = 
tan -1 yk» an d computing successively tan 2A, tan 4A, 
and tan (4A — B). 

17 Calculate the numerical value of it from each of formu¬ 
las (16) and (17) by using the first four nonzero terms of 
the series (15) to calculate the inverse tangents. Round 
off your answers to five decimal places. 

18 Use Lagrange’s remainder formula to prove the validity 
of the following Taylor series expansions for all x: 


(a) sin x 




_iV5 .J-L-iV+ i.±f < -£' 


3! 


2 4! 


(b) cosx = |V2-jV2(x- jj-|V2- 

i/ -v* + !v5.j r L_zy + 


2! V 4 / 2 3! V 4 


2 4! V 4 


(c) sin X = y V3 + y - yj - 


y v HM) 2 -KM) 3 + 


(X+x)P=x+px + p(P-R x i 


ivT^- f r + ... ; 


p(p - D(P - 2 ) x 3 + .. 

3! 


pip - l)(p ~ 2) • • ■ (p - n + 1) 


x" + 


(d) cos x = -(* - y) + -jj- (x - -y' 


M-0 
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How are the values of sin x computed? In everyday work, we simply push a cou¬ 
ple of buttons on our calculator and the desired values are displayed. But where 
do the values come from that the engineers put into the calculator in the first 
COMPUTATIONS USING pi ace 9 j s there a little old man in the calculator factory with very sharp eyes, a 
1AY LfJK o rfJKl 1ULA SU p er b protractor, and a space-age-quality ruler who draws the angles with great 

care, measures the opposite side and hypotenuse, and then computes the sine? 
Not very likely. This method might give two or even three decimal places cor¬ 
rectly, but not the many decimal places available from a good calculator. 

In this section we describe a method for obtaining any value of sin x with great 
accuracy. More generally, we will show how to use the Taylor polynomials of 
Section 14.4 to find the values of such important functions as sin x, cos x, e x , 
and In x to as many decimal places as we please. And in doing this, we will freely 
use the great power of calculators for quickly performing simple but tedious arith¬ 
metical calculations. 



THE USE OF LAGRANGE’S REMAINDER FORMULA 
In the previous section we proved that for all x , 

X 2 JC 3 x n 

In particular, if we put x = 1 we obtain the following familiar formula for e as 
the sum of an infinite series: 


e ' l + l+ i[ + 3r + 


This means that e can be approximated to any degree of accuracy by using a suit¬ 
able partial sum, 




(1) 


n 

, , i i i 

+ + 2\ + + + ni 

0 

1.000000000 

1 

2.000000000 

2 

2.500000000 

3 

2.666666667 

4 

2.708333334 

5 

2.716666667 

9 

2.718281526 

10 

2.718281801 


The value of n in this sum is at our disposal: the larger we take n y the more 
accurate the approximation will be. This is shown in the adjoining table, where 
we have used (1) to approximate e for various values of n. The value of e to nine- 
decimal-place accuracy is 2.718281828. This level of accuracy is achieved with 
n = 12. 

Now suppose we want to approximate e to a previously specified level of ac¬ 
curacy. It would then be important to determine how large the value of n should 
be to guarantee the desired accuracy. For example, how large should n be to guar¬ 
antee an error that is less than 0.000005? Lagrange’s remainder formula is the 
basic tool for answering such questions. 

According to Lagrange’s formula (21) in Section 14.4, if f(x) is approximated 
by its /ith-degree Taylor polynomial at x = a, then the absolute value of the er¬ 
ror R n (x) is 


\Rn(x)\ 


/ ( " +l) (c) 

(«+ D! 


(x - a) n+l 


in + D! 


( 2 ) 


In this formula, c is an unknown number between a and x, so the value of 
f (n+l \c) usually cannot be determined exactly. However, we saw in Section 14.4 
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that it is often possible to find an upper bound for |/ (n+1) (c)|, that is, to find a 
number M such that |/ (n+1) (c)| — M. In such a case it follows from (2) that 

k-«l n+1 . (3) 

so we have a convenient upper bound for the magnitude of the error R n (x). In the 
following examples we give several applications of these ideas, but first we in¬ 
troduce some terminology intended to clarify the concept of the accuracy of ap¬ 
proximations. 

We say that an approximation is accurate to n decimal places if the magni¬ 
tude of the error is less than 0.5 X 10 _n . This means that the largest possible 
error cannot change the figure in the / 2 th decimal place. The following table will 
be helpful for understanding this idea. 


Level of accuracy 

Magnirude o f error 
is less than 

1 decimal place 

2 decimal places 

3 decimal places 

4 decimal places 

0.05 = 0.5 X 10 _1 
0.005 = 0.5 X 10~ 2 
0.0005 = 0.5 X 10“ 3 
0.00005 = 0.5 X 10" 4 


Example 1 Use (1) to find an approximation for e that is accurate to four dec¬ 
imal places. 

Solution By Taylor’s formula with remainder we have 

y 2 yH pC 

e x = \ + x + — —r + ■ 


n\ (n + 1)! 
where c is between 0 and x. For ;t = 1 this gives 


e - 1 + 1 + T + 


+ T + . 


n\ (/2 + 1)! 


where c is between 0 and 1. Thus the magnitude of the error in the approxima¬ 
tion (1) is 


Rn = 


(n + 1)! 


(n + 1)!’ 


where 0 < c < 1. Even though we do not know the exact value of c, neverthe¬ 
less c < 1 tells us that e c < e 1 = e, so 


\Rn\< 


(n+ 1)! 


(4) 


Furthermore, even though we do not know the exact value of e (this is what 
we are trying to approximate!), we do know that e < 3. This allows us to replace 
(4) by 


Rn < 


(n+ 1)! 


( 5 ) 


We are now on solid ground: If we choose n so that 
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\R„\ < ~ * 1} , < 0.5 X 10- 4 = 0.00005, 


( 6 ) 


then the approximation (1) is guaranteed to be accurate to four decimal places. 
A suitable value of n can be found by trial and error by using a calculator. Thus, 
we can calculate 3 l(n + 1)! for n = 0, 1, 2,. . . until we find a value of n satis¬ 
fying (6). For n = 7 and n — 8 we have 


(7 + D! 


= 0.000074 


and 


(8 + 1 )! 


= 0.000008, 


so n = 8 is the first positive integer satisfying (6). Therefore, to four-decimal- 
place accuracy we have 

" Sl + l+ l! + 3! + i[ + ^ + i + 7! + i!- 


which a calculator shows is approximately 2.71827876985. We conclude that 
e = 2.7183 to four-decimal-place accuracy. 


APPROXIMATING SINES AND COSINES 


Example 2 Use the Taylor series for sin x at a = 0 to approximate sin 5° to five- 
decimal-place accuracy. 


Solution In the Taylor series 

sin , = ,_|_ + |i_^ + ... j 

we remind the student that the angle x is assumed to be measured in radians. 
Since 5° = tt/36 radians, we have 


Co 77 7T (7T/36) 3 (7T/36) 5 

Sin5 =sm 36 = 36 -3l ^ 5! 


(7r/36) 7 

7! 


+ • • 


(7) 


We must now decide how many terms of this series to keep in order to guaran¬ 
tee five-decimal-place accuracy. 

If we write f(x) = sin x y then by (2) the magnitude of the error committed when 
sin x is approximated by its /ith-degree Taylor polynomial is 


(w+1)! M . 

where c is some number between 0 and x. Since f^ l \c) is either ±sin c or 
±cos c, it is clear that |/ (/7+1) (c)| < 1, and therefore 


r in+1 


\Rn\ 


(« + 1 )!' 


In particular, if x = tt/36 then 


\Rn\^ 


( 7 r/36) n + 1 
(n+ 1)! 


This shows that we will achieve five-decimal-place accuracy if 
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f 77-/361>" + 1 

V C. < 0.5 X 10 -5 = 0.000005. (8) 

(n + 1)! 

By experimenting with the help of a calculator, we find that for n = 2 and n = 3 
we have 

= 0.0001 and = 0.000002, 

so n = 3 is the first positive integer satisfying (8). This shows that in (7) we only 
need to keep terms up to the third power to guarantee five-decimal-place accu¬ 
racy. We can therefore write 

sin 5 0 = -£ - —= 0.08716, 

36 3! 

in full confidence that five decimal places are correct. 


To approximate the value of a function f(x ) at a point xq by using a Taylor se¬ 
ries in x — a, two factors must be kept in mind when choosing the location of 
the point a. First, it should be fairly easy to evaluate /(;t) and its derivatives at a, 
since these values are necessary for constructing the Taylor series. And second, 
a should be as close as possible tox 0 , since the series will converge more rapidly 
when xo — a is small; that is, fewer terms will be required in a partial sum to ob¬ 
tain a specified level of accuracy. 

Example 3 Approximate cos 93° to six-decimal-place accuracy. 


Solution We choose a = 7r/2 ( = 90°), because cos x and its derivatives are easy 
to evaluate at this point and this value of a is quite close to the point x 0 = 93° = 
jfiTi t where we want to make the approximation. From Problem 18(d) in the pre¬ 
vious section, the Taylor series for cos x at a = 7t/ 2 is 


cos x = 



For x = 77 this becomes 



cos 93° 


31 

COS 60 7r = 


_JL + _L 

60 3! 



5! 



+ • • • . 


(9) 


We must now decide how many terms of this series to keep for six-decimal-place 
accuracy. 

If we write f(x) = cos x, then the magnitude of the error committed in ap¬ 
proximating cos x by its /ith-degree Taylor polynomial at a = 7t/ 2 is 


l/ ( " +1) (c)| 


(n+ 1)! 


X 


77 
2 


n + \ 


where c is some number between tt/2 and x. Just as in Example 2 we have 
|/(' l+1 )(c)| < 1, so 


In particular, if x = 77 then 
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(77/6or +i 
(* + 1 )! • 


We will have six-decimal-place accuracy if 

— J — < 0.5 x 10" 6 = 0.0000005. 
(n + I)! 


For the cases n = 2 and n = 3, our calculator tells us that 


(7t/60) 3 

3! 


= 0.00002 


and 


(7r/ , 6 ,° )4 = 0.0000003, 
4! 


( 10 ) 


so /i = 3 is the first positive integer that satisfies (10). Thus, in (9) we only need 
to keep terms up to the third power to be sure of six-decimal-place accuracy: 

cos93 ‘--^ + ir(^) > “-° 052336 - 


APPROXIMATING LOGARITHMS 


The approximation of natural logarithms begins with the familiar series 


y-2 v-3 v-4 

ln(l+*) = *- y + y- y + 


(ID 


which is valid for — 1 < x < 1 (Example 1 in Section 14.3). This series itself is 
of little value for computation because it converges so slowly. However, if we 
replace x by —x we obtain 


X 2 X 3 X 4 

In (1 — x) = —•* j 3 4 

for —1 < x < 1, and by subtracting (12) from (11) we get 
'1 + *' 


In 


. . x 3 x 5 x 1 

= 2 | * + t + t + t + 


( 12 ) 


(13) 


for — 1 < x < 1. This series can be used for computing the natural logarithm of 
any positive number y by putting 



Figure 14.3 


or equivalently 


1 + x 



(14) 


(15) 


If we sketch the graph of the function (14) [Fig. 14.3], we see at once that y > 0 
corresponds to — 1 < jc < 1. Thus, for example, to compute In 2 we put y = 2 in 
(15), which gives x — Substituting this value in (13) yields 


In 2 = 2 


T 1 
3 + 3 


3 + i 

3/ 5 


,35 + i4V + 


If we now calculate the sum of the four terms shown here and round off to five 
decimal places, we get 

In 2 = 0.69313, 


which compares favorably with the calculator value In 2 = 0.69314718056. 
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PROBLEMS 

It is to be understood that these are all calculator problems, 
so we will not repeat the symbol H) that signals this fact. 

1 Use the inequality (5) to find a value of n that guaran¬ 
tees that (1) will approximate e to 

(a) five-decimal-place accuracy; 

(b) ten-decimal-place accuracy. 

2 Use jc = -1 in the Taylor series for e x at a = 0 to ap¬ 
proximate Me to four-decimal-place accuracy. 

3 Use x = { in the Taylor series for e x at a = 0 to ap¬ 
proximate Ve to five-decimal-place accuracy. 

4 Use the Taylor series for sin jc at a = 0 to approximate 
sin 3° to six-decimal-place accuracy. 

5 Use the Taylor series for cos jc at a = 0 to approximate 
cos (7 t/ 15) to six-decimal-place accuracy. 

6 Use an appropriate Taylor series for sin jc to approximate 
sin 84° to six-decimal-place accuracy. 

7 Use an appropriate Taylor series for cos jc to approximate 
cos 32° to six-decimal-place accuracy. 

8 Use an appropriate Taylor series for sin jc to approximate 
sin 50° to six-decimal-place accuracy. 

9 Find the lowest-degree Taylor polynomial for sin jc at 
<3 = 0 that guarantees five-place-accuracy on the interval 
|jc| ^ 7T/4. 

10 If R n is the nth remainder for sin jc at a = 0, show that 
(a) \R 3 \ < 0.5 X 1CT 5 for |jc| < 0.22 = 12.6°; 


(b) \R 5 \ <0.5 x 10" 5 

for 

|jc| < 0.59 = 33.8°; 

(c) \R 7 \ < 0.5 X 10“ 5 

for 

|x| < 1.06 s 60.7°; 

(d) \R 9 \ <0.5 X 10- 5 

for 

H < 1.61 =92.2°. 

If R n is the rcth remainder for cos jc at <3 = 0, show that 

(a) |/e 2 | < 0.5 X 10~ 5 

for 

|*| <0.10 = 5.7°; 

(b) \R 4 \ <0.5 X 10“ 5 

for 

|*| < 0.39 = 22.3°; 

(c) \R 6 \ <0.5 X 10“ 5 

for 

|*| < 0.81 s 46.4°; 

(d) \R g \ <0.5 X 10- 5 

for 

|*| < 1.33 3 76.2°. 


12 Use the first two nonzero terms of the series (13) to ap¬ 
proximate In 1.25. Round off your answer to four deci¬ 
mal places. Repeat the calculation by using the first four 
nonzero terms of the series. 

13 Repeat the preceding problem for In 3 by using the first 
two, four, five, and six nonzero terms of the series, each 
time rounding off your answer to four decimal places. 

14 Find an interval o f values centered o n x = 0 within which 

x 3 

sin x can be approximated by jc —with four-decimal- 


piuvv UVVM1 UVJ U11VVVVI« A 111U XX 

val for approximating sin jc by jc 


X ^ 

3~ + 5~r with the 


same accuracy. 

15 Find an upper bound for the error in the approximation 
of cos jc by 1 — — + — on the interval |jc| < 0.4. 


One very large part of mathematics where power series can be put to effective 
use is in the solution of differential equations. We hinted at this in Section 13.1, 
and now we explore the possibilities a little more fully. 

Example 1 We begin by considering the equation 

/=* 0) 


14.6 

APPLICATIONS TO 

DIFFERENTIAL 

EQUATIONS 


which we discussed in an informal way in Section 13.1. We assume that this 
equation has a power series solution of the form 

y = ao + a\x + a 2 x 2 + • • • + a„x n + • • • (2) 

that converges for |;c| < R with R > 0. We know from Section 14.3 that a power 
series can be differentiated term by term in its interval of convergence, so 

y' — a\ + 2a 2 x + 3«3jc 2 +••• + («+ \)a n +\x n + • • • . (3) 

If two power series are equal on an interval, the ideas of Section 14.4 tell us that 
the corresponding coefficients must also be equal. Therefore, since y' = y, the 
series (2) and (3) must have equal coefficients: 

ci\ qq, la 2 ci i, 3c?3 q 2 i ..., (Ji *F .... 

These equations enable us to express each a n in terms of a$\ 




ftp 

2-3’ 




a l ~ 0p» 


a, 
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When these coefficients are inserted in (2), we obtain our power series solution, 


y = <*o 1 + x + — 


2! 3! 


+ •••+ — + 


(4) 


where no condition is imposed on a q. It is important to understand that so far 
this solution is only tentative, because we have no guarantee that (1) actually has 
a power series solution of the form (2). The above argument shows only that if 
(1) has such a solution, then that solution must be (4). However, it follows at 
once from the ratio test that the series in (4) converges for all x, so the term-by- 
term differentiation is valid and (4) really is a solution of (1). In this case we 
easily recognize the series in (4) as the power series expansion of e x , so (4) can 
be written as 


y = aoe x - 

Needless to say, we can get this solution directly from (1) by separating vari¬ 
ables and integrating. Nevertheless, it is important to realize that (4) would still 
be a perfectly respectable solution even if (1) were unsolvable by elementary 
methods and the series in (4) could not be recognized as the expansion of a fa¬ 
miliar function. 


Example 2 We next attack the second-order equation 

y" + y = o (5) 

by the same method. That is, we assume the equation has a solution in the form 
of a power series 

y = ao + a\x 4- a 2 x 2 + a 3 x 3 + • • ■ = X a n x n (6) 

n = o 

with radius of convergence R > 0. It is permissible to differentiate this series 
term by term, so 

y = a l + 2a 2 x + 3 a 3 x 2 + • • • = X na„x n ~ l 

n= 1 

and 

y" = 2a 2 + 2 • 3 a 3 x + 3 • Aa^x 2 + • * • = X (n — 1 )na n x n ~ 2 . 

n = 2 

In order to combine the series for y and y” more easily, we write y" as follows: 

/ = X + W" + 2)a n+2 x n . (7) 

n= 0 

In effect, we replace n in the first formula for y" by n + 2 and start the summa¬ 
tion at 0 instead of 2. Now, adding the series (6) and (7) in accordance with the 
differential equation (5), we obtain 

X K w + !)(" + 2)a n+ 2 + a n ]x n = 0. (8) 

n = 0 

By thinking of the right side of this as a power series whose coefficients are all 
zero, we see that all the coefficients on the left must also be zero: 
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(n + l)(/i + 2 )fl^ 4-2 + a n — 0 


or 


0 / 1+2 


(n+ l)(n + 2)' 


(9) 


where n = 0, 1, 2, 3, ... . This is called the recursion formula for the coefficients 
a n , because it enables us to calculate them step by step starting with a q. We can 
see more clearly what this means if we write these equations out as follows: 


n= 0, 
n = 1, 
n = 2, 
« = 3, 
n =4, 


02 : 


03 = 


«4 = - 


0Q 

1 • 2 

01 

2 • 3 
a 2 


ao. 

2 ! ’ 

0j_. 
3! ’ 


0o . 


a 5 = 


3-4 4!’ 


«3 


__ 0j_. 

4-5 5! ’ 


06 : 


04 

5 • 6 


0o. 
6 ! ’ 


n = 5, a-j = 
and so on. The emerging pattern is now clear: 


05 _ 0j_. 

6*7 7!’ 


for even coefficients, 


for odd coefficients, 


02n = (-l)" 
02n + l =(-1)" 


0Q 

( 20 )!’ 

0i 

(In + 1)!' 



By putting these coefficients back into the power series (6), we can write the so¬ 
lution of (5) as 


ao 2 01 3 , 00 4 I 01 5 00 6 01 7 

y = ao + a,* " - 37^ + - -^x 6 - —x 1 



' * 2 X 4 X 6 


X 3 X 5 X 7 , 

00 

” 2! + 4[ " 6! + " ' 

+ 01 

x ~3! + jr~ir + "\ 


Again, all we have shown is that if (5) has a solution of the form (6), then that 
solution must be (10). And again, since both series in (10) converge by the ratio 
test, term-by-term differentiation of these series is valid and (10) really is a so¬ 
lution of (5), without any restrictions on the constants ao and a x . In particular, 
we see that each of the bracketed series in (10) taken individually is a solution 
of (5), by first putting ao — 1 and a\ — 0, and then ao = 0 and a\ = 1. 


In this problem luck is on our side, because we recognize the two series in 
(10) as the Taylor series expansions of cos x and sin x. This solution of (5) is 
therefore the general solution that we already know from our earlier work, namely, 
y — ao cos x + a\ sin x> where ao and a\ are arbitrary constants.* 


*Problem 19 in Section 9.2. 
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However, in most cases we won’t be this lucky. Our solution will be a power 
series, but only rarely will we be able to recognize it as the power series expan¬ 
sion of any function that we know. The student should realize that we have at¬ 
tached names to only a few of the infinite number of functions that exist out there 
in the universe of mathematics, and it is extremely unlikely that any given dif¬ 
ferential equation will have solutions that can be expressed in terms of these fa¬ 
miliar named functions. Our next example provides an illustration of this. 

Example 3 The differential equation 

xy" + y' + xy = 0 (11) 

has applications in mathematical physics that justify giving it serious special at¬ 
tention. To solve it, we assume a power series solution 

y = a 0 + + <3 2 * 2 + a^x 3 + * ' ‘ = X QnXn (12) 

n = 0 

with positive radius of convergence. Then 

y' = a i + 2a 2 x + 3<33_* 2 + • • • = ^ na„x n ~ l 

n = 1 

and 

y" = 2.a 2 + 2 • 3<3 3 * + 3 • 4<34* 2 + • ' • = X — 1 )na„x n ~ 2 . 

n = 2 

The terms on the left side of (11) can now be written 

xy” = 2a 2 x + 2 • 3<3 3 * 2 + 3 • 4<3<i* 3 + • • • = ^ (n - 1 )na n x n ~\ 

n — 2 

y = <3i + 2a 2 x + 3<3 3 * 2 + • • • = ^ na n x n ~ x 

n = 1 

= <3j + ^ na n x n ~ l , 

n = 2 

and 


xy = Q[)X + a\x 2 + <3 2 * 3 + ' ' ' = X a n x ' l+i 

n = 0 

n = 2 

If we add these three series in accordance with the differential equation (11), the 
result is 

<31 + ^ [(« — \)na n + na tl + a n - 2 \x n ~ ] = 0. 

n—2 

All the coefficients on the left must be zero, so a\ =0 and (with a slight sim¬ 
plification) we have the recursion formula 

n 2 a„ + a n —2 = 0 


or 
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a n 


dn — 2 


n 


2 


(13) 


for n > 2. Since *] = 0, repeated application of (13) tells us that a n — 0 for every 
odd subscript. We now calculate the nonzero coefficients n 2n as follows: 


ao, 


a 2 = 


do 

2 2 ’ 


d 4 = 


02 

4 2 


do 


a *--& 


2 2 ■ 4 2 ' 

dQ 


do 


2 2 • 4 2 • 6 2 ’ 

since 2 2 • 4 2 • 6 2 • • • {In) 2 = (2 • 1) 2 (2 • 2) 2 (2 • 3) 2 • • • 
this information we can now write the solution (12) as 

v 6 


a 2n ( 1 )" 2 2«(„ ! )2’ 

(2 • n) 2 = 2 2, '(n!) 2 . With 


y = do 


l 


;t 2 x 4 
2 2 + 2 2 • 4 2 


2 2.42. 6 2 


= a„X (-D n 

n=0 


y-2/I 


2 2n (n\) 2 



This series is very similar to the cosine series in (10). It can be obtained from 
the cosine series by replacing each odd factor in the denominators by the next 
larger even number. Its sum is a useful special function that is studied in ad¬ 
vanced mathematics. This function is denoted by Jo(x) and called the Bessel 
function of order 0 [see Problems 29(b) in Section 14.2 and 5 in Section 14.3].* 


'Friedrich Wilhelm Bessel (1784-1846) was a famous German astronomer and an intimate friend of 
the great mathematician Gauss, with whom he corresponded for many years. He was the first man 
to determine accurately the distance of a fixed star: his parallax measurement of 1838 yielded a dis¬ 
tance for the star 61 Cygni of 11 light-years, or about 360,000 times the diameter of the earth’s or¬ 
bit. In 1844 he discovered that Sirius, the brightest star in the sky, has a traveling companion and is 
therefore what is now known as a binary star. This Companion of Sirius, with the size of a planet 
but the mass of a star, and consequently a density many thousands of times the density of water, is 
one of the most interesting objects in the universe. It was the first white dwarf star to be discovered 
(see Section 5.5) and occupies a special place in modem theories of stellar evolution. 


PROBLEMS 

1 Consider the following differential equations: 

(a) / = 2xy\ 

(b) / + y = 1. 

In each case, find a power series solution of the form 
'Za n x n , try to recognize the resulting series as the expan¬ 
sion of a familiar function, and verify your conclusion by 
solving the equation directly. 

2 Consider the following differential equations: 

(a) = y\ 

(b) x 2 / = y. 

In each case, find a power series solution of the form 
'Za n x n , solve the equation directly, and explain any dis¬ 
crepancies that arise. 


3 The differential equations considered in the text and pre¬ 
ceding problems are all linear, which means essentially 
that the dependent variable y and its derivatives occur only 
to the first power. The equation 

/ = l + y 2 (*) 

is nonlinear, and it is easy to see directly that y = ton x is 
the particular solution for which y( 0) = 0. Show that 

tan x = x + jjc 3 + -jyjc 5 + • • • 

by assuming a solution for equation (*) in the form of a 
power series 'Za n x n and finding the af s in two ways: 
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(a) by the method of the examples in the text (note par¬ 
ticularly how the nonlinearity of the equation compli¬ 
cates the formulas); 

(b) by differentiating equation (*) repeatedly to obtain 

y" - 2yy\ y'" = 2yy" + 2(y') 2 , 
and using the formula a n = 


4 Solve the equation 

/ =x-y, y( 0) = o 

by each of the methods suggested in Problem 3. What fa¬ 
miliar function does the resulting series represent? 

5 Find a power series solution of xy" — y = 0. For what x 
does this series converge? 


14.7 

(OPTIONAL) 
OPERATIONS ON 
POWER SERIES 


In Section 14.4 we obtained Taylor series for various functions by using the for¬ 
mula a n = f (n \0)/nl to find the coefficients. But computing successive deriva¬ 
tives can be difficult and discouraging work if no simple pattern emerges. We 
can easily appreciate this fact by finding the seventh derivative of such functions 
as tan jc or jc 5 /( 1 — jc 4 ), because the calculations visibly sink us deeper into the 
bog with every step. For some functions we were able to establish the validity 
of their Taylor expansions by proving that R n (x) — > 0 as n —> oo, but this also can 
be difficult. To avoid such problems, we now discuss several algebraic methods 
for obtaining valid new Taylor expansions from ones that are already known. 

Before we begin, we remind students that power series expansions are unique. 
This means that if a function /(jc) can be expressed as the sum of a power series 
by any method , then this series must be the Taylor series of /(jc). For example, 
we know from Section 13.3 that 


-j-= 1 +X + X 2 +X 3 + ■ ■ ■, |*| < 1; (1) 

\ ~ X 

and by first replacing jc by x 4 and then multiplying through by jc 5 , we find that 

-pr^r = i +* 4 + * 8 +* ,2 + • • ■. M<i. 

and 

j^- T =X 5 +X 9 +X 13 +X 17 + ■ ■ ■ , |*| d. 

1 — X 4 


We have deliberately avoided the very laborious task of using the formula a n = 
f (n \0)/n\ to verify that the three series on the right are actually the Taylor series 
of the functions on the left. But these verifications aren’t necessary, because this 
conclusion follows automatically from the uniqueness principle stated above. 


MULTIPLICATION 


Suppose we are given two power series expansions, 

/M = ^ a n X n = Qq + d\X + d2X 2 + a^x 3 + • ■ ■ (2) 


and 


g(x) = £ b n x n = bo 4- b\x + bix 2 + b^x 3 + • • • , (3) 


both valid on an interval |jc| < R. If we ignore the question of convergence for a 
moment, then we can multiply these series in the same way we multiply two 
polynomials. That is, we systematically multiply each term of the first series into 
all the terms of the second series and then collect terms involving the same power 
of x. First, the term-by-term multiplication— 
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a 0 bo + a 0 b\x + a 0 b 2 x 2 + aob^x 3 + • • 
a\b$x + a\b\x 2 + a\b 2 x 3 + ■ ■ * 
a 2 bQX 2 + a 2 b\x 3 + • • • 
a^box 3 + ■ ■ ■ 


By adding these columns, we obtain the power series 
a 0 bo + ( a 0 b\ + a\b 0 )x + (a 0 b 2 + aib\ + a 2 bo)x 2 


+ ( a 0 b 3 + a\b 2 + a 2 b\ + a 3 /?o)^ 3 + ■ * • . (4) 


The form of the coefficient of x n in (4) is evident: The subscripts of the o 's in¬ 
crease as the subscripts of the b's decrease, and their sum remains constant and 
equals the exponent n on x n . Briefly, we have multiplied (2) and (3) to obtain 



(5) 


We assert that this product of the series (2) and (3) actually converges on the in¬ 
terval \x\ < R to the product of the functions/( jc) and g(x), as indicated by (5). 
The proof is not easy, and depends on the absolute convergence of the two se¬ 
ries in the given interval. The details can be found in Appendix A.13. 

Example 1 Find the Taylor series for e x sin jc. 

Solution We know that 



(6) 


and 



(7) 


Our work can be arranged as follows: 

/ x 2 

e x sin jc = ( 1 + jc + — + 



= JC 




+ 


2 



= jc + jc 2 + yjc 3 + • • • . 


Since the two given series (6) and (7) converge for all jc, the same is true of the 
product series. It is rarely easy—and usually quite impossible—to recognize the 
formula for the general term of the product series in this process. 
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Example 2 Find the Taylor series for [In (1 - x)]/(x - 1). 


Solution We know that 


In (1 - 



so 


In (1 — x) 
x - 1 



[-In (1 - *)] 



— x + (1 + y)x 2 + (1 + 1 + y)x 3 + ■ ■ ■ 



This is one of the rare cases where the general term of the product series is easy 
to recognize. 


DIVISION 

Two power series can be divided by the long division process that is used in el¬ 
ementary algebra for dividing polynomials. Since we are working with power se¬ 
ries, the terms are of course arranged in order of increasing exponents, instead 
of in order of decreasing exponents as is usual with polynomials. In particular 
cases, however, one or both of the given series may be a polynomial. 

Example 3 Find the Taylor series for tan x by dividing the series for sin x by 
the series for cos x. 

Solution We have 


1 i 2 s 

X + 3 * 3 + ‘ ‘ 




SO 


tan x = x 4- + ^ x 5 + • • • . 


( 8 ) 


It can be shown that this expansion is valid on the interval \x\ < tt/ 2. Since this 
is the largest interval with center* = Oon which the denominator cos * is nonzero, 
the series (8) has radius of convergence R = ir!2. The problem of discovering 
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a formula for the general term of this series was solved by Euler in 1748. His 
solution depends on the ideas in the next section and is outlined in the Appen¬ 
dix at the end of this chapter. 


The actual process of dividing one power series by another is clearly not very 
difficult. The theory that justifies this process is given in Appendix A. 16. 


SUBSTITUTION 

This is a method we have already used: If a power series 

f(x) = a 0 + a x x + a 2 x 2 + ■ ■ • (9) 

converges for \x\ < R and if \g(x)\ < R , then we can certainly find/[g(jc)] by sub¬ 
stituting g(jc) for jc in (9). For example, there is no problem in using the series 
(1), (6), and (7) to obtain 

irW = I — (—2 jc 2 ) = 1 + ( ” 2 * 2) + ( " 2 * 2)2 + " ' 


= 1 -2x 2 +4x 4 - ■ ■ •, |2jc 2 | < 1; 


and 


1 + x* + ^ + ^ + 


= 1 + X 4 + 


2! 3! 

2! 3! 


all x; 


. , „ (3*) 3 , (3x) 5 

sin 3x = 3x -—-h 


3! 


5! 

243 

5! 


r5 - . . 


all x. 


In these examples we have substituted the simple functions g(jc) = — 2 jc 2 , jc 4 , and 
3x into appropriate power series, but much more is possible. Under suitable con¬ 
ditions, we can actually substitute one power series into another! Thus, suppose 
that the function g(jt) is given by a power series, 

gU) = b 0 + b\x + b 2 x 2 + • • *, (10) 

and substitute this entire series for jc in (9), 

/[#(*)] = 0o + aig(x) + a 2 [g(x)] 2 + ■ ■ ■ 

= a o + Q\[bo + b\x +•••]+ a 2 [bo + b\x + • • -] 2 + ■ • • . (11) 

Again, this is perfectly legitimate as long as \g(x)\ < R. However, the series on 
the right of (11) can now be converted into a power series in jc by squaring, cub¬ 
ing, etc., and collecting like powers of jc, and it can be proved that the power 
series formed in this way converges to f[g(x)] whenever (10) is absolutely con¬ 
vergent and |g(jc)| < R.* 


*See p. 180 of K. Knopp, Theory and Application of Infinite Series (Hafner, 1951). 
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Example 4 Apply the method of substitution to find the Taylor series of e sin * 
up to the term containing x 4 . 

Solution We can use (6) and (7) to write 


> sin * = 1 + (jc - — + 


+ !u- 


+ 7 U 


■ + * 


+ T4 [X ~^ + 


~ 1+ , *“T + 


+ 7 * 2 “ 7 + * * * + 7 U 3 + " 0 




= 1 +JC + -JC 2 --JC 4 + • • • 
2 8 


If we try to apply this method to e cosx , we find that infinitely many terms con¬ 
tribute to the formation of each coefficient, which is a difficult situation to deal 
with. For this reason, the method of substitution is not a practical tool unless 
bo = 0 in the series (10). 

EVEN AND ODD FUNCTIONS 

A function f(x ) defined on an interval \x\ < R is said to be even if/(— x) = /(x), 
and odd if /(— x) — — f(x). The Taylor series of the even function cos x contains 
only even powers of x , and the Taylor series of the odd function sin x contains 
only odd powers of x. These facts are special cases of a general principle: If f(x) 
is an even function, then its Taylor series has the form 

ao + d2X 2 + a±x 4 + a^x 6 + • * • ; 

and if f(x ) is an odd function, then its Taylor series has the form 

d\X + d^X 3 + d$X 5 + diX 1 + • • •. 

That is, the Taylor series of an even (odd) function contains only even (odd) 
powers of x. This is very easy to prove from the uniqueness of power series ex¬ 
pansions.* As another example of this phenomenon, we know beforehand that 
the series on the right side of (8) contains only odd powers of x, because tan x 
is an odd function. 

Many functions are even and many are odd, but most are neither. However, 
every function/(x) defined on an interval \x\ < R can be expressed as the sum of 
an even function and an odd function: 

/W = il/W +/(-*)] + i[/W -/(“*)] = g(x) + h(x), 

where—as is easily verified— 


*We have only to point out that if/(jc) = 2a n x n is even, then 2a, t x n = 2(-l ) n a n x n , so by unique¬ 
ness we have a n = (- 1 ) n a n , and therefore a n = -a n or a n = 0 if n is odd. Similar reasoning applies 
if f{x) is odd. 
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g(x) = {[fix)+f(-x)] is even 

and 

h(x) = y[/W - /(-*)] is odd. 

Further, if f(x) = S^=o a n x n , then 

g(x) = ^a 2n x 2n and h(x) = '^a 2n +\x 2n+l . 

n =0 n—0 


Thus, the Taylor series of /(jc) splits into two power series, one with even expo¬ 
nents representing the even part of /(jc) and one with odd exponents represent¬ 
ing the odd part. 


Example 5 The even part and the odd part of 


f(x) = e * = \ + x h-+-1-+ — + ... 

JK } 2 ! V 4 » 


4! 5! 


are 


and 


, e x + e x x 2 x* 

8{X) ~ -2- ' + 2!~ 4! 


o* _ 0-X 




These are the two hyperbolic functions cosh x and sinh x that were defined in 
Section 9.7. 


Example 6 Find the sum of the series 



Solution This is the even part of the familiar series 

X 2 X 2 JC 4 

f(x) = -In (1 - jc) = jc + — + — + — + ••• . 

The sum of the given series is therefore 

|[/W +/(-*)] = {[-In (1 - x) -In (1 + jc)] 

= In (1 — jc 2 ) = — In Vl — x 2 . 


PROBLEMS 


1 In Example 1, continue the calculation and find the terms 
of the product series as far as the term containing jc 6 . 

In Problems 2-13, use multiplication to show that the given 
function has the indicated power series expansion. 


2 

3 

4 


sin jc 

1 - JC 


JC + JC 2 



+ 5,4 + 1 
6 1 


gjc+jc^ _ | _|_ £ _|_ lj£ 2 _|_ 23 _|_ 25 ^4 

e x cos * = 1 + x - yx 3 - fr 4 - yy* 5 + • • ■ . 


O CM 
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tan 1 x 


13 


~ ■ = a + x 2 + -x 3 + -x 4 + —x 5 + ■ ■ ■ . 

1 — a 3 3 15 

6 In 2 (1 - x) = x 2 + x 3 + -Hjc 4 + fx 5 + ■ • ■ . 

- cos x , , , 1 , ,1 , , 13 . , 13 . , 

1 - X 2 2 24 24 

8 tan 2 x = x 2 + jx 4 + £x 6 + • • • . 

9 — = — + * a : + — x 2 + —x 3 + — x 4 + • • • 

2 +x 2 4* 8 48 96 


10 Vl +x In (1 +x) = . 


J_ v 3 _i_ _L v 4 _ 71 s 

4 X “I" 24-* - 


Jt 3 + 


11 e * tan x = x - x 2 + f x 3 - yx 4 + j^x 5 + ■ ■ ■. 


12 


1 + x+x 2 


X + X 3 


k 4 +x 6 


+ X 9 -X 10 + 


, „ sin x 0 5 , 5 , 10 1 c 

13 --- jc — jc 2 + -a 3 — — jc' 4 + 7 — a 5 + • • • . 

1 + a 6 6 120 

14 By squaring the series for sin a and cos jc, show that 
sin 2 x + cos 2 jc = 1 , at least as far as the jc 6 term. 

15 If/(jc) = ^=oa n x n , use multiplication of series to show 
that 


T~Z /W “X ( fl 0 + a \ + ' ' ' + a n)x n - 

1 X n = 0 

Use this result to write down the series in Problems 2, 5, 
and 7 by inspection. 

16 Use Problem 15 to find the sum of the series 

+ l)*"- 

17 The binomial series expansion of 1/Vl — x is 


1 


Vl -X 


1 + y* + 


1*3 , , 1-3-5 , 

- r z H- jc* 5 

2 2 • 2! X 2 3 ■ 3! 


+ 


1 - 3 • 5 • 7 
2 4 • 4! 


+ • • • . 


Check this by squaring the series and showing that the 
result is 1 + jc + jc 2 + jc 3 + jc 4 + ■ • ■ , at least as far as 
the jc 4 term. 

18 In Example 3, continue the calculation and find the se¬ 
ries for tan jc as far as the term containing jc 7 . 

19 Use division to obtain the series expansions given in 
Problems 2, 5, 7, 9, 12, and 13. 

In Problems 20-27, use division to obtain the given expan¬ 
sions. 

20 ——-— = 1 + 2jc + 3jc 2 + 4jc 3 + • • • . 

(1 “ x) z 

21 —-—j = 1 - x 2 - x 3 + x 5 + x 6 
1 - jc + x z 

-JC 8 - JC 9 + • • • . 
JC 2 

22 - —~2 -3 = X 2 + X 3 + JC 6 + JC 7 + - • • . 

1 - JC + JC Z - JC J 


23 

24 

25 

26 
27 


= 1 + -jc 2 + 


360 


A' 4 + 


1 + A + A 2 + A 3 + • • 


smx , , 1 1 ? 1 , , 

i— t;—;— r = 1 + —a - —x z - — x 3 + • • • . 
In (1 + a) 2 4 24 


sec a = 
sin" 1 a 


- — 1 + — A 2 + — A 4 + 

cos a 2 24 


= A + “A 3 + — A 5 + 

cos a 3 30 


In Problems 28 and 29, use the method of substitution to find 
the given Taylor series. 


28 

29 

30 

31 

32 


In ( 1 + sin a) = a 

1 


}a 2 + i - 3 


1 — x 2 e x 


= 1 + A 2 + A 3 


6 * 


4 , 13 5 
* + — x° 


24 x 


33 

34 

35 


Use substitution and the fact that sec a — 1/cos a = 
1/[1 — (1 — cos a)] to find the Taylor series for sec a up 
to the term containing a 6 . What is the radius of conver¬ 
gence? 

Use multiplication and the result of Problem 30 to find 
the Taylor series for tan a up to the term containing a 7 . 

(a) Find the Taylor series for sec 2 a as far as the term 
containing a 6 by expanding sec 2 a = 1/cos 2 a = 
1/(1 — sin 2 a) as a geometric series in sin 2 a. 

(b) Find the same series by squaring the series found in 
Problem 30. 

(c) Find the same series by differentiating the series 
found in Problem 31. 

Show that a function /(a) defined on an interval |a| < R 
can be expressed in only one way as the sum of an even 
function #(a) and an odd function h( a). 

A 3 A 3 

Find the sum of the series a + — + — + ■ • • . 

Calculate each of the following limits by first finding the 
Taylor series of the given function: 


(a) lim 

A—>• 


1 


(b) lim 

x —>0 


36 


A *~HJ A 

In this way, the use of Taylor series often provides a con¬ 
venient alternative to the use of L’Hospitafs rule. 

Find the sum of each of the following series: 


(a) a + a 2 - . 

(b) A 2 + A 3 + A 4 - A 


3 + A 4 + A 5 — A 6 + H— - • ■ ; 
5 + A 6 + A 7 

+ A 8 - A 9 + + + 


2 3 4 

, v A A Z , A J A* 

(c) 2! + l! + 4! + 5! + 

r 4 r 8 y\2 

(d) 1 + — + — + — + • 

V ' 4! 8! 12! 


37 


Calculate / (7) (0) if /(a) = tan a and use this to verify the 
coefficient of a 7 in the expansion found in Problem 18. 




























14.8 (OPTIONAL) COMPLEX NUMBERS AND EULER’S FORMULA 

Consider the three familiar power series expansions 


, X 2 X 3 X 4 X 5 

(1) 

e =1 + * + 2! + ¥ + T + 5T + '"’ 

1 X 2 , X 4 

cos X = 1 - — + —-, 

(2) 

X 3 X 

sin x = x- — + —-. 

(3) 


The second and third of these series seem to be parts of the first series in some 
nonobvious way that involves changes in the signs of some of the terms. This 
in turn suggests that the three functions on the left are probably related to one 
another. There is indeed such a relation, which is known as Euler's formula: 

e lx = cos x + i sin x , (4) 

where i = I is the so-called imaginary ; unit. This formula turns out to be one 

of the most important facts in the whole of mathematics, with implications that 
deeply influence mathematics itself and also many of its applications, particu¬ 
larly in the fields of physics and electrical engineering. A full explanation of 
Euler’s formula would require us to develop a fairly complete theory of complex 
numbers and functions of a complex variable. With apologies, we leave this task 
to a more advanced course, and instead briefly outline a few of the necessary 
ideas in a very incomplete way that at least has the merit of lending a little plau¬ 
sibility to formula (4). 

Up to this point, all of our work in this book has taken place in the context of 
the real number system. Nevertheless, the real numbers do have a serious defi¬ 
ciency—not every polynomial equation has a solution. Thus, the quadratic equa¬ 
tion x 2 + 1 = 0 has no solution in the real number system because there is no 
real number whose square is — 1. This deficiency was so crippling that several 
hundred years ago mathematicians felt the need to use the seemingly contradic¬ 
tory symbol Y — 1 to signify a solution of x 2 + 1 = 0. This symbol was later 
denoted by the letter i, and was thought of as an imaginary or fictitious number 
that could be manipulated algebraically just like an ordinary real number, except 
that i 2 — -1. Any qualms these early mathematicians may have felt about the 
puzzling nature of this “number” were set aside because it was too useful to ig¬ 
nore. Thus, for example, the equation x 2 + 1 = 0 was factored by writing it in 
the equivalent forms x 2 — i 2 = 0 or (x + i)(x — i) = 0, and its solutions were 
exhibited as the numbers x = ±i. 

Without entering into the details that would be needed to give mathematical 
respectability to our discussion, we now simply describe the complex numbers 
as all formal expressions a + bi , where a and b are real numbers and i is the 
imaginary unit for which i 2 = — 1. The complex numbers take on the character 
of a legitimate number system by means of the following general rule for per¬ 
forming calculations: In adding, multiplying, and dividing, follow all the famil¬ 
iar rules of elementary algebra and then simplify wherever possible by using the 
equation i 2 = — 1 to remove all powers of i higher than the first, as in 

i 3 = i 2 • i = (— 1) • i = —i, i 4 = i 2 • i 2 = ( — 1)(— 1) = 1, i 5 = i 4 * i = /, 

and so on. 
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The complex number a + bi can be identified with the real number a if 
b — 0, so the complex number system constitutes an enlargement of the real 
number system. Not only does the equation x 2 + 1 =0 acquire the two solutions 
i and —i in this way, but also every quadratic equation ax 2 + bx + c = 0 ac¬ 
quires the two familiar solutions 

—b ± Vb 2 — 4 ac 
X ~ la 

which are real and distinct if b 2 - 4 ac > 0, real and equal if b 2 — 4 ac = 0, 
and complex and distinct if b 2 — 4 ac <0. For example, the equation x 2 — 
6x + 13 = 0 has the distinct complex roots 

6 ± V36 - 52 6 ± V^16 6 ± 4Vm" 


6 ± 4 i 
2 


= 3 ± 2 L 


Much more than this is true: Every polynomial equation of the form 
a n x n + a n -\X n ~ l + • • • + a\x + ao = 0, 


where n is a positive integer and the a’s are arbitrary real numbers with a n =£ 0, 
has exactly n roots (some of which may be equal to one another) among the com¬ 
plex numbers. Moreover, this is still true even if the coefficients are complex. 
This fact is known as th t fundamental theorem of algebra. It shows that there is 
no need to construct further enlargements of the complex number system in or¬ 
der to solve all polynomial equations with complex coefficients.* 

We now return to our original purpose, which was to gain a little insight into 
why Euler’s formula (4) is true. 

A perfectly satisfactory theory of power series can be constructed in which the 
variable is permitted to be a complex number instead of merely a real number. 
Within this theory, all of the series (1), (2), and (3) converge for all complex val¬ 
ues of the variable. If we replace x in (1) by ix and use (5), then we obtain 


e ix = 1 + ix + 


(ix) 2 (ix) 3 (ix) 4 (ix) 5 

2! 3! 4! 5! 


= 1 + ix ~ 


. x 3 x 4 . x 5 




which gives Euler’s formula 

e u = cos x + i sin x. 


If we now replace x by — x and use the fact that cos (—x) = cos x and sin (—x) = 
-sin x, then this becomes 

e~ ix = cos x — i sin x\ 


There are many proofs of this important theorem, of varying levels of sophistication. See, for ex¬ 
ample, pp. 269-271 of R. Courant and H. Robbins, What Is Mathematics? (Oxford University Press, 
1941). 
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and by first adding and then subtracting, w e at once obtain 

e ix _|_ e ~ix e ix — e ~ix 

cos x =--- and sin x = -. 

2 2 / 

These formulas have many uses in advanced mathematics. A particularly inter- 
esting application is sketched in the Appendix at the end of this chapter. We also 
point out that if we put x = tt in Euler’s formula, then we get 

e 771 = cos 7 t + / sin v 


or 

e™ = - 1 . 

This beautiful equation, connecting the mysterious and pervasive numbers 77, e, 
and i, is one of the most remarkable facts in mathematics. 

The ideas of this section are sketched in such a cursory fashion that they are 
bound to seem more suggestive than convincing. The eminent British mathemati¬ 
cian E. C. Titchmarsh once remarked, “I met a man recently who told me that, so 
far from believing in the square root of minus one, he did not even believe in mi¬ 
nus one. This is at any rate a consistent attitude.” There is only one way to lift 
these concepts from the status of reasonable speculations to the realm of certainty, 
and this is to undertake a careful study of the theory of functions of a complex 
variable, also known as Complex Analysis. This subject is one of the richest and 
most rewarding branches of mathematics, and we heartily recommend it. 


CHAPTER 14 REVIEW: CONCEPTS, FORMULAS, METHODS 


Think through the following, and memorize the main ex¬ 
pansions. 

1 Power series, radius and interval of convergence. 

2 Differentiation and integration of power series. 


3 Taylor series. 

4 Taylor’s formula with derivative remainder. 

5 Taylor series expansions of ^ In (1 + x), tan -1 x, e x , 
sin x, cos x. 


ADDITIONAL PROBLEMS FOR CHAPTER 14 


SECTION 14.2 

1 Consider a power series 'La n x n and assume that lim 
^\a n \ exists, with °° as an allowed value. Show that the 
radius of convergence R of the series is given by the 
formula 


lim VR 


Use this formula to find the radius of convergence of 
xf_ , lx v- 1 

n n 

n n 


<*> X ^ 

(C) X 


(b) X 


(In n) n 


( n + \r 


2 X 


2 If the radius of convergence of 'La n x n can be calculated 
from formula (7) in Section 14.2, show that it can also 


be calculated from the formula in Problem 1. (Hint: See 
Additional Problem 18 in Chapter 13.) Show that the 
latter formula is more powerful than the former by con¬ 
sidering the series 


y —^— 

n=l 2' I+(-1) ” + 1 


X X* 
* + 2‘ 




+ * • * . 


*3 If a power series converges conditionally at a point x \, 
or diverges in such a way that its terms are bounded, 
show that x\ must be an endpoint of the interval of con¬ 
vergence. 

4 Use Problem 3 to find by inspection the radius of con¬ 
vergence of 

_ * * 2 * 3 
(a) 1 + 3 + 32 + 33 + " •; 
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V-4 y6 

(b) x 2 - — + — ; 

(c) 2x" ! = x + x + X 2 + x 6 + X 24 + * • •; 

(d) i + # + # + + # + 

(e) 2[2<-”x]" = 1 + |+ 2 2 x 2 + ^ 

+ 2 4 x 4 + • • • . 

5 Find the interval of convergence of 'La n x n if its coeffi¬ 
cients are chosen in order from among the numbers 2, 
3, . . . , 12 by throwing a pair of dice. 

SECTION 14.3 

6 Consider a power series y Za n x n in which the coefficients 
repeat cyclically, a n +k = ci n . Show that 

(a) R= 1; 

(b) the sum is 

ap + a\x + a 2 x 2 + • • • + ak-\x k ~ l 


1 — x k . 


W T + Y + l2 + l6 + 

(e) x + 2 3 x 2 + 3 3 x 3 + 4 3 x 4 + • • • ; 

(f) 4 -F 5jc -F 6lt 2 -F 7jc 3 -F * * • . 

SECTION 14.4 

8 Use any method to obtain each of the following Taylor 
series expansions as far as indicated: 

(a) e sinx = 1 + x + jx 2 ~ jx 4 + • • • ; 

(b) 7T^ = H* + > + -" ; 

(c) e* 2 -* = 1 - X + fx 2 - }x 3 + §*4 + . . . . 

9 Consider the binomial series 

(1 + x)P = 1 + px + P(P 2 ~ D X 2 


p( - l)(p - 2) 


x 3 + 


where p is an arbitrary constant. In Problem 13 of Sec¬ 
tion 14.4 the series on the right was obtained as the Tay¬ 
lor series of the function (1 + x) p , and we also saw that 
this series converges for \x\ < 1. We here outline a se¬ 
quence of steps to prove that the series on the right ac¬ 
tually converges to the function on the left for these val¬ 
ues of x. 

(a) Let f(x) denote the sum of the series for |x| < 1, cal¬ 
culate f'(x) and x/'(x), and show that 

(1 + x)/'(x) = pf(x). 

(b) Define g(x) by 

fix) 


g(x) = 


(1 +x)P 


and use part (a) to show that g'{x) = 0 for |x| < 1, 
so that g(x) = c for some constant c. 

(c) Show that c = 1 in part (b), and conclude that 

(1 +xy=f(x). 


SECTION 14.7 


Find the sum of each of the following series: 

10 

X 3 X 5 X 7 

< a )X-^ 2 + -p-j2 + ""’ 

11 

X 2 X 3 X 4 X 5 

1-2 2-33-4 4 ■ 5 + 

12 

(c) 1 + 4x + 9x 2 + 16x 3 + 25x 4 + • • • ; 

*13 


result to find a closed formula for f\(x). 

Use the idea of Problem 10 to find a closed formula for 
f 2 (x) = X^=\nin + \)x n . 

Use the idea of Problems 10 and 11 to find a closed for¬ 
mula for/ 3 (x) = 2~=i n(n + l)(w + 2)x n . 

In the notation of Problems 10 to 12, show that 

X n 2 x n =/ 2 (x) -/,(*) 

n= 1 

and 

X h 3 *” — /b(x) - 3/ 2 (x) +/i(x). 

n=l 

Using these ideas as a starting point, devise a proof of 

the following theorem: If pin) is a polynomial in n, then 

f(x) = Xn=oP(n)x n is a rational function. 

*44 Show that 1/Vl — 2 xt + t 2 = 1^=\P n {x)t n , where P n (x) 

is a polynomial of degree n, by substituting h = 2xt — 

t 2 in the binomial series for 1/ Vl —h (see Problem 17 

in Section 14.7). Find Pp(x\ P\(x), P 2 (x\ and P 3 (x). The 

polynomials P n {x) are called the Legendre polynomials :; 

they are important in mathematical physics, for instance, 

in the study of heat flow in solid spheres. 

15 Calculate the following limits by using Taylor series: 

x cos x — sinx „ x sin x — tan x 
(a) lim - 2 — (b) lim — 

x 2 tan x *->o 


+ pip - l)ip - 2) • ■ • jp - n + 1) + 

n\ 


(c) lim 

x —>0 


V1 + x 2 + cos x — 2 


sin 2 x 
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In this Appendix we derive several formulas discovered by Euler that rank among the most 
elegant truths in the whole of mathematics. We use the word “derive” instead of “prove” 
because some of our arguments are rather formal and require more advanced ideas than 
we can provide here to become fully rigorous in the sense demanded by modern concepts 
of mathematical proof. However, the mere fact that we are not able here to seal every 
crack in the reasoning seems a flimsy excuse for denying students an opportunity to 
glimpse some of the wonders that can be found in this part of calculus. For those who 
wish to dig deeper, full proofs are given in the treatise by K. Knopp mentioned in Sec¬ 
tion 14.7. 


THE BERNOULLI NUMBERS 

Since 


e x ~ 1 


x 




+ - - - 


for x =£ 0, and this power series has the value 1 at x = 0, the reciprocal function x/(e x - 1) 
has a power series expansion valid in some neighborhood of the origin if the value of this 
function is defined to be 1 at jc = 0. We write this series in the form 


x 

e x - 1 




0 + B ' x + 'ff * 2 


+ ■ 


(i) 


The numbers B n defined in this way are called the Bernoulli numbers, and it is clear that 
Bq = 1. It is easy to see that 


jc e x + 
2 \e x — 


-n = -T + f- 

lie 


e x + 1 


1 ' 


( 2 ) 


A routine check shows that the second term on the right is an even function, so B\ = — \ 
and B n — 0 if n is odd and > 1. If we write (1) in the form 


in + iL x + i2 2 + 
0! 1! 2! 


-h — + — + 

1! 2! 3! 


= 1, 


then it is clear that the coefficient of x n 1 in the product on the left equals zero if n > 1. 
By the rule for multiplying power series, this yields 


Bo 1 , B x 1 B 2 1 B n -j J_ 

0! n\ 1! (n-1)! 2! (w — 2)! (w — 1)! 1! 


and by multiplying through by n ! we obtain 


i?2 "f 


(n - 1)! 1! 


Bn- 1 = 0 . (3) 


This equation can also be written more briefly as 




-U - .1*-'-° 


or 



B k = 0, 


where 



is the binomial coefficient n\/[k\(n - k)\]. By taking n = 3, 5, 7, 9, 11,... in 


(3) and doing a little arithmetic, we easily find that 


APPENDIX: 
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&2 ~ 6 > ^4 30 ’ ^6 — 42 ’ ^8 — 30 ’ ^10 66 > 

These calculations can be continued recursively as far as we please, and so all the Bernoulli 
numbers can be considered as known, even though considerable labor may be required to 
make any particular one of them visibly present. We also point out that it is obvious from 
(3) and the mode of calculation that every B n is rational. 


THE POWER SERIES FOR THE TANGENT 

We now begin to explore the uses of these numbers. 

In equation (2) we move the term —x/2 to the left and use the fact that 

x e x + 1 _ x e xt2 + e~ xtl 
2 ' e x - 1 ~~ 2 e xl2 - e~ x/1 


to obtain 


x_ e * /2 + e = Y §2 n 2 n 

2 e x/2 _ e ~x/2 Z- o (2 n)\ ■ 


(4) 


On the left side of this, we now replace x by 2 ix, which yields 
2 ix e u + e~ ,x (e ix + e~«)/2 

- • -:-— = X -:-:- = r f 

2 e tx _ e -tx ( e <x _ e -*)2i 


by the formulas for sin x and cos * that were derived in Section 14.8. Making the same 
substitution on the right side of (4) gives 


so 


I 


n=0 


Bln 

(2/i)! 


(2a) 2 " = ^ (-1)" 

rt=0 


2 2n B 2 „ 
(2 ft)! 


x 


In 


x cot x = y (-1)" 2 ^ 2 " x 2 ". 
n=0 0-n)\ 


(5) 


The trigonometric identity tan x = cot x - 2 cot 2x now enables us to use (5) to write 

-,=| o (-.)^^--2| o (-.)"^(2,) 2 "-> 


= X (— iy x2n ~ l - X 2 2n x 2n ~' 


2 2 "fi 2 „ 

(2n)! 


= X (> — 2 2n )x 2n ~ i , 

n = 0 \^ n )- 


.4-, <2»)! 

This is the full power series for tan x that was encountered several times in truncated form 
in Section 14.7. Based on our knowledge of the Bernoulli numbers, the first few terms of 
this series are easy to calculate explicitly, 

tan a: = x + y* 3 + j^x 5 + + 2 § 5 x9 + ■ ■ ■ . 


THE PARTIAL FRACTIONS EXPANSION OF THE COTANGENT 

By using entirely different methods, Euler discovered another remarkable expansion of 
the cotangent: If x is not an integer, then 

















APPENDIX: THE BERNOULLI NUMBERS AND SOME WONDERFUL DISCOVERIES OF EULER 


527 


7 T COt 7 TX ■ 


+ 2*2 


(6) 


We will examine this formula from two very different points of view, and give two 
derivations. 

First, it is quite easy to see that (6) is analogous to the expansion of a rational function 
in partial fractions. For instance, if we consider the rational function (2x + 1)/ 
( x 2 — 3x + 2) and notice that the denominator has zeros 1 and 2 and can therefore be 
factored into (x — 1 )(x — 2), then this leads to the expansion 


2x + 1 _ 2x + 1 _ c\ c 2 

x 2 - 3x + 2 ~ (x - 1)U - 2) ” x - 1 + * - 2 


for certain constants C\ and c 2 . The constant c\ can now be determined by multiply¬ 
ing through by jc — 1 and allowing x to approach 1, and similarly for c 2 . Formally, 
(6) can be obtained in much the same way by noticing that cot ttx = cos 7rx/sin ttx 
has a denominator with zeros 0, ±1, ±2,..., and should therefore be expressible in 
the form 


cot 7 TX 


= 7 + 2 ( 7 ^ 


x + n 


(7) 


From this, the constants a , b n , and c n can be found by the procedure suggested (they 
are all equal to l/7r), and (7) can then be rearranged to yield (6). For reasons that 
will now be obvious, it is customary to refer to (6) as the partial fractions expansion 
of the cotangent. The main gap in this suggestive but rather tentative derivation is of 
course the fact that we have no prior guarantee that an expansion of the form (7) is 
possible. 

Another way of approaching (6) is to begin with the infinite product (6) in Appendix 
1 at the end of Chapter 13: 




I f w e take the logarithm of both sides t o obtain 


In- 


-2> i — 


and then differentiate, the result is easily seen to be 

—2x 


cot x -= ^ 


r^l n 2 TT 2 — X 2 


nhT 2 ) 


or 


1 - ^ v 1 
cot X = -(- 2x ) -t:- 5 — =- 

r +— 1 — n*-Tr L 


and replacing x by ttx and then multiplying through by ttx yields 


TTX COt TTX - 1 + 2x 2 ^ 

n= 1 



which is equivalent to (6). 


( 8 ) 


EULER’S FORMULA FOR 21 /n 2k 

We now obtain a major payoff from (5) and (8) by replacing x by ttx in (5) and equating 
the two expressions for 7r;c cot ttx. 
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i i ^ ~2x 2 , / n * 2 2k B 2k . 

i + X =i + I <->>* i2«r (TO) • 


(9) 


where we use k as the index of summation on the right for reasons that will appear in a 
moment. Each term of the series on the left is easy to expand in a geometric series, 


-2x 2 


= -2 ■ 


x 2 /n 2 


y 2 k 


= - 2 X(^) 


1 - x 2 ln 2 

so (9) can be written as 

We now interchange the order of summation on the left and obtain 


2 2k B 2 


k= 1 


1 +X (- 2 X )* 2k = 1 +!(-')* 


2 lk B 


(2 k)\ 


2JL „2k r 2k 


and equating the coefficients of x 2k yields 


Y — = f-n*-' ■— 

n 2k { ) 2(2k) 


2k 2k 
- 7T ZK 


for each positive integer k. In particular, for k = 1, 2, 3 we get 

oo-i 9 °°i A °°i 

v — — v -L — v _L 

^ n 2 ~ 6 ’ V ~ 90 5 ^ ” 


945 ’ 


It is very remarkable that for more than 250 years there has been no progress whatever 
toward finding the exact sum of any one of the series 


y - 

A „3’ 





Perhaps a second Euler is needed for this breakthrough, but none is in sight. 
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CONIC 

SECTIONS 


In order to understand the central ideas of Chapters 13 and 14, it was necessary 
to pay close attention to the precise wording of definitions and to the details of 
proofs, so the level of mathematical rigor in those chapters was rather high. How¬ 
ever, we now turn to work that is mostly geometric in nature. We shall therefore 
rely much more heavily on reasoning based on spatial intuition and the kind of 
insight that can be obtained from carefully drawn figures. 

Consider a circle C. Let A be the line through the center of C perpendicular 
to the plane of C, and let V be a point on A not in the plane of C, as shown in 
Fig. 15.1. Let P be a point on C, and draw the infinite straight line through P 
that also passes through V. As P moves around C, the line PV sweeps out a right 
circular cone with axis A and vertex V. Each of the lines PV is called a genera¬ 
tor of the cone, and the angle a between the axis and any generator is called the 
vertex angle. The cone shown in Fig. 15.1 has a vertical axis, and the upper and 
lower portions of the cone that meet at the vertex are called the nappes of the 
cone.* In elementary geometry a cone is usually understood to be a solid figure 
occupying the bounded region of space that lies between V and the plane of C 
and is inside the surface we have just described. However, in the present context 
the cone is this surface itself, and is understood to consist of both nappes, ex¬ 
tending to infinity in both directions. 

The curves obtained by slicing a cone with a plane that does not pass through 
the vertex are called conic sections , or simply conics. If the slicing plane is par¬ 
allel to a generator, the conic is called a parabola. Otherwise, the conic is called 
an ellipse or a hyperbola , depending on whether the plane cuts just one or both 
nappes. The hyperbola is to be thought of as a single curve consisting of two 
“branches,” one on each nappe. These three curves are illustrated in Fig. 15.2. 

The three curves shown in Fig. 15.2 can be described in another way. Imag¬ 
ine a source of light placed at V and a circular ring placed at C. Then the shadow 
cast by the ring on a plane will be a parabola, an ellipse, or one branch of a hy¬ 
perbola, depending on the steepness of the plane. If the plane is parallel to one 
of the lines joining V to C, we get a parabolic shadow; the shadow will be an el¬ 
lipse if the plane is less steep than this, and part of a hyperbola if it is more steep. 


*“Nappe” is from the French word nappe, meaning a sheet of something, perhaps cloth, as in “nap¬ 
kin” or “napery” (household linen). 


15.1 

INTRODUCTION. 
SECTIONS OF A CONE 



Figure 15.1 
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It should be noted that if we move each of the slicing planes in Fig. 15.2 par¬ 
allel to itself until it passes through the vertex, then we get three so-called de¬ 
generate conic sections, namely, a single straight line, a point, and a pair of in¬ 
tersecting straight lines. 

Many important discoveries in both pure mathematics and science have been 
linked to the conic sections. The classical Greeks—Archimedes, Apollonius and 
others—studied these beautiful curves for the sheer pleasure of it, as a form of 
play, without any thought of their possible uses. The first applications appeared 
almost 2000 years later, at the beginning of the seventeenth century. About the 
year 1604 Galileo discovered that if a projectile is fired horizontally from the top 
of a tower and is assumed to be acted on only by the force of gravity—that is, 
if air resistance and other complicating factors are ignored—then the path of the 
projectile will be a parabola. One of the great events in the history of astronomy 
occurred only a few years later, in 1609, when Kepler published his discovery 
that the orbit of Mars is an ellipse and then went on to suggest that all the plan¬ 
ets move in elliptical orbits. And about 60 years after this, Newton was able to 
prove mathematically that an elliptical planetary orbit implies, and is implied by, 
an inverse square law of gravitational attraction. This led Newton to formulate 
and publish (in 1687) his famous theory of universal gravitation as the explana¬ 
tion of the mechanism of the solar system, which has been described as the great¬ 
est contribution to science ever made by one man. These developments took place 
hundreds of years ago, but the study of conic sections is far from outdated even 
today. Indeed, these curves are important tools for present-day explorations of 
outer space, and also for research into the behavior of atomic particles. Artificial 
satellites move around the earth in elliptical orbits, and the path of an alpha par¬ 
ticle moving in the electric field of an atomic nucleus is a hyperbola. These ex¬ 
amples and many others show that the importance of conic sections, both his¬ 
torically and in modem times, is difficult to exaggerate. 

We shall be studying the conic sections as plane curves. For this purpose it is 
convenient to make use of equivalent definitions that refer only to the plane in 
which the curves lie and depend on special points in this plane called foci ( fo¬ 
cus is the singular). An ellipse can be defined as the set of all points in the plane 
the sum of whose distances d\ and dz from two fixed points F and F' (the foci) 
is constant, as shown on the left in Fig. 15.3. A hyperbola is the set of all points 
for which the difference | d\ — d 2 I is constant. And a parabola is the set of all 







15.2 ANOTHER LOOK AT CIRCLES AND PARABOLAS 



d x + d 2 = a constant 
(Ellipse) 



\d\ ~ d 2 \ = a constant 
(Hyperbola) 



(Parabola) Figure 15.3 


points for which the distance to a fixed point F (the focus) equals the distance 
to a fixed line (called the directrix). 

There is a simple and elegant argument which shows that the focal property 
of an ellipse follows from its definition as a section of a cone. This proof uses 
the two spheres shown in Fig. 15.4, which are internally tangent to the cone along 
the horizontal circles C\ and C 2 , and are also tangent to the slicing plane at the 
points F and F. If P is an arbitrary point on the ellipse, we must show that the 
sum of the distances PF + PF' is constant in the sense that it does not depend 
on the particular position of P. To see this, we notice that if Q and R are the 
points on C\ and C 2 that lie on the generator through P, then PF = PQ and PF' = 
PR , because any two tangents to a sphere drawn from a common external point 
have the same length. It follows that PF + PF' = PQ + PR = QR\ and the ar¬ 
gument is completed by observing that QR , as the distance from C\ to C 2 down 
a generator, has the same value for every position of P. 

With slight modifications this proof also works for the hyperbola and the 
parabola. In the case of the hyperbola, we use one sphere in each portion of the 
cone, with both spheres tangent to the slicing plane. And for the parabola we use 
one sphere tangent to the slicing plane. The focus of the parabola is this point of 
tangency, and its directrix is the line of intersection of the slicing plane with the 
plane of the circle along which the sphere is internally tangent to the cone. Stu¬ 
dents should use these hints to draw suitable pictures and prove for themselves 
that the focal properties of the hyperbola and the parabola can be derived from 
their definitions as sections of a cone. 



Figure 15.4 The f ocal property of 

ellipse. 


Circles and parabolas were discussed fairly thoroughly in Chapter 1. However, 
that was a long time ago, and it may be helpful to give a very brief review of the 
main facts in order to assist students in fitting these topics into the context of the 
present chapter. 

CIRCLES 


15.2 

ANOTHER LOOK AT 
CIRCLES AND 
PARABOLAS 


Referring to Fig. 15.4, we see at once that a circle can be thought of as the spe¬ 
cial case of an ellipse obtained by taking the slicing plane perpendicular to the 
axis of the cone, so that the foci coincide. Nevertheless, for several reasons it is 
convenient to discuss circles separately, and to reserve the word “ellipse” for the 
case in which the foci are two distinct points. 
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A circle , therefore—as we very well know—can be defined as a plane curve 
consisting of the set of all points at a given fixed distance (called the radius ) 
from a given fixed point (called the center). If r > 0 is the radius and (h , k) is 
the center, and if (jc, y) is an arbitrary point on the circle (see Fig. 15.5), then by 
using the distance formula we can write the defining condition as 

V(jc — hj 2 + (y — k) 2 = r 


or 

(x - h) 2 + (y - k) 2 = r 2 , (1) 

which is the equation of the circle in standard form. By squaring the terms on 
the left of (1) and rearranging, this equation can be written in the form 

x 2 + y 2 + Ax + By + C = 0. (2) 


Conversely, by completing the square on the x and y terms, any equation of 
the form (2) can be written in the form (1), and therefore represents a circle if 
r 2 > 0. As students will remember, there is a slight difficulty with this procedure 
as a result of the fact that the constant r 2 on the right of (1) may turn out to be 
zero or a negative number. In these cases, (1) can be thought of as the equation 
of a single point or the empty set. 


PARABOLAS 

As we saw in Section 15.1, a parabola can be defined as a plane curve consist¬ 
ing of the set of all points P that are equally distant from a given fixed point F 
and a given fixed line d, as shown on the left in Fig. 15.6. The fixed point is 
called the focus , and the fixed line is called the directrix. To find a simple equa¬ 
tion for this curve, we introduce the coordinate system shown on the right in the 
figure, in which the focus is the point F = (0, p\ where p is a positive number, 
and the directrix is the line y = —p. If P = (jc, y) is an arbitrary point on the 
parabola, then by using the distance formula the defining condition can be writ¬ 
ten as 

V* 2 + (y - P ) 2 = y + p. (3) 




Figure 15.6 A parabola. 
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y 2 = -4 px Figure 15.7 


By squaring and simplifying we get 

x 2 + y 2 — 2py + p 2 = y 2 + 2 py + p 2 


or 


x 2 = 4 py. (4) 

Conversely, by reversing the steps, it can be shown that (3) can be derived from 
(4). Equation (4) is therefore the equation of this particular parabola in standard 
form. The line through the focus perpendicular to the directrix is called the axis 
of the parabola, and the point V where the parabola intersects the axis is called 
the vertex. For the parabola (4), the axis is clearly the y-axis, and the vertex is 
the origin. 

If we change the position of the parabola relative to the coordinate axes, we 
naturally change its equation. Three other simple positions, each with its corre¬ 
sponding equation, are shown in Fig. 15.7. Students should verify the correct¬ 
ness of all three equations. We emphasize that the constant p is always under¬ 
stood to be a positive number; geometrically, it is the distance from the vertex 
to the focus, and also from the vertex to the directrix. 

We illustrate a further point about parabolas by considering the equation 

;t 2 — 8x — y + 19 = 0. 

If we write this as x 2 — 8jc = y — 19 and complete the square on x , then the re¬ 
sult is 


(x - 4) 2 = y - 3. 


If we now introduce new variables jc' and y' by writing 

x' = x-4, 

y' =y ~ 3, 

then our equation becomes 


The graph of this equation is clearly a parabola with vertical axis whose vertex 
lies at the origin in the x\ y' coordinate system, and this origin is located at the 
point (4, 3) in the x, y system, as shown in Fig. 15.8. In exactly the same way, 
any equation of the form 


x 2 + Ax + By + C = 0, 



B* 0 , 


(5) Figure 15.8 
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Figure 15.9 The reflection property. 


represents a parabola with vertical axis. The vertex of this parabola is easily lo¬ 
cated by completing the square on x , and in this way the equation can be writ¬ 
ten in the form 

(x~ h) 2 = 4p(y-k) or (x - h) 2 = -4p( y - k\ 

where the point ( h , k) is the vertex.* Similarly, any equation of the form 
y 2 + Ax + By + C = 0, A =£ 0, 

represents a parabola with horizontal axis, and the geometric features of this 
parabola can be discovered by completing the square on y and writing the equa¬ 
tion as 

(y ~ k ) 2 = 4 p(x ~ h ) or (y - k) 2 = -4p(* - h). 

We conclude this section by describing the so-called reflection property of 
parabolas. Consider the tangent line at a point P = (*, y) on the parabola y 2 = 
Apx (Fig. 15.9), where F = ( p , 0) is the focus. As shown in the figure, let a be 
the angle between the tangent and the segment FP, and let ft be the angle be¬ 
tween the tangent and the horizontal line through P. In Problem 9, students are 
asked to prove that a = ft. 

This geometric property of parabolas has many applications. For example, it 
is used in the design of mirrors for searchlights. To construct such a mirror, re¬ 
volve the parabola about its axis to form a surface of revolution, then coat the 
inside with silver paint to make a reflecting surface. If a source of light is placed 
at F, each ray will be reflected along a line parallel to the axis to form a beam 
of parallel rays. The same principle is used in a more important way in the de¬ 
sign of mirrors for reflecting telescopes and solar furnaces, where rays of light 
that are parallel to the axis and come in toward the mirror are reflected in to the 
focus. This reflection property of parabolas also underlies the design of radar an¬ 
tennas and radio telescopes. 


*We point out here that if B = 0 is allowed in (5), then the graph of the equation can be one straight 
line, or two parallel lines, or the empty set. For the particular equation x 2 — 2x — k = 0, or equiva¬ 
lently (* — l) 2 = k + 1, these cases correspond to k = — 1, /c > —1, and k < — 1. 


PROBLEMS 

1 For each of the following equations, determine the na¬ 
ture of the graph by completing the square: 

(a) x 2 + y 2 - 2x - 6y - 15 = 0; 

(b) * 2 + y 2 + 4x- 1 8y + 88 = 0; 

(c) x 2 + y 2 - 10* -I- 2 y + 26 = 0; 

(d) x 2 + y 2 — 16* + 12y + 96 = 0; 

(e) * 2 + y 2 + 6* - 14y + 58 = 0; 

(f) * 2 + y 2 + 14* + lOy + 95 = 0. 

2 If 0 < a < b, find the radius r and center ( h , k) of the 
circle that passes through the points (0, a) and (0, b) and 
is tangent to the *-axis at a point to the right of the ori¬ 
gin. 

3 For each of the following parabolas, find the vertex, fo¬ 
cus, and directrix: 


(a) * 2 + 4* — 4y = 0; 

(b) y 2 - 8* - 2y + 25 = 0; 

(c) * 2 + 4* + 16y — 76 = 0; 

(d) y 2 + 12* - 2y + 25 = 0; 

(e) y = * 2 + 2* + 3. 

4 A searchlight reflector is designed as stated in the text. 
If it is 2 ft deep and the opening is 5 ft across, find the 
focus. 

5 Find the equation of the parabolic arch with base b and 
height h that is shown in Fig. 15.10. 

6 Show that the area of the parabolic segment in Fig. 
15.10 is | hb. (Notice that this area is four-thirds the area 
of the triangle with the same base and height, a fact that 
was discovered and proved by Archimedes.) 
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Figure 15.11 


7 If the parabolic segment in Fig. 15.10 is revolved about 
its axis, show that the volume of the resulting solid of rev¬ 
olution is three-halves the volume of the inscribed cone. 

8 A parabola with focus F and directrix d that are given 
and marked on a sheet of paper can be constructed as 
follows (see Fig. 15.11). On a drafting board, fasten a 
ruler to the paper with its edge along d , and place the 
short leg AB of a draftsman’s triangle ABC against the 
edge of the ruler. At the opposite vertex C of the trian¬ 
gle fasten one end of a piece of string whose length is 
the same as that of the long leg BC of the triangle, and 
fasten the other end of the string at F. If a pencil point 
at P keeps the string taut, as shown in the figure, then 
the point of the pencil draws part of a parabola as the 
triangle slides along the ruler. Explain why this con¬ 
struction works. 

9 Prove that a = /3 in Fig. 15.9. Hint: Extend FP through 
P and use the subtraction formula for the tangent to 
show that tan a = tan /3. 

10 Show that the lines tangent to a parabola at the ends of 
a focal chord (a chord through the focus) intersect at 
right angles. 

*11 Show that the lines tangent to a parabola at the ends of 
a focal chord intersect on the directrix. 


12 Let C be a circle of radius ro and L a line that lies in 
the same plane and does not intersect C. Show that the 
centers of all circles that do not surround C and are 
tangent to both C and L lie on a parabola (Fig. 15.12). 
State the location of the focus and directrix of this 
parabola. 



In Section 15.2 we gave “set of all points” definitions for both circles and parabo¬ 
las. It is also possible to give “locus” definitions, in which each curve is defined 
— and thought of—as the path of a moving point that satisfies a certain condi¬ 
tion as it moves. This language has the advantage of greater pictorial vividness. 
Thus, a parabola can be defined as the locus of a point that moves in such a way 
that it maintains equal distances from a given fixed point and a given fixed line. 

Similarly, in accordance with Section 15.1, we can define an ellipse as the lo¬ 
cus of a point P that moves in such a way that the sum of its distances from two 
fixed points Fand F' is constant, as shown on the left in Fig. 15.13. To simplify 
later equations, we denote this constant by 2 a and write the defining condition 
as 


15.3 
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PF + PF' = 2a. 


( 1 ) 
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Figure 15.13 


P Sum = 2a 




B' 


The two points F and F' are called the foci (plural of focus) of the ellipse be¬ 
cause of the reflection property discussed in Remark 1 below. Since circles are 
not considered to be ellipses in this discussion, F and F are understood to be 
two distinct points: F =£ F f . 

The definition provides an easy way to draw an ellipse on a sheet of paper. 
We begin by fastening the paper to a drawing board with two tacks placed at F 
and F'. Next, we tie the ends of a piece of string to the tacks and pull the string 
taut with the point of a pencil. It is clear that if the pencil is moved around while 
the string is kept taut, then its point draws an ellipse. Because of this construc¬ 
tion, the defining condition (1) is often called the string property of an ellipse. 

We now introduce several standard notations for the dimensions of an ellipse. 
It is easy to see from the definition that the curve is symmetric with respect to 
the line through the foci, and also with respect to the perpendicular bisector of 
the segment FF' . On the right in Fig. 15.13 the segment AA' is called the major 
axis and the segment BB' is called the minor axis of the ellipse, and the point O 
where these axes intersect is called the center. The two points A and A ' at the 
ends of the major axis are called the vertices of the ellipse. We denote the length 
of the minor axis by 2b and the distance between the foci by 2c. It is clear that 
BF = a , because BF + BF' = 2a and BF = BF', so 

a 2 = b 2 + c 2 . (2) 



Since AF + AF' = 2a and AF' — M', we see that AA' = 2a, so the length of the 
major axis is 2a. The numbers a and b are called the semimajor axis and the 
semiminor axis. 

It is easy to see from equation (2) that b < a. If b is very small compared with 
a, so that the ellipse is long and thin, then (2) shows that c is nearly as large as 
a , and the foci are near the ends of the major axis; and if b is nearly as large as 
a , so that the ellipse is nearly circular, then c is small, and the foci are close to 
the center. The ratio da is called the eccentricity of the ellipse and is denoted 
by e: 


_ c 
a 


Va 2 ^ 2 

a 


(3) 


Notice that 0 < e < 1. Nearly circular ellipses have eccentricity near 0, and long, 
thin ellipses have eccentricity near 1. 

In order to find a simple equation for the ellipse, we take the jc-axis along the 
segment FF' and the y-axis as the perpendicular bisector of this segment. Then 
the foci are F = (c, 0) and F = ( —c, 0), as shown in Fig. 15.14, and the defin¬ 
ing condition (1) yields 


V(jc - c) 2 + y 2 + V(jc + c) 2 + y 2 = 2 a 


(4) 


as the equation of the curve. 


Figure 15.14 
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To simplify this equation, we follow the usual procedure for eliminating rad¬ 
icals, namely, solve for one of the radicals and square. If we move the first rad¬ 
ical in (4) over to the right side, square both sides, and simplify, then we obtain 

PF = V(;t - c) 2 + y 2 = a — — x (5) 

a 

and 

PF' = V(;c + c) 2 + y 2 = a + — (6) 

a 


where (6) follows from (5) because PF' - 2a - PF. By squaring again and sim¬ 
plifying, either of these equations gives 


or 



+ y 2 = a 2 


c 


2 



(7) 


By using (2) to simplify (7) still further, we now put the equation into its final 
form, 



( 8 ) 


This argument shows that (8) is satisfied if (4) is. It can be shown, conversely, 
that (4) is satisfied if (8) is, but we omit the details. Equation (8) is therefore the 
standard form of the equation of the ellipse shown in Fig. 15.14. 

We pause briefly to point out that equation (8) easily yields most of the sim¬ 
pler geometric features of the ellipse that are visible in Fig. 15.14. (i) If y = 0, 
then the equation tells us that x = ±a, and if x = 0, then y = ±b , so the curve 
crosses the x and y axes at the four points (±a, 0) and (0, ±b). (ii) Since both 
terms x 2 la 2 and y 2 /b 2 are nonnegative and their sum is 1, it follows that neither 
of them can be greater than 1, so \x\ < a and |y| < b. This means that the whole 
ellipse is contained in the rectangle whose sides are x — ±a and y = ±b, and is 
therefore—unlike the parabola—a bounded curve, (iii) If (x, y) satisfies the equa¬ 
tion, then so do (x, —y) and (— x, y), so the curve is symmetric with respect to 
both the x-axis and the y-axis. This tells us that to graph the complete curve it 
suffices to sketch the graph in the first quadrant and then extend it to the other 
quadrants by symmetry. The left-right symmetry of the ellipse that is so obvious 
from equation (8) is really rather remarkable, because most people contemplat¬ 
ing Fig. 15.2 for the first time feel quite sure that an ellipse should be an egg- 
shaped oval which has a “small end” at the part of the ellipse nearest the vertex 
of the cone and a “big end” at the part farthest from this vertex—but of course 
this is not true. 

We consider again formulas (5) and (6) for the right and left focal radii PF 
and PF', which can be written as 

PF = a - — x = e 
a 

and 

PF>=a + ±x = e(Z + x') = e 




( 10 ) 
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Figure 15.15 



where e = da is the eccentricity defined earlier. The quantities in brackets can 
be interpreted (see Fig. 15.15) as the distances PD and PD' from P to the lines 
x = ale and* = -ale, respectively. Formulas (9) and (10) can therefore be writ¬ 
ten in the form 


PF 

PD 


e 


and 



( 11 ) 


Each of the lines x = ale and x — —ale is called a directrix of the ellipse. Equa¬ 
tions (11) show that an ellipse can be characterized as the locus of a point that 
moves in such a way that the ratio of its distance from a fixed point (a focus) 
to its distance from a fixed line (the corresponding directrix) equals a constant 
e < 1. We shall see in Chapter 16 and elsewhere that this way of characterizing 
ellipses is often very useful. 



Figure 15.16 


Example 1 Identify the graph of \6x 2 + 25y 2 = 400 as an ellipse, and find its 
vertices, foci, eccentricity, and directrices. Sketch the graph. 

Solution First, we divide by 400 to convert the equation into the standard form 


which on comparison with (8) tells us that the graph is an ellipse. Since a 2 = 25 
and b 2 = 16, we have a = 5 and b = 4, so the vertices are (±5, 0) and the ends 
of the minor axis are (0, ±4), as shown in Fig. 15.16. Next, c 2 - a 2 — b 2 = 25 - 
16 = 9, so c = 3 and the foci are (±3, 0). Finally, the eccentricity is e = da = 
}, and the directrices are the vertical lines x = ±a!e = ±y. 


In the above discussion it is assumed that the ellipse has its center at the ori¬ 
gin and its foci on the x-axis. However, if its center is the origin and its foci lie 
on the y-axis, then its major axis is vertical and the roles of x and y are inter¬ 
changed. 

Example 2 Show that 9x 2 + 4y 2 = 36 represents an ellipse, and find its vertices, 
foci, eccentricity, and directrices. Sketch the graph. 

Solution As before, we divide by 36 to convert the given equation into the rec¬ 
ognizable standard form 
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= i. 


Observe that here the denominator of the y term is larger, so we have an ellipse 
whose major axis is vertical. The semimajor and semiminor axes are clearly a = 3 
and b = 2, so the vertices (see Fig. 15.17) are (0, ±3) and the ends of the minor 
axis are (±2, 0). Since c 2 = a 2 - b 2 = 9 - 4 = 5, c = V5, and the foci are the 
points (0, V5) and (0, — V5) on the y-axis. The eccentricity is_e = da = V5/3, 
and the directrices are the horizontal lines y = ±a!e = ±9/V5 = ±|V5. 


Examples 1 and 2 illustrate the fact that if we have an equation of the form 


/ 


( ) 2 ( ) 2 


with unequal denominators, then the equation represents an ellipse, and the ques¬ 
tion of whether the foci and major axis lie on the x-axis or the y-axis is deter¬ 
mined by which denominator is larger. 

In equation (8), x and y are the horizontal and vertical displacements from the 
axes of the ellipse to the point P = (x, y). If the center is the point ( h , k) instead 
of the origin, then these displacements are x — h and y — k, and the equation of 
the ellipse becomes 



(x - h) 2 (y - k) 2 = 

a 2 b 2 


( 12 ) 


Example 3 Show that 4x 2 + 16y 2 — 24x — 32y = 12 is the equation of an el¬ 
lipse, and find its vertices, foci, eccentricity, and directrices. Sketch the graph. 

Solution The equation can be written as 

4(x 2 -6x) + 16( y 2 - 2y) = 12. 

Completing the squares inside the parentheses, we obtain 

4(jc - 3) 2 + 16( y - 1)2 = 64 


or 


(x - 3) 2 (y~ l) 2 

16 4 

Comparison with (12) shows that this represents an ellipse with center (3, 1), hor¬ 
izontal major axis, and semiaxes a = 4, b = 2, so the vertices (Fig. 15.18) are 
the points (7, 1), (—1, 1) and the ends of the minor axis are (3, 3), (3, — 1). The 
foci are a distance c = Vo 2 — b 2 = vT2 = 2V3 to the right and left of the cen¬ 
ter, and are therefore the points (3 ± 2 V3, 1 ). The eccentricity is e = c/a = y V3, 
and the directrices are vertical lines at a distance ale = 8/V3 = fV3 to the right 
and left of the center. Their equations are x = 3 ± fV3. 



Remark 1 Like parabolas, ellipses also have a remarkable reflection property. 
Let P be a point on an ellipse with foci F and F\ and let T be the tangent at P, 
as shown in Fig. 15.19. If T makes angles a and /3 with the two focal radii PF 
and PF', then a = (3. Students are asked to prove this in Problem 9. 
















CONIC SECTIONS 


540 



This reflection property has no important scientific applications like those we 
saw in the case of parabolas, but there is at least one mildly amusing conse¬ 
quence. Let the ellipse in the figure be revolved about its major axis to form a 
surface of revolution, and imagine that a room is built with its walls and ceiling 
having the shape of the upper part of this surface, with the two foci about shoul¬ 
der height above the floor. Then a whisper uttered at one focus can be clearly 
heard a considerable distance away at the other focus even though it is inaudi¬ 
ble at intermediate points, because the sound waves bounce off the walls and are 
reflected to the second focus, and furthermore arrive together because they all 
travel the same distance. There actually exist several rooms of this kind—known 
as whispering galleries —in certain American museums of science and in the 
castles of a few eccentric European monarchs. 

A less frivolous application is in the new treatment for kidney stones called 
lithotripsy (from the Greek lithos , stone + tripsis, a rubbing or pounding). An 
ellipsoidal reflector is placed in such a position that the offending kidney stone 
is at one focus. High-intensity sound waves generated at the other focus are re¬ 
flected harmlessly through the patient’s body and are concentrated at the stone, 
which they pound into powder. The patient is spared having to go through surgery 
and recovers in a few days. 
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Earth 
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Figure 15.20 Eccentricities of 
planetary orbits. 


Earth 



Remark 2 Except for small perturbations resulting from the influence of the 
other planets, each planet in the solar system revolves around the sun in an el¬ 
liptical orbit with the sun at one focus. As we pointed out in Section 15.1, this 
phenomenon was discovered empirically by Kepler in the early seventeenth cen¬ 
tury, and was explained mathematically by Newton in the later decades of the 
same century. We shall give a detailed treatment of Newton’s ideas at the end of 
Chapter 17. 

Most of the planets, including the earth, have orbits that are nearly circular. 
This can be seen from the eccentricities given in the table in Fig. 15.20. Mer¬ 
cury, however, has a rather eccentric orbit, with e = 0.21, as does Pluto, with e = 
0.25. Other bodies in the solar system have even more eccentric orbits, for in¬ 
stance, the flying mountains known as asteroids. Thus the asteroid Icarus, which 
was discovered at M ount Palomar in 1949 and is about 1 mi in diameter, has an 
orbit so eccentric, with e = 0.83, that at its closest approach to the sun (Fig. 
15.21) it is halfway between the sun and the orbit of Mercury, and at its farthest 
it is out beyond the orbit of the earth.* 

One of the most interesting objects in the solar system is Halley’s Comet, 
which has eccentricity e = 0.98 and an orbit (Fig. 15.22) about 7 astronomical 
units wide by 35 astronomical units long. [One astronomical unit (AU) is the 
semimajor axis of the earth’s orbit, approximately 93 million miles or 150 mil¬ 
lion kilometers.] The period of revolution of this comet around the sun is about 
76 years. It appeared in 1910, and again in 1985-1986. It was observed in 1682, 
and the astronomer Edmund Halley (Newton’s friend) successfully predicted its 
return in 1758, many years after his own death in 1742. This was one of the most 
convincing successes of Newton’s theory of gravitation. At its closest approach. 


*The surface temperature of Icarus has been estimated at about 900°F at its closest approach to the 
sun. Arthur C. Clarke has used this fact as the basis fora fine story, “Summertime on Icarus,” in his 
collection The Nine Billion Names of God (New American Library, 1974). See also Chapter 2, “The 
Little Planets,” in Fletcher G. Watson’s Between the Planets (Doubleday Anchor Books, 1962), es¬ 
pecially p. 29. 












15.3 ELLIPSES 


541 


Pluto 



Figure 15.22 Orbit of Halley’s 
Comet, drawn approximately to scale. 


Halley’s Comet is only 0.59 AU away from the sun. Its previous visits to the near 
neighborhood of the sun have been traced back step by step by means of histor¬ 
ical records to the year 11 B.C., and perhaps even earlier.* 


*For a more detailed account of these remarkable events, see P. L. Brown, Comets, Meteorites and 
Men (Taplinger, 1974); or N. Calder, The Comet Is Coming! (Viking, 1980). 


PROBLEMS 


1 Find the equation of the ellipse 

(a) with foci at (±2, 0) and major axis of length 10; 

(b) with foci at (0, ±4) and minor axis of length 12; 

(c) with major and minor axes of lengths 4 and 3, re¬ 
spectively, center at the origin, and foci on the y-axis; 

(d) with foci at (±3, 0) and eccentricity e = 

(e) with eccentricity e = y, center at the origin, and the 
ends of the major axis at (0, ±6); 

(f) with eccentricity e = j and the ends of the minor axis 
at (0, ±10). 

2 Find the equation of the ellipse 

(a) with vertices (6, 2), (—4, 2) and minor axis of length 
6; 

(b) with major axis 8 units long, and foci at (6, 3) and 
(2,3); 

(c) with minor axis 6 units long, and foci at (1,0) and 
(1,6); 

(d) with eccentricity e = j , and ends of the major axis 
at (10, 1) and (—6, 1). 

3 Find the center, vertices, foci, and eccentricity of each of 
the following ellipses: 

(a) 25* 2 + 9y 2 = 225; 

(b) jc 2 + 4y 2 = 4; 

(c) 2(x + 2) 2 + (y - l) 2 = 2; 

(d) x 2 -I- 4y 2 - lx = 0; 

(e) 4x 2 + 9y 2 - \6x 4- 18y = 11; 

(f) x 2 + 2 y 2 — &y = 0. 

4 Consider an equation of the form 

Ax 2 + By 2 + Cx + Dy + E = 0, 

where A and B are both positive or both negative and 
A ± B. Show that the graph is an ellipse, a single point, 
or the empty set. 

5 Write down the integrals that give (a) the first-quadrant 
area of the circle* 2 + y 2 = a 2 , and (b) the first-quadrant 


area of the ellipse x 2 !a 2 + y 2 /b 2 = 1. Show that the sec¬ 
ond integral is bla times the first, and in this way obtain 
the area of the ellipse from the known area of the circle. 

6 If a cylindrical drinking glass with some water in it is 
tilted slightly, as in Fig. 15.23, then the surface of the 
water is no longer a circle but looks like an ellipse. Prove 
that it really is an ellipse by modifying the argument used 
in Fig. 15.4. 



7 Find the volume of the solid of revolution obtained by 
revolving the ellipse x 2 la 2 + y 2 /b 2 = 1 about 

(a) the x-axis; 

(b) the y-axis. 

If a > b, the first solid is called a prolate spheroid and 
the second an oblate spheroid. 

8 The base of a solid is the region bounded by an ellipse 
with semiaxes 5 and 3. Find the volume of the solid if 
each cross section in a plane perpendicular to the major 
axis is 

(a) a square; 

(b) an equilateral triangle. 

9 Prove that a = (3 in Fig. 15.19. Hint: Extend FP and F'P 
through P and show that tan a = tan f3 by using the sub¬ 
traction formula for the tangent. 
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10 Consider two ellipses with the same eccentricity e , both 
centered at the origin and both with major axis on the 
*-axis. Suppose that their equations are 



and 



Show that these ellipses are similar in the sense that 

(a) there exists a constant k such that 

£l _ h . = £l _ k . 

<22 b2 C2 

(b) if a half-line from the origin O intersects the first el¬ 
lipse at P\ and the second at P 2 , then 


OP\ 

OP 2 


k. 


11 Show that the line tangent to the ellipse x 2 la 2 + y 2 /b 2 = 
1 at a point Pi = Ui, >’i) is 


Ml - 1 
a 2 + b 2 L 

12 If tangent lines to the ellipse x 2 /25 + y 2 1 16= 1 intersect 
the y-axis at (0, 8), find the points of tangency. 

13 If tangent lines to the ellipse x 2 !o 2 + y 2 /b 2 = 1 intersect 
the y-axis at (0, d), where d > b, find the points of tan¬ 
gency. 

14 Let F be a point which is inside a given circle but is not 
the center C. Consider a point P that moves in such a 
way as to be equidistant from Fand the circle. Show that 
the path of P is an ellipse. 

15 Show that the point on an ellipse that is closest to a fo¬ 
cus is the end of the major axis nearest that focus, and 
also that the point on the ellipse farthest from this focus 
is the other end of the major axis. 

16 The apogee of an earth satellite is its maximum altitude 
above the surface of the earth during orbit, and its perigee 
is its minimum altitude during orbit.* If R is the radius 
of the earth, use Problem 15 to show that if a satellite 
has an elliptical earth orbit with the center of the earth 
at one focus and semimajor axis a, then 

2a = 2R + apogee + perigee. 

17 The point of the orbit o f a planet nearest the sun is called 
the perihelion, and the point farthest from the sun is 
called the aphelion. If the ratio of the earth’s distance 
from the sun at perihelion to its distance at aphelion is 
H, find the eccentricity of the earth’s orbit. 

18 A line segment moves with one end A on the y-axis and 
the other end B on the x-axis. A point P fixed on the seg¬ 


ment is a units from A and b units from B. Find the equa¬ 
tion of the path of P. 

19 (a) Show that the length of the part of the ellipse 

x 2 la 2 + y 2 /b 2 = 1 (a> b) that lies in the first quad¬ 
rant is 



(b) Use the change of variable x = a sin 6 to transform 
the integral in (a) into 

a f Vl - e 2 sin 2 6 dO. 

Jo 

This is called a complete elliptic integral of the sec¬ 
ond kind, and cannot be evaluated by means of ele¬ 
mentary functions. 

20 Let P be a point on the ellipse x 2 !a 2 + y 2 /b 2 = 1 that 
does not lie on either axis. If a > b, show without using 
calculus that the distance from P to the origin is greater 
than b and less than a. 

21 There are exactly two lines with given slope m that are 
tangent to the ellipse x 2 ta 2 + y 2 /b 2 = 1. Find their equa¬ 
tions. 

22 Consider two circles centered at the origin with radii a 
and b, where b < a. Draw a half-line from the origin in¬ 
tersecting the smaller circle at Q and the larger circle at 
R. If the horizontal line through Q and the vertical line 
through R intersect at P = (x, y), show that P lies on the 
ellipse x 2 /a 2 + y 2 /b 2 = 1. Flint: Let the coordinates of Q 
and R be ( q , y) and (jc, r), and use the fact that Q and R 
lie on a line through the origin. 

23 Let C\ and C 2 be circles in the same plane with radii r\ 
and r 2 . Assume that r\ > r 2 and that C\ surrounds C 2 , 
but that C\ and C 2 are not concentric. Show that the cen¬ 
ters of all circles that lie between C\ and C 2 and are tan¬ 
gent to both lie on an ellipse whose foci are the centers 
of C\ and C 2 (Fig. 15.24). What is the length of the ma¬ 
jor axis? 



’“These words are also used to mean the corresponding points of the 
orbit. 


Figure 15.24 
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The ideas of Section 15.1 allow us to define a hyperbola as the locus of a point 
P that moves in such a way that the difference of its distances from two fixed 
points F and F' (called the foci ) is constant. If this constant is denoted by 2a, 
with a > 0, then a little thought will show that the locus consists of two branches , 
as shown in Fig. 15.25, where the right branch is the locus of the equation PF' - 
PF — 2 a and the left branch is the locus of the equation PF - PF ' = 2a. The 
defining condition for the complete hyperbola can therefore be written as 


15.4 

HYPERBOLAS 


PF'-PF=±2a. (1) 

To find a simple equation for the hyperbola, we take the x-axis along the seg¬ 
ment FF' and the y-axis as the perpendicular bisector of this segment. If 2c de¬ 
notes the distance between Fand F f , then F — (c, 0) and F f = (—c, 0), as shown 
in Fig. 15.25, and (1) becomes 

V(* + cf + y 2 - V(jc — c ) 2 + y 2 = ±2a. 

By moving the second radical to the right side, squaring, and simplifying, we ob¬ 
tain the focal radius formulas 

x ~ a j (2) 


PF = V(jc — c) 2 + y 2 = ± 



and 


PF' = V(x + c) 2 + y 2 


±\— x + a 
\a 


)■ 


(3) 


where (3) follows from (2) because PF’ — ±2a + PF. As in (1), the plus signs 
here correspond to the right branch of the curve, and the minus signs to the left 
branch. By squaring and simplifying, either of these equations gives 



a 


2 


or 


= 1 . 


(4) 


To simplify this equation still further, we begin by observing that in the triangle 
PF'F with P on the right branch we have PF' < PF + FF', because one side of 
a triangle is less than the sum of the other two sides. Therefore PF' — PF < FF', 
or 2 a < 2c, so a < c and c 2 - a 2 is a positive number which we denote by b 2 . 


This enables us to write (4) as 


b 2 = 


i 1 1 *) 


b 2 


(5) 


( 6 ) 


which is the standard form of the equation of the hyperbola shown in Fig. 15.25. 

We now turn to a careful consideration of equation (6) and the light it sheds 
on the nature of the hyperbola it represents. Our discussion will reveal several 
additional features of this curve that are not obvious from the definition and that 
are indicated in greater detail in Fig. 15.26. 
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Figure 15.26 Features of a hyperbola. 



Since the equation contains only even powers of x and y, the hyperbola is sym¬ 
metric with respect to both coordinate axes. They are therefore called the axes 
of the curve, and their intersection is called the center. The left-right, up-down 
symmetry is perhaps the only feature of the hyperbola that is easy to see directly 
from the definition. 

Wheny = 0, the equation gives x = ±a, but when x = 0, y is imaginary. There¬ 
fore the axis through the foci, called the principal axis, intersects the curve at 
two points called the mrtices, which are located at a distance a on each side of 
the center; but the other axis, called the conjugate axis, does not intersect the 
curve at all. The hyperbola thus consists of two separate parts, its symmetrical 
branches, on opposite sides of the conjugate axis. 

These facts are easier to see if equation (6) is solved for y, 



(7) 


a 


This formula shows that there are no points of the graph in the vertical strip — a < 
x < a, because for these jc’s the quantity inside the radical is negative. When x = 
±a, (7) yields y = 0; these two points are the vertices. And now, as x increases 
from a or decreases from — a , we get two distinct values of y that increase nu¬ 
merically as x moves farther to the right or left; this behavior produces the up¬ 
per and lower arms of each branch of the curve. 

A very significant feature of the graph can be observed by writing (7) in the 
form 



( 8 ) 


When x is numerically large, the quantity inside the radical in (8) is nearly 1, 
and for this reason it appears that the hyperbola is very close to the pair of straight 
lines 


± — x. 
a 


b 


(9) 
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We can verify this guess as follows. In the first quadrant, if x is large, then the 
vertical distance from the hyperbola up to the corresponding line is 

— x — — Vx 2 — a 2 = — (x — Vx 2 — a 2 ) 
a a a 

_ b_ (x - V^c 2 — a 2 )(x + Vx 2 - « 2 ) 
a x + Vx 2 — a 2 

_ _ ab 

x + Vx 2 — a 2 

This clearly approaches zero as x —» «>. The lines (9) are therefore called the 
asymptotes of the hyperbola. The asymptotes provide a convenient guide for 
sketching a hyperbola whose equation is given: Simply plot the vertices, draw 
the asymptotes, and fill in the two branches of the curve in a reasonable way, as 
suggested by the figure. 

The triangle shown in the first quadrant of Fig. 15.26 is a convenient mnemonic 
device for remembering the main geometric features of a hyperbola. Its base a 
is the distance from the center to the vertex on the right; its height b is the dis¬ 
tance from this vertex up to the asymptote in the first quadrant, whose slope is 
b/a\ and since (5) tells us that 

c 2 = a 2 + b 2 , 


the hypotenuse c of this triangle is also the distance from the center to a focus. 
The ratio da is called the eccentricity of the hyperbola, and is denoted by e: 


It is clear that e > 1. When e is near 1, then b is small compared with a , and the 
hyperbola lies in a small angle between the asymptotes. When e is large, then b 
is large compared with a , the angle between the asymptotes is large, and the hy¬ 
perbola is rather flat at the vertices. 

To understand the significance of the eccentricity, we consider again formulas 
(2) and (3) for the right and left focal radii PF and PF'. These formulas can be 
written as 


PF = ± (ex — a) = ±e 


x 


a_ 

e 


( 10 ) 


and 


PF' = ±(ex + a) = ±e 




( 11 ) 


where the plus signs apply to the right branch of the curve (see Fig. 15.27) and 
the minus signs to the left branch. If P lies on the right branch, as shown in the 
figure, then the quantities in brackets can be interpreted as the distances PD and 
PD' from P to the lines x = a/e and x = -a/e, respectively. The same statement 
is true if P lies on the left branch, if the effect of the minus signs is properly 
taken into account. Therefore, in all cases formulas (10) and (11) can be written 
in the form 


PF 

PD 


e 


PF' 

PD' 


e. 


and 


( 12 ) 
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Figure 15.27 



Each of the lines x = ale and x — —ale is called a directrix of the hyperbola. 
Equations (12) show that a hyperbola can be characterized as the locus of a point 
that mows in such a way that the ratio of its distance from a fixed point (a fo¬ 
cus) to its distance from a fixed line (the corresponding directrix) equals a con¬ 
stant e > 1. Just as in the case of ellipses, this way of characterizing hyperbolas 
will be needed in our future work. 

By interchanging the roles of x and y in the preceding discussion, we find that 
the equation 


represents a hyperbola with vertical principal axis, vertices at (0, ±a), and foci 
at (0, ±c), where c 2 = a 2 4- b 2 . This time the asymptotes are the lines 


y = 



as we easily see by writing (13) in a form solved fory, 

y= ±£ V* 2 + b 2 = ±^x.I 7+K. 
b b V 

Notice that the axis containing the foci of a hyperbola is not determined by the 
relative size of a and b, as it was in the case of an ellipse, but rather by which 
term is subtracted from which in the standard form of the equation. The num¬ 
bers a and b can therefore be of any relative size. In particular they can be equal, 
in which case the asymptotes are perpendicular to each other and the hyperbola 
is called rectangular. The equations 

x 2 — y 2 = a 2 and y 2 — x 2 = a 2 

represent rectangular hyperbolas. 


Example 1 Find the equation of the hyperbola with foci (±6, 0) and the lines 
5 y = ±2V5 x as asymptotes. 
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Solution First, the location of the foci tells us that the principal axis is the 
x-axis. We see that c — 6 and bla - fV5, so a = (V5/2 )b. Since a 2 + b 2 = c 2 , 
we have \b 2 + b 2 — 36, so b 2 = £ • 36 = 16 and a 2 — f b 2 = | • 16 = 20. This 
shows that 

20 16 


is the equation of the hyperbola. 


Example 2 Determine the principal axis of the hyperbola 6y 2 -- 9x 2 = 36 and 
find its vertices, foci, and asymptotes. 


Solution The equation can be put in the standard form 



so the principal axis is the y-axis, a 2 = 6, b 2 = 4, and c 2 = a 2 + b 2 = 10. Hence 
the vertices are (0, ± V6), the foci are (0, ± V70), and the asymptotes are y = 

±(V6/2)jc. 


Just as in the case of the ellipse, we can easily write the equation of a hyper¬ 
bola with center (/*, k) and principal axis parallel to one of the coordinate axes. 
The equation is 

(x - h ) 2 (y - k) 2 _ , (y - k) 2 (x - h) 2 _ , 

-^— 1 or -^ =1 - 

according as the principal axis is horizontal or vertical. This suggests that we 
consider equations of the form 

Ax 2 + By 2 + Cx + Dy + E = 0, 

where A and B have opposite signs. Such an equation will usually represent a 
hyperbola, but in certain special cases it may represent a pair of intersecting 
straight lines. The next example illustrates these possibilities. 

Example 3 Identify the graph of 

\6x 2 - 9y 2 - 64x- 18y + E = 0 

for various values of E. 

Solution The procedure is to complete the square on the x and y terms, which 
yields 

16(jc 2 — Ax) — 9(y 2 + 2y)= -E 
and 

1 6(jc — 2) 2 — 9(y + 1) 2 = 55 - E. 

There are now three cases. 

case 1 55 — E > 0; for example, E = —89, so that 55 — E = 144. In this case 

we have 
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Figure 15.28 


U-2) 2 (y + l) 2 _ 

9 16 

which is a hyperbola with center (2, —1) and horizontal principal axis. 

case 2 55 — E < 0; for example, E = 199, so that 55 — E = —144. Here we 

have 

(y + l) 2 (x ~ 2) 2 
16 9 

which is a hyperbola with center (2, —1) and vertical principal axis. 

case 3 55 — E - 0; E = 55. This time our equation becomes 

1 6(x — 2) 2 - 9(y + l) 2 = 0 
or 

4(jc- 2) = ±3(y + 1). 

This represents the two lines 

y + l = ±fU - 2 ), 

which are the asymptotes in the first two cases. 


Remark 1 Hyperbolas have the following reflection property: The tangent line 
at any point P on a hyperbola bisects the angle between the focal radii PF and 
PF'. This means that a = in the notation of Fig. 15.28 (see Problem 21). As 
a consequence of this, if the hyperbola is revolved about its principal axis to form 
a surface of revolution, and if the convex sides of each part are silvered to make 
them reflecting surfaces, then any ray of light that approaches a convex side along 
a line pointing toward a focus (Fig. 15.28, right) is reflected toward the other fo¬ 
cus. 

This property of hyperbolas is the essential principle in the design of reflect¬ 
ing telescopes of the Cassegrain type (Fig. 15.29). As the figure shows, one fo¬ 
cus of the hyperbolic mirror is at the focus of the parabolic mirror and the other 
is at the vertex of the parabolic mirror, where an eyepiece or camera is located. 


The reflection property. 








15.4 HYPERBOLAS 


549 



Other focus 
of hyperbolic 
mirror 


Figure 15.29 Design of Cassegrain 
telescope. 


Faint parallel rays of starlight are therefore reflected off the parabolic mirror to¬ 
ward its focus, then are intercepted by the hyperbolic mirror and reflected back 
toward the eyepiece or camera. 

Remark 2 There are two kinds of comets. Some are permanent members of the 
solar system, like Halley’s Comet described in Section 15.3, and travel forever 
around the sun in elliptical orbits with the sun at one focus. Others enter the 
solar system at high speeds from outer space, swing around the sun in hyper¬ 
bolic orbits with the sun at one focus, and then escape into outer space again. 
The crucial factor is the total energy E of the comet itself, which is the sum of 
the kinetic energy due to its motion and the potential energy due to the gravita¬ 
tional attraction of the sun. It turns out that if E < 0, the orbit is an ellipse, and 
if E > 0, the orbit is a hyperbola. (The case E — 0 corresponds to a parabolic 
orbit, but this is exceedingly unlikely.) 


PROBLEMS 


In Problems 1-8, sketch the graph of the given hyperbola and 17 
find the vertices, foci, asymptotes, eccentricity, and directri- 18 
ces. 19 


1 

1 

II 

7 

11 

1 

20 

l 

36 16 

21 

3 

II 

1 

4 

* 2 y 2 _ 

22 


4 9 


25 16 


5 

4y 2 -x 2 = 16. 

6 

* 2 - 3y 2 = 12. 


7 

y 2 - x 2 = 1. 

8 

*2 9y 2 = 1. 

23 


In Problems 9-16, find the equation of the hyperbola deter¬ 
mined by the given conditions. 

9 Foci (0, ±5), vertex (0, 3). 

10 Vertices (±3, 0), focus (5, 0). 

11 Vertices (±3,0), asymptote y = 2*. 24 

12 Foci (— 1,8) and (— 1, — 2), vertex (-1, 7). 

13 Foci (±8,0), e = y. 

14 Vertices (0, ±5), e = 2. 

15 Vertices (±6, 0), directrix x = 4. 

16 Foci (1,1) and (-1, -1), difference of focal radii ±2. 


In Problems 17-20, identify the graph of the given equation 
as in the discussion of Example 3. 


\6x 2 - 3y 2 - 32* - 12 y - 44 = 0. 

9y 2 - lx 2 + 12y - 70* - 94 = 0. 

36* 2 - 25y 2 + 144* - 50y + 119 = 0. 

11 y 2 - 12* 2 + 88y + 72* + 300 = 0. 

Show that a = (3 in Fig. 15.26. 

Let F be a point which is outside a given circle. Con¬ 
sider a point P that moves in such a way as to be equidis¬ 
tant from F and the circle. Show that the path of P is one 
branch of a hyperbola. 

Suppose that an ellipse and a hyperbola are confocal, that 
is, have the same foci F and F' . Use the reflection prop¬ 
erties of the two curves to give a purely geometric proof 
that they are perpendicular to each other at every point 
P of intersection. 

(a) Show that 


r 


25 -k 16 -k 


represents an ellipse if k < 16 and a hyperbola if 
16 < k < 25, and that all these curves are confocal. 
(b) Find the first-quadrant point of intersection of the 
curves given by k = 0 and k = 20 , and find the tan¬ 
gent line to each curve at this point. 
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(c) Show that the tangent lines in part (b) are perpen¬ 
dicular to each other by showing that the product of 
their slopes is —1. 

25 Consider two hyperbolas with the same eccentricity e, 
both centered at the origin and both with principal axis 
on the *-axis. Suppose that their equations are 




2 


a\ 


2 



and 



28 If two hyperbolas x 2 la 2 — y 2 lb 2 = 1 and y 2 /A 2 - 
x 2 /B 2 = 1 have the same asymptotes, show that their ec¬ 
centricities e and E are related by the equation 


29 Show that the line tangent to the hyperbola x 2 /a 2 - 
y 2 /b 2 = 1 at the point Pi = (*i,yi) is 


Show that these hyperbolas are similar in the sense that 

(a) there exists a constant k such that 

£L = h. = £l = k . 
a 2 b2 c 2 

(b) if a half-line from the origin O intersects the first hy¬ 
perbola at Pi and the second at P 2 , then 


26 Find the locus of the centers of all circles that are tan¬ 
gent to the y-axis and cut off a segment of length 2 a on 
the *-axis. 

27 Let F and F' be two points on a sheet of paper whose 
distance apart is 2c. Take a piece of string and tie a knot 
K in it so that the difference between the lengths of the 
two parts into which K divides the string is 2a, where 
0 < a < c. Tie the ends of the string to two tacks placed 
at F and F\ and loop the string around the point of a 
pencil, as shown in Fig. 15.30. If the string is held taut 
and the knot K is carefully pulled, show that the pencil 
at P draws one branch of a hyperbola. 



Figure 15.30 


**1 yy\ = 1 

a 2 b 2 

30 If tangent lines to the hyperbola x 2 /25 - y 2 /16 = 1 in¬ 
tersect the y-axis at (0, 8), find the points of tangency. 

31 If tangent lines to the hyperbola x 2 /a 2 - y 2 lb 2 = 1 in¬ 
tersect the y-axis at (0, d ), find the points of tangency. 

32 A line through a point P on a hyperbola and parallel to 
the nearest asymptote intersects the nearest directrix at 
Q. If F is the corresponding focus, show that PQ = PF. 

33 Let Ci and C 2 be circles in the same plane with differ¬ 
ent radii r\ and r 2 . Assume that C\ and C 2 do not inter¬ 
sect and that neither surrounds the other. Show that the 
centers of all circles that are outside both, do not sur¬ 
round either, and are tangent to both, lie on one branch 
of a hyperbola whose foci are the centers of Ci and C 2 
(Fig. 15.31). 



15.5 

THE FOCUS-DIRECTRIX- 
ECCENTRICITY 
DEFINITIONS 


Students have already seen that there are several distinct but equivalent ways of 
defining the conic sections, each with its own merits. We began with the defini¬ 
tion by means of a given cone and a slicing plane that cuts through the cone more 
or less steeply, yielding our three types of curves by varying the degree of steep¬ 
ness. This three-dimensional approach is vivid and geometric, and provides a 
clear visual impression of what the curves look like. However, for the purpose 
of obtaining Cartesian equations for use in precise quantitative studies, we needed 
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two-dimensional characterizations, and for this the focal properties discussed at 
the end of Section 15.1 turned out to be convenient. The concepts of eccentric¬ 
ity and directrix emerged in the course of our detailed work on ellipses and hy¬ 
perbolas, and we saw that each of these curves can be given yet another two- 
dimensional characterization by means of a focus, a directrix, and an eccentricity. 
Our purpose in this brief section is to show that all three of the conic sections— 
parabolas, ellipses, and hyperbolas—can in this way be given unified definitions 
that depend directly on our original concept of these curves as sections of a cone.* 

Our discussion is based on Fig. 15.32, which shows a cone with vertex angle 
a and a slicing plane with tilting angle /3. This tilting angle can be defined as 
the angle between the axis of the cone and a normal line to the plane, but it plays 
its main role in our argument as the indicated acute angle of the right triangle 
PQD. The figure is drawn to illustrate the case of an ellipse, but the argument is 
valid for the other cases as well. 

We begin at the beginning. Let there be inscribed in the cone a sphere which 
is tangent to the slicing plane at a point F, and tangent to the cone along a cir¬ 
cle C. If d is the line in which the slicing plane intersects the plane of the circle 
C, we shall prove that the conic section has F as its focus and d as its directrix, 
and the facts about the eccentricity will emerge in the course of our discussion. 

To this end, let P be a point on the conic section, let Q be the point where the 
line through P and parallel to the axis of the cone intersects the plane of C, let 
R be the point where the generator through P intersects C, and let D be the foot 
of the perpendicular from P to the line d. Then PR and PF are two segments 
which are tangent to the sphere from the same point P, and therefore have the 
same length, 

PR = PF. (1) 


Tor reasons that will soon be clear, circles must be excluded from this discussion, because the nec¬ 
essary geometric constructions are not possible when the slicing plane is perpendicular to the axis of 
the cone. 



Figure 15.32 
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Also, from the right triangle PQR we have 

PQ = PR cos a\ 

and from the right triangle PQD we have 

PQ = PD sin (3. 


It follows that 


PR cos a = PD sin (3 , 


so 


PR _ sin /3 
PD cos a 


In view of (1) this means that 

PF _ sin (3 
PD cos a 

This can be written in the slightly more convenient form 

PP = cos y . 

PD cos a’ K ) 

where y is the other acute angle in the right triangle PQD. If we now define the 
eccentricity e by 


cos y 
£ » 
cos a 

then this number is constant for a given cone and a given slicing plane, and (2) 
becomes 


pp f< 1 for an ellipse, 

= e\= 1 for a parabola, 

[> 1 for a hyperbola, 

where the statements on the right are easily verified by inspecting the figure. 
Thus, for a parabola, we see that PD is parallel to a generator of the cone, so 
y = a and e — 1 ; for an ellipse, we have y > a, so cos y < cos a and e < 1 ; and 
for a hyperbola, we have y < a, so cos y > cos a and e > 1. 

The words “parabola,” “ellipse,” and “hyperbola” come from three Greek 
words meaning “a comparison,” “a deficiency,” and “an excess,” referring to the 
fact that for the corresponding curves we have e = 1, e < 1, and e > 1. One 
should also compare these words with the words “parable,” “ellipsis,” and “hy¬ 
perbole” in modem English. 


15.6 

(OPTIONAL) SECOND- 
DEGREE EQUATIONS. 
ROTATION OF AXES 


The general equation of the second degree in x and y is 

Ax 2 + Bxy + Cy 1 + Dx + Ey + F = 0, (1) 

where at least one of the coefficients A, B , C is different from zero. The latter 
requirement, of course, guarantees that the degree of the equation really is 2, 
rather than 1 or 0. In the preceding sections we have found that circles, parabo- 
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las, ellipses, and hyperbolas are all curves whose equations are special cases of 
(1). Thus, for example, the circle 

U - h) 2 + (y - k) 2 = r 2 
can be obtained from (1) by taking 

A = C= 1, B = 0, D=-2h, E = —2k, 

F = h 2 + k 2 — r 2 , 


and the parabola 


by taking 


A = 1 , 


x 2 = 4 py 


E = -4 p, B = C = D=F = 0. 


In addition to the conic sections mentioned here, we have also noted various “ex¬ 
ceptional cases” that can arise as graphs of (1) from special choices of the co¬ 
efficients. Thus, the graph of 

x 2 T y 2 — 0 

is a point, and the graph of 

x 2 + y 2 -f- \ =0 

is the empty set. Further, the graph of 

jt 2 = 0 


is a single line, namely, the y-axis, and the graph of 

x 2 - y 2 = 0, or equivalently (x + y)(x - y) = 0, 

is a pair of lines, namely, x + y = 0 and x — y = 0. Our purpose in this section 
is to investigate the full range of possibilities of the curves represented by (1). 
Briefly, we shall find that the eight graphs we have just listed exhaust all possi¬ 
bilities: 


The graph of every second-degree equation of the form (1) is a circle, a parabola, an 
ellipse, a hyperbola, a point, the empty set, a single line, or a pair of lines. 


The main problem before us is posed by the so-called mixed term Bxy in (1), 
because when this term is present we have no idea how to identify the graph. No 
such terms have arisen in our previous work on the conic sections. The reason 
for this is that in every case we have been careful to choose the coordinate axes 
in a simple and natural position, so that at least one axis is parallel to an axis of 
symmetry of the curve under discussion. In order to see what can happen when 
a curve is placed in a skew position relative to the axes, let us find the equation 
of the hyperbola (see Fig. 15.33) with foci F — (2, 2) and F' = (—2, —2), where 
PF f - PF= ±4. We have 

V(x + 2) 2 + (y + 2) 2 - V(jc -2) 2 + (y - 2) 2 = ±4, 

and when we move the second radical to the right side, square, solve for the rad¬ 
ical that still remains, and square again, this reduces to 



*y = 2 - 


( 2 ) 


Figure 15.33 
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This is really a very simple equation, but nevertheless it does provide a special 
case of (1) in which the mixed term is present. The asymptotes of the hyperbola 
(2) are evidently the x- and y-axes, and its principal axis is the line y - x, which 
makes a 45° angle with the x-axis. It will become clear that a mixed term is pre¬ 
sent only when a curve is “tilted” in this way with respect to the coordinate axes, 
and also that this term can be removed by rotating the axes to “untilt” the curve. 
In the case of (2), it is easy to see by looking at the figure that this curve can be 
untilted by rotating the axes through a 45° angle in the counterclockwise direc¬ 
tion. 

To construct the machinery that is necessary for carrying out an arbitrary ro¬ 
tation of axes, we start with the yy-system and rotate these axes counterclock¬ 
wise through an angle 6 to obtain the x'y'- system, as shown in Fig. 15.34. A 
point P in the plane will then have two pairs of rectangular coordinates, (x, y) 
and (V, y'). To see how these coordinates are related, we observe from the fig¬ 
ure that 


x = OR = OQ - RQ = OQ - ST 


= x ' cos 9 — y' sin 9 


and 


y = RP = RS + SP = QT + SP 


= x' sin 0 + y' cos 6. 


We write these equations together for convenient reference, 

x = x' cos 0 — y sin 6, 
y = x' sin 9 + y' cos 9, 


(3) 


they are called the equations for rotation of axes. For example, if 9 = 45°, then, 
since sin 45° = cos 45° = {V2 = 1/V2, we have 


= * 

V2 


x 1 + / 

V2 


(4) 


And for another example, if 9 = 30°, then since sin 30° = \ and cos 30° = 
{V3, we have 


Vlx’ - 


y = • 


x’ + V3y' 


(5) 


As a simple illustration of the use of these equations, we substitute (4) into 
(2) and obtain 



This is immediately recognizable as a rectangular hyperbola whose principal axis 
is the x'-axis. Of course, we already knew this from the way (2) was obtained. 
However, if we had started with (2) without knowing anything about the nature 
of its graph, then this procedure for removing the mixed term would have en¬ 
abled us to identify the curve without difficulty. 

In the case of equation (2), the 45° rotation represented by equations (4) 
worked. But how could we have known this in advance? Can we be sure that a 
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suitable rotation will always remove the xy term if one is present? And if so, how 
do we find a suitable angle of rotation? 

To answer these questions we return to the general second-degree equation (1), 

Ax 2 + Bxy + Cy 2 + Dx + Ey + F = 0, 

and we apply the general rotation (3) through an unspecified angle 6 , which 
yields 

A(x' cos 8 - y’ sin Oj 2 + B(x ' cos 8 - y' sin 8)(x' sin 8 + y' cos 8) 

+ C(x' sin 8 + / cos 0) 2 + D(x' cos 6 — y' sin 8) 

+ E(x' sin 8 + y' cos 8) + F = 0. 

When we collect coefficients for the various terms, we get a new equation of the 
same form, 

A'x' 2 + B'x'y' + C'y' 2 + D'x' + E'y' + F’ = 0, (6) 

with new coefficients related to the old ones by the following formulas: 


A 

= A cos 2 8 + B sin 8 cos 8 + C sin 2 

8 , 

B' 

= -2 A sin 8 cos 8 + £(cos 2 8 - sin 

2 8) + 2C sin 8 cos 8 , 

C 

= A sin 2 8 - B sin 8 cos 8 + C cos 2 

8 , 

D' 

= D cos 8 + E sin 8 , 


E' 

= —D sin 8 + E cos 8 , 



F' = F. 

We have written down all these formulas for future reference, but for the mo¬ 
ment we are only interested in B'. If we start out with a second-degree equation 
(1) in which the mixed term is present, B ^ 0, then we can always find an angle 
6 of rotation such that the new mixed term is eliminated. To find a suitable an¬ 
gle 6, we simply put B' = 0 in (7) and solve for 8. To do this most easily, we 
use the double-angle formulas 

sin 28 = 2 sin 8 cos 8 


and 


cos 28 = cos 2 8 - sin 2 8 


to write 


B' = B cos 28 + (C - A) sin 28. 
Then B' = 0 if we choose 6 so that 


™ A - C 
cot 28 = —-—. 
B 


( 8 ) 


Since we are assuming that B =£ 0, it is clear that this is always possible, and fur¬ 
thermore that 6 can always be chosen in the first quadrant, 0 < 6 < 7t/2. 


Example 1 Determine the nature of the curve whose equation is 

4x 2 + 2^y + 4y 2 = 15. 


( 9 ) 



556 


CONIC SECTIONS 


Solution Here we have A — 4, B — 2, and C — 4. The mixed term will be re¬ 
moved by choosing 0 according to (8), which in this case gives 

cot 20 — 0, 20 = 90°, 0 = 45°. 

We therefore substitute equations (4) into (9) and obtain 

'2 '2 

5x' * 1 2 3 4 + 3y' 2 = 15 or = 1 

after simplification. This is clearly an ellipse with its foci on the new /-axis. 

We observe that when B ^ 0 and A = C, a rotation through 45° is always ap¬ 
propriate. 

Example 2 Determine the nature of the curve whose equation is 

1 \x 2 + 10V3.ry + y 2 - 32 = 0. (10) 


Solution Here we have 

cot 20 = — = _L , 20= 60°, 0 = 30°. 

10V3 V3 

In this case we use equations (5), which transform (10) into 
\6x' 2 — 4y' 2 - 32 = 0 or -= 1 

2 o 

after simplification. This is a hyperbola with its principal axis along the new 
x'-axis. 


We now return to our original problem of classifying all possible graphs of the 
second-degree equation (1). Since the axes can always be rotated to eliminate the 
mixed term, there is no loss of generality in assuming that this has been done. 
We are therefore confronted by equation (6) with B' = 0, and we drop the primes 
to simplify the notation, 

Ax 2 + Cy 2 + Dx + Ey + F = 0. (11) 

Our experience in the preceding sections enables us to distinguish four cases and 
to be certain that there are no others. The graph of equation (11), with B — 0, is: 

1 A circle if A = C =£ 0. In special cases the graph can be a single point or the 
empty set. 

2 An ellipse if A and C are both positive or both negative and A =£ C. Again, 
in special cases the graph can be a single point or the empty set. 

3 A hyperbola if A and C have opposite signs. In special cases the graph can 
be a pair of intersecting straight lines. 

4 A parabola if either A = 0 or C = 0 (but not both). In special cases the graph 
can be one straight line, or two parallel lines, or the empty set. 
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In Chapter 16 we will meet many equations of the third degree, fourth degree, 
etc. The exhaustive list given here of possible graphs of second-degree equations 
is relatively simple and stands in sharp contrast to the wilderness of bizarre curves 
that awaits us in connection with these higher-degree equations. 


PROBLEMS 


In Problems 1-11, determine and carry out a suitable rotation 
of axes to eliminate the mixed term, find the new equation, 
and identify the curve. 

1 5;t 2 - 6 xy + 5y 2 = 8 . 

2 x 2 — 2 xy + y 2 + x + y = V2. 

3 2x 2 + 4\/3* y — 2 y 2 = 8 . 

4 11 * 2 + 4V3jcy + 7 y 2 - 65 = 0. 

5 x 2 + 2xy + y 2 + 8 * - 8 y = 0 . 

6 x 2 ~ 3 *y + y 2 = 10. 

7 3 * 2 + 2xy + 3 y 2 = 8 . 

8 x 2 + 2\/3 *y + 3 y 2 + 2\/3* — 2y = 0. 

9 5*2 -I- 4V3*y -I- 9y 2 = 33 . 

10 3*2 + 4\/3*y - y 2 = 30. 

*11 6*2 - 6xy + 14y 2 = 5 . 

12 An ellipse has foci (1,0) and (0, V3) and passes 
through the point ( — 1,0). Use the focus definition to 
find its equation. Through what angle should the axes 
be rotated to eliminate the mixed term from this equa¬ 
tion? 

13 Use equations (7) to show that B 2 - 4AC = B' 2 - 
4A'C'. For this reason, the number B 2 — 4 AC is said to 
be invariant under rotations . 

1 4 The number B 2 — 4 AC i s called the discriminant of 
equation (1). To understand why, rotate the axes to re¬ 
move the mixed term and use Problem 13 to show that 


B 2 - 4AC 


<0 
= 0 
>0 


for circles and ellipses, 
for parabolas, 
for hyperbolas, 


where the various special cases are considered to be¬ 
long to the appropriate categories. 

15 Verify the statement in Problem 14 for Problems 1-11. 

16 Asa check on the equations for rotation of axes, show 
that the equation of a circle centered at the origin, * 2 + 
y 2 = r 2 , is unchanged in form when the axes are rotated 
through an arbitrary angle 8. 

17 If a rotation of axes through an angle 8 is followed by 
a rotation through an angle (/>, this obviously amounts 
to a rotation through the angle 8 + c/>. Use the formu¬ 
las 

* = *' cos 8 — y sin 8 
y = *' sin 8 + y' cos 8 

and 

*' = *" cos <f) — y" sin </> 
y' = *" sin cf) + y" cos </> 

to show that 

* = *" cos ( 8 + <£) — y" sin (8 A <£) 
y = x " sin (8 + 4>) T y" cos (8 + <£). 

18 Show that a 45° rotation of axes transforms the equa¬ 
tion * 4 + 6* 2 y 2 + -y4 — 32 j nt0 x >4 + y 4 = 16 Sketch 
the curve and both sets of axes. 


CHAPTER 15 REVIEW: DEFINITIONS, PROPERTIES 


Think through the following. 

1 Conic sections from a cone. 

2 Parabola, focus, directrix, axis, vertex. 

3 Equation of parabola—standard form. 

4 Reflection property of parabolas. 

5 Ellipse, foci, major and minor axes, eccentricity. 


6 Equation of ellipse—standard form. 

7 Reflection property of ellipses. 

8 Hyperbola, foci, principal axis, asymptotes, eccentricity. 

9 Equation of hyperbola—standard form. 

10 Reflection property of hyperbolas. 

11 Focus-directrix-eccentricity definitions. 
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CONIC SECTIONS 


ADDITIONAL PROBLEMS FOR CHAPTER 15 

SECTION 15.2 

1 The chord of a parabola through the focus and perpen¬ 
dicular to the axis is called the latus rectum. If (as usual) 
p is the distance between the vertex and the focus, find 
the length of the latus rectum. 

2 Find the equation of the circle tangent to the directrix of 
the parabola x 2 = 4 py with the focus of the parabola as 
its center. What are the points of intersection of the 
parabola and the circle? 

3 Show that every upward-opening parabola with focus at 
the origin has an equation of the form x 2 = 4 p(y + p), 
and every downward-opening parabola with focus at the 
origin has an equation of the form x 2 = —4p(y - p), 
where p and p are positive constants. 

4 Show that every upward-opening parabola with focus at 
the origin intersects at right angles every downward¬ 
opening parabola with focus at the origin. 

5 Let P be a point on the parabola x 2 = 4 py other than the 
vertex. If Q and R are the points at which the tangent and 
normal at P intersect the axis of the parabola, and if S is 
the foot of the perpendicular from P to this axis, then the 
segments QS and RS are called the subtangent and sub¬ 
normal. 

(a) Show that the vertex V bisects the subtangent. 

(b) Show that the subnormal has constant length 2 p. 

(c) Show that P and R are the same distance from the 
focus F. 

(d) Show that P and Q are the same distance from the 
focus F, so that F bisects the segment QR. 

(e) If the tangent at P intersects the directrix at a point 
T, show that PFT is a right angle. 

(f) If the tangent at P intersects the tangent at V (the 
*-axis) at a point U , show that PDF is a right angle. 

6 Show that the reflection property of parabolas follows 
easily (without calculation) from part (d) of Problem 5. 

7 Show that the vertex is the point on a parabola that is 
closest to the focus. 

SECTION 15.3 

8 The reflection property of ellipses is an easy consequence 
(without calculation) of the following geometric prop¬ 
erty of ellipses. In Fig. 15.35, let the line Tbe tangent at 
P to the ellipse with foci F and F'. Let G and G' be the 
reflections in T of F and F\ so that T is the perpendic¬ 
ular bisector of the segments FG and F'G'. Then, as sug¬ 
gested by the figure, the segments FG' and F'G inter¬ 
sect at P. To prove this, let Q be any point on T different 
from P and verify that 

(a) FP + PF' < FQ + QF'\ 

(b) FP + PG' <FQ + QG'\ 

(c) P lies on the segment FG'. 



Similarly, P lies on the segment F'G , so it is the point 
of intersection of these segments. Finally, use this fact to 
infer that the angles FPH and F' PH' are equal. 

9 A barrel has the shape of a prolate spheroid with its ends 
cut off by planes through the foci. If the barrel is 4 ft 
high and the diameter of its top (and bottom) is 2 ft, find 
the volume. 

K) A latus rectum of an ellipse is a chord through a focus 
and perpendicular to the major axis. If the equation of 
the ellipse is x 2 ta 2 + y 2 /b 2 = 1 with a> b, show that 

(a) the length of a latus rectum is 2b 2 /a\ 

(b) the slope of the tangent line at the upper end of the 
latus rectum to the right of the y-axis is — e\ 

(c) the tangent line in part (b) intersects the corre¬ 
sponding directrix on the *-axis. 

11 Show that every parabola of the form y = Ax 2 intersects 
every ellipse of the form x 2 T- 2 y 2 = B at right angles. 

12 Suppose that the tangent to an ellipse at a point P inter¬ 
sects a directrix at a point Q. If F is the corresponding 
focus, show that PFQ is a right angle. 

13 Let P be a point on the ellipse x 2 /a 2 + y 2 /b 2 = 1 and let 
Q be the point where the tangent at P intersects the line 
x = -a. If A is the point (a, 0), show that AP is parallel 
to OQ. 

14 Show that the product of the distances from the foci of 
an ellipse to a tangent has the same value for all tangents. 
Hint: Use Additional Problem 21 in Chapter 1. 

SECTION 15.4 

15 Show that the product of the distances from the foci of 
a hyperbola to a tangent has the same value for all tan¬ 
gents. 
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16 Show that the product of the distances from a point P on 
a hyperbola to the asymptotes has the same value for all 
P’s. 

17 Just as for an ellipse, a latus rectum of a hyperbola 
x 2 /a 2 - y 2 /b 2 = 1 is a chord through a focus and per¬ 
pendicular to the line on which the foci lie. Show that 

(a) the length of a latus rectum is 2b 2 !a\ 

(b) the slope of the tangent line at the upper end of the 
latus rectum to the right of the origin is e\ 

(c) the tangent line in (b) intersects the corresponding 
directrix on the *-axis. 

18 Suppose that the tangent to a hyperbola at a point P in¬ 
tersects the nearest directrix at a point Q. If F is the cor¬ 
responding focus, show that PFQ is a right angle. 

19 Show that the distance from either focus of the hyper¬ 
bola x 2 /a 2 - y 2 /b 2 = 1 to either asymptote is b. 

20 Sketch the graphs of the equations 

a 2 b 2 

on a single coordinate system. These two hyperbolas have 
the same asymptotes, and their four branches “enclose” 


a region that stretches out to infinity in four directions. 
Determine whether the area of this region is finite or in¬ 
finite. 

21 Let c be a given positive number. 

(a) Show that for all positive values of h the ellipses 



have the same foci (±c, 0). 

(b) Show that for all positive values of k < c 2 the hy¬ 
perbolas 



have the same foci as the ellipses in part (a). 

(c) If P\ = Ui,yi) is a point of intersection of one of 
the ellipses in (a) with one of the hyperbolas in (b), 
show that the tangents to the two curves at this point 
are perpendicular. 

22 Show that a line through a focus of a hyperbola and per¬ 
pendicular to an asymptote intersects the asymptote on 
the corresponding directrix. 





POLAR 

COORDINATES 


16.1 

THE POLAR 
COORDINATE SYSTEM 




As we know, a coordinate system in the plane allows us to associate an ordered 
pair of numbers with each point in the plane. This simple but powerful idea en¬ 
ables us to study many problems of geometry—especially the properties of curves 
—by the methods of algebra and calculus. Up to this stage of our work we have 
considered only the rectangular (or Cartesian) coordinate system, in which the 
emphasis is placed on the distances of a point from two perpendicular axes. How¬ 
ever, it often happens that a curve appears to have a special affinity for the ori¬ 
gin, like the path of a planet whose journey around its orbit is determined by the 
central attracting force of the sun. Such a curve is often best described as the 
path of a moving point whose position is specified by its direction from the ori¬ 
gin and its distance out from the origin. This is exactly what polar coordinates 
do, as we now explain. 

A point is located by means of its distance and direction from the origin, as 
shown in Fig. 16.1. Direction is specified by an angle 0 (in radians), measured 
from the positive x-axis. This angle is understood to be described in the coun¬ 
terclockwise sense if 0 is positive and in the clockwise sense if 6 is negative, just 
as in trigonometry. Distance is given by the directed distance r, measured out 
from the origin along the terminal side of the angle 6. The two numbers r and 
6 , written in this order as an ordered pair (r, 6), are called polar coordinates of 
the point. The direction 6 = 0 (the positive x-axis) is called the polar axis. 

Every point has many pairs of polar coordinates. For instance, the point P in 
Fig. 16.2 has polar coordinates (3, 7 t/ 4), but it also has polar coordinates (3, 7r/4 + 
hr), (3, 7t/4 — 47t), etc. Any multiple of 277 added to or subtracted from the 
0-coordinate of a point yields another angle with the same terminal side, and 
therefore another 0-coordinate of the same point. 

The term “directed distance” is intended to suggest that we often meet situa¬ 
tions in which r is negative. In this case it is understood that instead of moving 
out from the origin in the direction indicated by the terminal side of 0, we move 
back through the origin a distance — r in the opposite direction. Thus, another 
pair of polar coordinates for the point P in Fig. 16.2 is (—3, 7r/4 + i t). In Fig. 
16.3 we plot the two points Q = (2, tt/6) and R = (-2, tt/6). 

The value r = 0 specifies the origin, regardless of the value of 0. For instance, 
the pairs (0, 0), (0, ttH), (0, — 7t/4) are all polar coordinates of the origin. 
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The fact that a point does not determine a unique pair of polar coordinates is 
a nuisance, but only a minor nuisance. Nevertheless, it is true that when any par¬ 
ticular pair of polar coordinates is given, this pair determines the corresponding 
point without any ambiguity. 

Even though it is incorrect to speak of the polar coordinates of a point because 
they are not unique, this error of usage is very common and is tolerated for the 
sake of euphony. 

It is important to know the connection between rectangular and polar coordi¬ 
nates. Figure 16.1 shows at once that 

x = r cos 0 and y = r sin 6. ( 1 ) 



When r and 0 are known, these equations tell us how to find x and y. We also Figure 16.3 
have the equations 


r 2 = x 2 + y 2 and tan 6 = -y, (2) 

which enable us to find r and 0 when x and y are known. In using these equa¬ 
tions, it is necessary to take a little care to make sure that the sign of r and the 
choice of 0 are consistent with the quadrant in which the given point (x, y) lies. 


Example 1 The rectangular coordinates of a point are (— 1, V3). Find a pair of 
polar coordinates for this point. 

Solution We have 

r=± Vl + 3 — ±2 and tan 6 = - V3. 

Since the point is in the second quadrant, we can use our knowledge of trigonom¬ 
etry to choose r — 2 and 6 = 27 t/ 3 , so one pair of polar coordinates for the point 
is (2, 2ir/3). Another acceptable pair with a negative value of r is (—2, — tt/3). 
Students should plot the point and have a clear visual understanding of each of 
these statements, as suggested by Fig. 16.4. 


Just as in the case of rectangular coordinates, the graph of a polar equation 

F(r, 0) = 0 (3) 



is the set of all points P = (r, 0) whose polar coordinates satisfy the equation. 
Since the point P has many different pairs of coordinates, it is necessary to state 
explicitly that P lies on the graph if any one of its many different pairs of coor¬ 
dinates satisfies the equation. 


Example 2 Show that the points (1, 7t/2) and (0, 7t/2) both lie on the graph of 
r = sin 2 0 . 


Solution The point (1, tt/2) lies on the graph because the given coordinates sat¬ 
isfy the equation: 1 = sin 2 tt/2. On the other hand, the point (0, tt/2) lies on the 
graph even though 0 =£ sin 2 7t/2. The reason for this seemingly strange behavior 
is that (0, 0) is also a pair of coordinates for the same point, and 0 = sin 2 0. 
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In most of the situations we will encounter, equation (3) can be solved for r 
and takes the form 


r =m. (4) 

If the function f(6) is reasonably simple, the graph is fairly easy to sketch. We 
merely choose a convenient sequence of values for 6 , each determining its own 
direction from the origin, and compute the corresponding values of r that tell us 
how far out to go in each of these directions. We begin by discussing the sim¬ 
plest possible equations. 

Example 3 The equation 6 = a, where a is a constant, has as its graph the line 
through the origin that makes an angle a with the positive x-axis (Fig. 16.5). 



Example 4 The equation r = a, where a is a positive constant, has as its graph 
the circle with center at the origin and radius a (Fig. 16.6). 


Our next example is more complicated, and serves to introduce several im¬ 
portant methods. 

Example 5 The graph ofr = 2cos0is another circle, but this is not obvious. 
One way to try to get an idea of the shape of an unknown polar graph is to com¬ 
pute a short table of selected values and plot the corresponding points, as shown 
in Fig. 16.7. 

A better procedure than computing values and plotting points is to sketch the 
graph as the path of a moving point, by direct analysis of the polar equation, as 
follows. When 6 = 0, r = 2 cos 0 = 2. As 6 increases through the first quadrant, 
from 0 to 7t/2, 2 cos 6 decreases from 2 to 0, and we obtain the upper part of 
the curve shown in Fig. 16.7. As 6 increases from 7t/2 to 7r, 2 cos 6 decreases 
from 0 to —2, and the lower part of the curve is traced out. As 6 increases from 
7 r to 37t/2, the upper part of the curve is retraced, and as 6 increases from 37t/2 
to 27r, the lower part is retraced. 


0 

r 

0 

2 

7T/6 

73 

7T/4 

72 

7T/3 

1 

nil 

0 

2n/3 

-1 

37T/4 

-72 

5n/6 

-73 

n 

-2 



Figure 16.7 
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It is clear that the resulting graph is some kind of oval, perhaps even a circle. 
To verify that it really is a circle, we find and recognize the rectangular equation 
of the curve. To accomplish this, we multiply the given equation r = 2 cos 0 by 
r and use the change-of-variable equations (1) and (2) to write 

r 2 = 2r cos 0, 

* 2 + y 2 = 2x, 
x 2 - 2x + y 2 = 0, 

(x - l) 2 + y 2 = 1. 

This last equation tells us that the graph is a circle with center (1,0) and radius 
1. It should be pointed out that multiplying the given equation by r introduces 
the origin as a point on the graph; however, since this point is already on the 
graph, nothing is changed. 


The method illustrated here, sketching a polar graph by direct examination of 
the polar equation r = f(6 ), will often be important in our future work. Briefly, 
the process is this: We imagine a radius swinging around the origin in the coun¬ 
terclockwise direction, with our curve being traced out by a point attached to this 
turning radius which is free to move toward the origin or away from it in accor¬ 
dance with the behavior of the function /(0). In many of our examples and prob¬ 
lems, /(0) will be a simple expression involving the trigonometric functions 
sin 0 or cos 0. In these circumstances it will clearly be very useful to have a solid 
grasp of the way these functions vary as the radius makes one complete revolu¬ 
tion, that is, as 0 increases from 0 to 7t/2, then from 77/2 to 7 t, it to 37t/2, and 
377/2 tO 277. 


PROBLEMS 

1 Find the rectangular coordinates of the points with the 
given polar coordinates, and plot the points: 


(a) 

(2, 

77/4); 

(b) 

(4, -77/3); 

(c) 

(0, 

- 77 ); 

(d) 

(- 1 , 7 77 / 6 ); 

(e) 

(2, 

-77/2); 

(f) 

(4, 377/4); 

(g) 

(3. 

77 ); 

(h) 

(-6, -77/4); 

(i) 

(1. 

0); 

(j) 

(0, 1); 

00 

(2, 

-5 77/3); 

0) 

(13, tan -1 x 

(m) 

(- 

4, 11 tt/6); 

(n) 

(3, -377/2). 


2 Find two pairs of polar coordinates, with r’s having op¬ 
posite signs, for the points with the following rectangular 
coordinates: 


(a) 

(-2, 2); 

(b) 

(4, 0); 

(c) 

(2V3, 2); 

(d) 

(2, 2\A); 

(e) 

(V3, 1); 

(f) 

(0, 4); 

(g) 

(-3, -3); 

(h) 

(5, 5); 

(i) 

(0, -2); 

(j) 

(-V3, 1); 

00 

(5, -12); 

0) 

(-3, 4); 

(m) 

(-1,0); 

(n) 

(1,2). 


A regular pentagon is inscribed in the circle r = 1 with 


one vertex on the positive x-axis. Find the polar coordi¬ 
nates of all the vertices. 

4 Show that the point (3, 37r/4) lies on the curve r = 
3 sin 26. 

5 Show that the point (3, 3tt/2) lies on the curve r 2 = 
9 sin 6. 

6 Sketch each of the following curves and show that each 
is a circle by finding the equivalent rectangular equation: 

(a) r = 6 sin 6; 

(b) r = — 8 cos 0; 

(c) r = -4 sin 6. 

7 Sketch the curve r = 4 (sin 6 + cos 0), and identify it by 
finding the equivalent rectangular equation. 

8 Show that the graph of r = 2a cos 0 + 20 sin 0 is either 
a single point or a circle through the origin. Find the cen¬ 
ter and radius of the circle. 

9 Sketch and identify each of the following graphs: 

(a) r — 2 esc 0; (b) r = 4 sec 0; 

(c) r = -3 esc 0; (d) r = -2 sec 0. 


3 
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16.2 

MORE GRAPHS OF 
POLAR EQUATIONS 



Figure 16.8 A cardioid. 



Figure 16.9 A lima^on. 


POLAR COORDINATES 


We continue our program of getting better acquainted with polar graphs. In this 
section we concentrate particularly on sketching polar equations r =f(0) of the 
type mentioned earlier, where f(0) involves sin 6 or cos 6 in some simple way. 

We again emphasize the change in point of view that is necessary for sketch¬ 
ing polar equations. With rectangular coordinates and y = f(x), we are accus¬ 
tomed to the idea of a point x moving along the horizontal axis and y as the di¬ 
rected distance measured up or down to the corresponding point (jc, y) in the 
plane. We think in terms of “left-right” and “up-down.” 

With polar coordinates and r = f(0), however, we must think of the angle 6 
swinging around like the hand of a clock turning in the wrong direction. For each 
6 we measure out from the origin a directed distance/(0), and our moving point 
is farther out or closer in according as f(6) is larger or smaller. We must think 
in terms of “around and around” and “in and out.” 

Example 1 The curve r — a( 1 + cos 6) with a > 0 is called a cardioid. When 
0=0, cos 6 = 1 and r = 2a. As 6 increases from 0 to 7t/2 and on to tt, cos 6 
decreases from 1 to 0 to — 1, so r decreases steadily from 2 a to a to 0. This is 
shown in the upper half of Fig. 16.8. As 6 continues to increase through the third 
and fourth quadrants, we see that cos 6, and with it r, retraces its values in re¬ 
verse order, reaching cos 0 = 1 and r = 2a at 6 = 2tt. Since cos 0 is periodic 
with period 2tt, values of 0 less than 0 or greater than 2tt give points already 
sketched. The complete cardioid shown in the figure is evidently symmetric about 
the x-axis. The strange name this curve bears is accounted for by its fancied re¬ 
semblance to a heart. 


When facing a polar equation, it is a natural temptation to try to return to fa¬ 
miliar ground by converting immediately to rectangular coordinates. This is ac¬ 
complished by using the transformation equations mentioned in Section 16.1, 

r 2 = x 2 + y 2 , sin 6 = —, cos 0 = tan 6 = —. 

r r x 


In the case of the cardioid discussed in Example 1, its equation r = a( 1 + cos 6) 
becomes 


r = a 



r 2 = a{r + x), x 2 + y 2 — ax = ar , 


and finally, 


(jc 2 + y 2 - ax) 2 = a 2 (x 2 + y 2 ). 


This rectangular equation of the cardioid doesn’t really tell us much. Clearly, it 
is better in this case to think exclusively in the language of polar coordinates. 
Nevertheless, there is a certain interest in seeing that the cardioid is a fourth- 
degree curve, in contrast to the second-degree curves discussed in Chapter 15. 


Example 2 The curve r = a( 1 + 2 cos 6) with a > 0 is called a limagon (French 
for “snail”). When 6 = 0, r = 3a. As 6 increases, r decreases, becoming 0 when 
2 cos 0 = — 1, that is, when 6 = 2tt/3. As 6 continues increasing to tt, r contin¬ 
ues to decrease through negative values from 0 to —a, and the point whose move¬ 
ment we are following traces the lower half of the inner loop shown in Fig. 16.9. 
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Just as in Example 1, as 6 continues to increase through the third and fourth 
quadrants, r retraces its values in reverse order; the inner loop is completed at 
6 = 47r/3, and the outer loop is completed at 6 = 2i r. 


The curves in Examples 1 and 2 are both symmetric about the x-axis. We al¬ 
ways have this kind of symmetry when r is a function only of cos 6 , because of 
the identity cos ( — 6) = cos 6. Similarly, if r is a function only of sin 0, then the 
curve is symmetric about the y-axis, because of the identity sin {it — 6) = sin 6. 

We sometimes encounter curves whose equations have the form r 2 = f (6). In 
this case, if 6 is an angle for which f(6) < 0, then there is no corresponding 
point on the graph, because we must have r 2 > 0. But if 6 is an angle for which 
f(6) > 0, then there are two corresponding points on the graph, with r = ± V f(Q). 
These points are equally far from the origin in opposite directions, so the graph 
of r 2 = f{6) is always symmetric with respect to the origin. 

Example 3 The curve r 2 = 2 a 2 cos 26 is called a lemniscate. For each 6 there 
are two r’s, 

r = ±V2 a Vcos 26. (1) 

As 6 increases from 0 to 7 t/4, 26 increases from 0 to 7 t/ 2 and cos 26 decreases 
from 1 to 0. Accordingly, the two r’s in (1) simultaneously trace out the two parts 
of the curve shown on the left in Fig. 16.10. As 6 continues to increase through 
the second half of the first quadrant and the first half of the second quadrant, 26 
varies through the second and third quadrants and cos 26 is negative, so there is 
no graph for these 6' s. Through the second half of the second quadrant, cos 26 
is positive again, and the two r’s given by (1) simultaneously complete the two 
loops begun on the left in the figure. Further investigation reveals that no addi¬ 
tional points are obtained, and the complete lemniscate is shown on the right. 
The name of this curve comes from the Latin word lemniscus, meaning a ribbon 
tied into a bow in the form of a figure eight.* 


*The lemniscate was introduced by James Bernoulli in 1694. It played a considerable role in some 
of the early work of Gauss (in 1797) and Abel (in 1826) on elliptic functions and ruler-and-compass 
constructions in geometry. See M. Rosen, “Abel’s Theorem on the Lemniscate,” Amer. Math. Monthly, 
88 (1981), pp. 387-395. 



Figure 16.10 A lemniscate. 
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Example 4 The curve r = a sin 26 with a > 0 is called a four-leaved rose , for 
reasons that will become clear. To sketch it, we observe that as 6 increases from 
0 to 7t/4, 26 increases from 0 to tt/2 and r increases from 0 to a\ and as 6 in¬ 
creases from 7t/4 to 7t/2, 26 increases from tt/2 to tt and r decreases from a to 
0. This gives the leaf in the first quadrant (Fig. 16.11). Values of 6 between tt/2 
and tt (26 between tt and 27 t) yield negative r’s which trace out the leaf in the 
fourth quadrant; those between tt and 37t/2 (26 between 2tt and 37t) yield pos¬ 
itive r’s which trace out the leaf in the third quadrant; and those between 3 7t/2 
and 2tt (26 between 37T and 47 t) produce negative f s and the leaf in the second 
quadrant. 


We sometimes need to find the points of intersection of two curves that are 
defined by polar equations. It is natural to try to do this by solving the equations 
simultaneously. Unfortunately, this may not give all the points of intersection. 
The reason for this is that a point can lie on each of two curves and yet not have 
a pair of polar coordinates that satisfies both equations simultaneously. An ex¬ 
treme example of this behavior is provided by the two equations 

r = 1 + cos 2 6 and r = - 1 - cos 2 6 , 


whose graphs are identical. In this case there are no simultaneous solutions be¬ 
cause all the first f s are positive and all the second r’s are negative, and yet there 
are infinitely many points of intersection. 

What can be done about finding intersections? The most sensible approach is 
to depend on drawing good enough graphs of both equations on a single figure 
to see whether there are any points of intersection. When there are, it is usually 
possible to find the polar coordinates of these points either by solving simulta¬ 
neous equations or else by observing where the points are by direct inspection 
of the figure. 


Remark People who enjoy geometry in school usually take special pleasure in 
construction problems. As students will perhaps recall, the Greek mathematicians 
of antiquity learned how to perform a great variety of intricate constructions with 
only ruler and compass allowed as tools for drawing straight lines and circles: 
For instance, an angle can be bisected; a segment can be trisected; the perpen¬ 
dicular bisector of a segment can be drawn; regular polygons with n sides, where 
n = 3, 4, 5, 6, can be constructed; etc. All of these constructions and many more 
have been known since the time of Euclid and Archimedes, and the details form 
an important part of the study of plane geometry.* 

The creation of geometric constructions with ruler and compass alone, when 
considered as an intellectual game played according to clearly understood rules, 
was certainly one of the most fascinating and enduring games ever invented. The 
complicated constructions that turn out to be possible must be seen to be be¬ 
lieved. Nevertheless, after ingenious and persistent efforts extending over more 
than 2000 years, there were three classical Greek construction problems that still 
remained unsolved at the beginning of the nineteenth century. These problems 
were: 


*See Chapters III, V, and IX of H. Tietze, Famous Problems of Mathematics (Graylock Press, 1965). 
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1 To trisect an angle , that is, to divide a given angle into three equal parts 

2 To double a cube , that is, to construct the edge of a cube with twice the vol¬ 
ume of a given cube 

3 To square a circle , that is, to construct a square whose area equals that of a 
given circle 

In the course of the nineteenth century all three constructions were conclusively 
proved to be impossible under the stated conditions. 

The traditional restriction to the use of ruler and compass alone seems to have 
originated with the ancient Greek philosophers, but the Greek mathematicians 
themselves did not hesitate to use other tools. In particular, they invented vari¬ 
ous bizarre curves for the specific purpose of solving one or another of the clas¬ 
sical construction problems. Some of these curves are described in the problems 
that follow. 


PROBLEMS 

1 The following curves are also called cardioids. Sketch 
them, observing the way the position of the curve 
changes as the form of the equation changes.* 

(a) r = a( 1 - cos 0). (b) r = a{\ + sin 0). 

(c) r = a(\ — sin 0). 

2 All curves of the form r = a ± b cos 0 or r = a ± 
b sin 0 with a, b > 0 are called limagons. If a > b, the 
graph is a single loop. If a < b, the graph consists of a 
smaller loop inside a larger one, as in Fig. 16.9. Sketch 
the following Unions: 

(a) r = 3 + 2 cos 0; (b) r = 1 + 2 sin 0; 

(c) r = 1 - V2 cos 0; (d) r = 5-3 sin 0. 

3 Sketch the lemniscate r 2 = 2a 2 sin 20. 

4 Sketch the graphs of the following polar equations: 


(a) 

r = 2a cos 0; 

(b) r — 2a sin 0; 

(c) 

r = 2 - cos 0; 

(d) r = 2 + cos 0; 

(e) 

r 2 - cos 0; 

(f) r = 4 sin 2 0; 

(g) 

r - cos 20; 

(h) r = 1 + sin 20; 

(i) 

r = 2 + sin 20; 

(j) r = cos \6', 

(k) 

r = sin j0; 

(1) r = 2 sin 2 ^0; 

(m) 

r = 1 -F 2 sin 0; 

(n) r = tan 0; 

(o) 

r = cot 0; 

(p) r = sin 30; 

(q) 

r = cos 30. 



5 Transf orm the given rectangular equation into an equiv¬ 
alent polar equation: 

(a) x = 5; (b) y = -3; 

(c) X 2 + y 1 = 9; (d) x 2 - y 1 = 9; 

(e) y = x 2 \ (f) xy = 1; 

(g ) y* = x(x 2 - y*y, (h) y 2 = * 2 (yzf)- 


^Unless the contrary is explicitly stated, it is customary to assume 
that the constant a that occurs in polar equations like these is a pos¬ 
itive number. 


6 Transform the given polar equation into an equivalent 
rectangular equation: 

(a) r = 2; (b) 0 = tt/4; 

(c) rcos 0=3; (d) r = 4 sin 0; 

(e) the lima^on of Example 2, r = a(\ + 2 cos0); 

(f) the lemniscate of Example 3, r 2 = 2 a 2 cos 20; 

(g) the rose of Example 4, r = a sin 20; 

(h) r = tan 0; (i) r 2 = cos 40. 

7 Let a be a positive number and consider the points 
F = ( a , 0) and F' = (- a , 0). The lemniscate r 2 = 
2 a 2 cos 20 has the following simple geometric prop¬ 
erty: It is the set of all points P such that the product 
of the distances PF and PF' equals a 2 . Prove this by 
first finding the rectangular equation of the curve and 
then transforming this equation into its polar form r 2 = 
la 2 cos 20. 

8 Use the formula y = r sin 0 to find the largest value of 
y on 

(a) the cardioid r = 2(1 + cos 0); 

(b) the lemniscate r 2 = 8 cos 20. 

9 Use the formula x = r cos 0 to find the polar coordi¬ 
nates of the points on the cardioid r = 2( 1 + cos 0) with 
the smallest ^-coordinate. What is this smallest Jt-coor- 
dinate? 

10 Find all points of intersection of each pair of curves: 

(a) r = 4 cos 0, r = 4A/3 sin 0; 

(b) r = V2 sin 0, r 2 = cos 20; 

(c) r 2 = 4 cos 20, r 2 = 4 sin 20; 

(d) r = 1 - cos 0, r = cos 0; 

(e) r = a, r = 2>a sin 0; 

(f) r= a, r 2 = 2a 2 sin 20. 

11 A line segment of length 2 a slides in such a way that 
one end is always on the x-axis and the other end is al¬ 
ways on the y-axis. Find the polar equation of the lo- 




568 


POLAR COORDINATES 


cus of the point P in which a line from the origin per¬ 
pendicular to the moving segment intersects the seg¬ 
ment. 

*12 Consider a circle of diameter 2a that is tangent to the 
y-axis at the origin (Fig. 16.12). Let OA be the diame¬ 
ter that lies along the x-axis, AB a segment tangent to 
the circle at A, and C the point at which OB intersects 
the circle. If P — (r, 8) lies on OB in such a position 
that OP = CB, find the polar equation of the locus of 
P. This curve is called a cissoid, meaning “ivy-shaped” 
—or, more precisely, the cissoid of Diocles (Greek, sec¬ 
ond century B.C.). f 



13 Find the rectangular equation of the cissoid in Problem 

12 . 

14 Show that the line x = 2a is an asymptote of the cis¬ 
soid in Problem 12. 


15 Let a and b be given positive numbers and consider the 
line whose rectangular equation is x = a and whose po¬ 
lar equation is r cos 8 = a or r = a sec 8 (Fig. 16.13). 
The line OA in the figure intersects the line * = a at the 
point A, and Pisa distance b beyond A. The locus of 
P is called a conchoid, meaning “shell-shaped” — or, 
more precisely, the conchoid ofNicomedes (Greek, third 
century B.C.). The polar equation of the conchoid is 
clearly r = a sec 8 + b. Find its rectangular equation.* 



16 If a rotation of the polar axis through a specified angle 
transforms one polar equation into another, then their 
graphs are clearly congruent. Show that a counter¬ 
clockwise rotation of the polar axis through an angle a 
can be accomplished by replacing 8 by 8 + a. Use this 
method with suitable choices of a to show that 

(a) the cardioids in Problem 1 are congruent to the car- 
dioid r — a(\ + cos 8) discussed in Example 1; 

(b) the lima 9 on r = a ± b sin 8 in Problem 2 is con¬ 
gruent to the limagon r = a ± b cos 8\ 

(c) the lemniscate r 2 = 2 a 2 sin 28 in Problem 3 is con¬ 
gruent to the lemniscate r 2 = la 2 cos 28 discussed 
in Example 3. 


f In the Additional Problems we explain how the cissoid can be used 
to solve the problem of doubling a cube. 


*In the Additional Problems we explain how the conchoid can be used 
to solve the problem of trisecting an angle. 










16.3 POLAR EQUATIONS OF CIRCLES, CONICS, AND SPIRALS 


569 




We have already had considerable experience in transforming the rectangular 
equation of a given curve into an equivalent polar equation for the same curve. 
Our basic tools for this procedure are the transformation equations listed in Sec¬ 
tion 16.2. 

Consider, for example, the circle (Fig. 16.14, left) with center ( a , 0) and ra¬ 
dius a: 

(x — a) 2 + y 2 = a 2 or x 2 + y 2 = 2 ax. (1) 

Since x 2 + y 2 = r 2 and x = r cos 0, this equation becomes 

r 2 = 2 ar cos 0, 


which is equivalent to 

r = 2a cos 0 (2) 

because the origin r = 0 lies on the graph of (2). 

This example illustrates one way to find the polar equation of a curve, namely, 
transform its rectangular equation into polar coordinates. Another method that is 
better whenever it is feasible is to obtain the polar equation directly from some 
characteristic geometric property of the curve. In the case of the circle just dis¬ 
cussed, we use the fact that the angle OPA in the figure on the right is a right 
angle. Since OPA is a right triangle with r the adjacent side to the acute angle 0, 
we clearly have 

r = 2a cos 0, 

which of course is the same equation previously obtained, but derived in a very 
different way. 

We shall use this second and more natural method to find the polar equations 
of various curves in the following examples. 

Example 1 Find the polar equation of the circle with radius a and center at the 
point C with polar coordinates (b, a), where b is assumed to be positive. 

Solution Let P — (r, 6) be any point on the circle, as shown in Fig. 16.15, and 
apply the law of cosines to the triangle OPC to obtain 

a 2 = r 2 + b 2 — 2br cos (0 — a). 


Figure 16.14 


16.3 

POLAR EQUATIONS OF 
CIRCLES, CONICS, AND 
SPIRALS 


P - (r, 6) 



0 

Figure 16.15 
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This is the polar equation of the circle. For circles that pass through the origin 
we have b = a, and the equation can be written as 

r = 2a cos (6 - a). (3) 

In particular, when a = 0, then (3) reduces to (2), and when a = 7t/2, so that the 
center lies on the y-axis, then cos (6 — 7t/2) = sin 6 , and (3) reduces to 

r = 2a sin 6. (4) 

In this case the right triangle OPA in Fig. 16.16 provides a more direct geomet¬ 
ric way of obtaining (4), since here r is the opposite side to the acute angle 6. 


Example 2 Let F\ and F 2 be the two points whose rectangular coordinates are 
(< a , 0) and ( — a , 0), as shown in Fig. 16.17. If b is a positive constant, find the 
polar equation of the locus of a point P that moves in such a way that the prod¬ 
uct of its distances from F\ and F 2 is b 2 . 

Solution If p = (r, 6) is an arbitrary point on the curve, then the defining con¬ 
dition is 

d\d 2 = b 2 or d\ 2 d 2 2 = b 4 , 

where d\ — PF\ and d 2 = PF 2 . We apply the law of cosines twice, first to the 
triangle OPF\, 

d\ 2 = r 2 + a 2 - 2 ar cos 6 , (5) 


and then to the triangle OPF 2 , 

d 2 2 = r 2 + a 2 - 2ar cos ( 7 r - 6). 


( 6 ) 


Since cos ( 7 r — 6) = -cos 6 , we can write (6) as 

d 2 2 = r 2 + a 2 + 2 ar cos 6, (7) 


and by multiplying (5) and (7) we obtain 

d\ 2 d 2 2 = ( r 2 + a 2 ) 2 - (2 ar cos 0) 2 


or 

b 4 = r 4 + a 4 + 2 a 2 r 2 { 1 - 2 cos 2 6). 

The trigonometric identity 2 cos 2 6 = 1 + cos 26 permits us to write this equa¬ 
tion as 

b 4 = r 4 + a 4 - 2a 2 r 2 cos 26. (8) 


In the special case b = a, the curve passes through the origin, and the equation 
takes the much simpler form 

r 2 = 2a 2 cos 26. (9) 


We recognize this as the equation of the lemniscate discussed in Example 3 of 
Section 16.2. When b> a, the curve consists of a single loop, but when b < a 
it breaks into two separate loops. The cases b < a and b = a are illustrated in 
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Fig. 16.18, along with two cases of b > a. Collectively, these curves are called 
the ovals of Cassini. * 


Polar coordinates are particularly well suited to working with conic sections, 
as we see in the next example. 

Example 3 Find the polar equation of the conic section with eccentricity e if 
the focus is at the origin and the corresponding directrix is the line x = -p to 
the left of the origin. 



Figure 16.18 The ovals of Cassini. 


Solution With the notation of Fig. 16.19, the focus-directrix-eccentricity char¬ 
acterization of the conic section is 


~ e or PF ~ e • PD. 


( 10 ) 


We recall that the curve is an ellipse, a parabola, or a hyperbola according as e < 
1, e = 1, or e > 1. By examining the figure, we see that PF = r and 

PD = QR = QF + FR 

= p + r cos 0, 


so (10) is 


r = e(p + r cos 0). 



This is easily solved for r, which gives 


e P 

1 - e cos 0 


(ID 


as the polar equation of our conic section. 


We give two concrete illustrations of the ideas in Example 3. 

Example 4 Find the polar equation of the conic with eccentricity y, focus at the 
origin, and directrix x = —4. 

Solution We merely substitute e = y and p = 4 in equation (11), which yields 

7(4) = 4 

1 — y cos 6 3 — cos O' 

The curve is an ellipse. Observe that the denominator here is never zero, so r is 
bounded for all 0’s. 


*The Italian astronomer Giovanni Domenico Cassini thought of these ovals in 1680 in connection 
with his efforts to understand the relative motions of the earth and the sun. He proposed them as al¬ 
ternatives to Kepler’s ellipses before Newton settled the matter with his theory of the solar system 
in 1687. Cassini discovered several of the satellites of Saturn, and also the so-called Cassini division 
in Saturn’s ring, thereby showing that this ring consists of more than one piece. 
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Example 5 Given the conic with equation 

25 

r =- 

4—5 cos 6 

find the eccentricity, locate the directrix, and identify the curve. 



Solution We begin by dividing numerator and denominator by 4 to put the equa¬ 
tion in the exact form of (11), 

25 

4 

r = ---. 

1 — f cos 6 

This tells us that e = j and ep = x* so P ~ 5. The directrix is the line x — -5, 
and the curve is a hyperbola. Observe that the denominator here is zero when 
cos 6 = j, so r becomes infinite near these directions. 


In connection with Example 3, it is worth pointing out that if the directrix is 
the line x — p to the right of the origin, as in Fig. 16.20, then PD = p - 
r cos 0. The equation PF - e • PD now has the form 


r = e(p — r cos 6 ), 


and instead of (11) we have 


r = 


ep 

1 + e cos 6' 



Figure 16.21 The spiral of 
Archimedes. 



Figure 16.22 A hyperbolic spiral. 


Polar coordinates are very convenient for describing certain spirals. 

Example 6 The spiral of Archimedes (Fig. 16.21) can be defined as the locus of 
a point P that starts at the origin and moves outward at a constant speed along a 
radius which, in turn, is rotating counterclockwise at a constant speed from its 
initial position along the polar axis, where both motions start at the same time.* 
Since both r and 6 are proportional to the time t measured from the beginning 
of the motions, r is proportional to 6 and the polar equation of the spiral is r = 
a6, where a is a positive constant. In the figure, it is assumed that 6 starts at zero 
and increases into positive values, as implied by the definition. However, if we 
wish to allow 6 to be negative, then there is another part to the spiral which we 
have deliberately not sketched for the sake of keeping the figure uncluttered. 


Example 7 In the spiral discussed in Example 6, r is directly proportional to 6, 
r = ad. We now consider the case in which r is inversely proportional to 6 , 

r = or rO = a, (12) 

6 

where a is a positive constant. For positive values of 0 , the graph is the curve 
shown in Fig. 16.22; it is called a hyperbolic spiral because of the resemblance 
of rO - a to the equation xy - a , which represents a hyperbola in rectangular co¬ 
ordinates. 


*These are almost the same words which Archimedes himself uses to define his spiral. See his trea¬ 
tise “On Spirals” in The Works of Archimedes, T. L. Heath, ed. (Dover, n.d.), especially p. 154. 
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The essential features of the graph are easy to see by considering r = a/6. 
When 0 = 0, there is no r; when 0 is small and positive, r is large and positive; 
and as 0 increases to ©©, r decreases to 0. This tells us that a variable point P on 
the graph comes in from infinity and winds around the origin in the counter¬ 
clockwise direction in an infinite number of shrinking coils as 0 increases in¬ 
definitely. To understand the behavior of this curve for small positive 0’s, we 
need to think about what happens to 


We know that 


r sin 0 
lim ——- = 1, 

0->O 0 


and therefore 


r r sin 0 

lim y = lim a —— = a. 

0->O J 0->O 0 

It follows that the line y = a is an asymptote of the curve, as shown in the fig¬ 
ure. 

If 0 is allowed to be negative, we get another part of the curve, which again 
we do not sketch in order to avoid cluttering up the figure. The nature of this 
other part is easily understood by observing that if r and 0 are replaced by — r 
and -0, then equation (12) is unaltered. This means that for every point (r, 0) 
on the curve, the point ( — r, — 0 ), which is symmetrically located with respect to 
the y-axis, is also on the curve. Thus, the other part is a second mirror-image spi¬ 
ral that winds in to the origin in the clockwise direction as 0 —> — ©©. 

PROBLEMS 


In Problems 1-6, find the polar equation of the circle deter¬ 
mined by the stated conditions. 

1 Center (4, 77 / 6 ), radius 3. 

2 Center (-3, 77/3 ), radius 4. 

3 Center (5, 0), radius 5. 

4 Center (3, 77 / 2 ), radius 3. 

5 Center on the line 0 = tt/3 and passing through ( 6 , tt/2) 
and ( 0 , 0 ). 

6 Center (5, 77 / 4 ) and passing through ( 8 , 0). 

7 A line is drawn from the origin perpendicular to a tan¬ 
gent to the circle r = 2a cos 0. Find the equation of the 
locus of the point of intersection and sketch the curve. 

8 Find the rectangular equation of the ovals of Cassini 
(Example 2) by direct use of the condition d 2 d 2 2 = b 4 . 

9 The largest and smallest values of r on the lemniscate 
(9) are clearly r = \/~2a and r = 0. Find the largest and 
smallest values of r on the ovals of Cassini ( 8 ) 

(a) in the one-loop case b = 2 a\ 

(b) in the two-loop case b = \a. 

10 The equation r = 4/(3 — cos 0) in Example 4 represents 
an ellipse with one focus at the origin. Sketch the curve, 


find both of its directrices, and locate the center. 

11 If a conic section with eccentricity e has focus at the 
origin and directrix y = —p below the origin, show that 
its polar equation is r = epl{\ - e sin 0). What is the 
polar equation if the directrix is the line y = p above 
the origin? 

Find the eccentricity of each of the following conic sections 
(in Problems 12-15) and sketch the curve. 


12 

6 

n r - 10 

1 - cos 6' 

2 - cos 0 * 

14 

4 

1? is 

Y 2 + 4 cos 0’ 

6 + cos 0 ' 

16 

One focus of a hyperbola with eccentricity e 


the origin, and the corresponding directrix is the line 
x = 1 (or r cos 0 = 7). Find the polar equation and the 
polar coordinates of the second focus and center, and 
sketch the curve. 

17 When e > 1 , the equation r = ep/(\ — e cos 0 ) repre¬ 
sents a hyperbola. Use this equation to determine the 
slopes of the asymptotes. 
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18 Transform the equation r = ep/(\ - e cos 6) into rec¬ 
tangular coordinates. Use the facts established in Chap¬ 
ter 15 about the equations of conics in rectangular co¬ 
ordinates to show that the given equation represents a 
parabola if e = 1, an ellipse if 0 < e < 1, and a hyper¬ 
bola if e > 1 . 

19 If e < 1, use calculus to find the polar coordinates of 
the point on the ellipse r = ep/( 1 - e cos 6) that is 

(a) farthest from the origin; 

(b) closest to the origin. 

Find the length of the major axis of the ellipse in Prob¬ 
lem 19. Also find the polar coordinates of its center. 
21 In his attempts to trisect an angle, Hippias of Elis, a 
leading sophist of the time of Socrates, invented a new 
curve, as follows. Consider a square OABC of side a lo¬ 
cated in the first quadrant of the xy-plane (Fig. 16.23). 
Let OA rotate clockwise about O at a constant speed to 
the position OC. In the same time, let AB move down¬ 
ward at a constant speed to the position OC. The 
quadratrix APG is the locus of the point P of intersec¬ 
tion of the turning radius OD and the moving segment 
EFA 



Figure 16.23 The quadratrix. 

(a) Find the rectangular equation of the quadratrix 
[hint: yla = 0/(7t/2); why?]. 

(b) Find the polar equation of the quadratrix. 

(c) Use part (b) to show that OG = 2ahz, 

(d) Pappus of Alexandria (fourth century A.D.) proved 
geometrically that ADC/OC = OC/OG . Show that 
this is equivalent to the result stated in part (c). 

(e) Verify the validity of the following procedure for 
using the quadratrix as a tool for trisecting an arbi¬ 
trary acute angle 6: Construct the point E' that tri¬ 
sects OE, so that OE' = \OE\ draw E'F' parallel to 
OC, and let P' be the point where this line inter¬ 
sects the quadratrix; draw OP' and conclude that 
/.COP' = [Part (e) requires that the segment 


OE be trisected by the point E'. Figure 16.24 shows 
how this point can be produced by a Euclidean 
(ruler-and-compass) construction: Measure off any 
length b three times in any direction OQ, join R to 
E, and draw the line parallel to RE through the first 
point of division, intersecting OE at E '.] 



Figure 16.24 Trisection of 
segment OE. 


1 22 i n Problem 2 1, let a = 7t/2, so that the rectangular equa¬ 
tion of the quadratrix is jc = y cot y . Interchange x and y 
so that the equation becomes y = jc cot jc, and use divi¬ 
sion of power series to obtain Newton’s result that the area 
under the quadratrix y = jc cot jc from 0 to jc < 7t/2 is 

v. _ 1 3 _ 15 _2 7 _ . . . 

a 9 a 225* 6615* 

23 Verify the validity of the following method for trisect¬ 
ing an angle AOB by using the spiral of Archimedes, 
r=aO( Fig. 16 . 25 ): 



Figure 16.25 Trisection of angle AOB. 


(a) let OB intersect the spiral at P, and construct the 
point Q that trisects OP, so that OQ = \OP\ 

(b) construct the circle with center O and radius OQ, 
and let this circle intersect the spiral at R\ 

(c) draw OR and conclude that /.AOR = \/.AOB . 8 


fThe point G is not defined as part of the quadratrix because OA 
reaches OC at the same moment AB reaches OC, so there is no point 
of intersection. However, G is the limiting position of the points on 
the quadratrix that approach the x-axis. 


*In the Additional Problems we show how the quadratrix can also be 
used to square a circle. 

8 In the Additional Problems we show how the spiral of Archimedes 
can also be used to square a circle. 
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Consider a curve whose polar equation is r =/(0), and let 5 denote arc length 
measured along the curve from a specified point in a specified direction (Fig. 
16.26). By Section 7.5 we know that the differential element of arc length ds is 
given by the formula 

ds 2 = dx 2 + dy 2 . 

But x = r cos 6 and y = r sin 6 , and by differentiating with respect to 6 by the 
product rule, we obtain 


dx . _ .dr , 

— = —r sin 9 + cos 9 — and 
d 9 do 


dy a i ■ dr 

- = r cos 9+sine-. 


or equivalently, in the notation of differentials, 

dx = —r sin 9 d6 + cos 9 dr and dy = r cos 9 d9 + sin 9 dr. (1) 
It follows from these formulas that 
ds 2 = dx 2 + dy 2 

= r 2 sin 2 9 d9 2 - 2 r sin 9 cos 9 dr d9 + cos 2 9 dr 2 

+ r 2 cos 2 9 d9 2 + 2 r sin 9 cos 9 dr d9 + sin 2 9 dr 2 

= r 2 d9 2 + dr 2 . 

Thus, we have 

ds 2 = r 2 d9 2 + dr 2 (2) 

or ,_ 


ds = Vr 2 d9 2 + dr 2 = +1 [r 2 + d9 2 


= Jr 2 + 


dry 

d9 


d9. 


This formula enables us to compute arc lengths of polar curves by integration, 
as suggested by the figure: 


arc length from 9 = a to 9 = (3 equals j ds = Jr 2 + j ^ 1 


dO. 


Example 1 Find the total length of the cardioid r = a(l — cos 6). 

Solution This curve is quite familiar to us and is shown in Fig. 16.29. From the 
equation of the curve, we have dr = a sin 6 dd, so formula (2) gives 

ds 2 = a 2 ( 1 — cos 9) 2 d9 2 + a 2 sin 2 9 d9 2 
= a 2 [( 1 - cos 9) 2 + sin 2 9] d9 2 
= 2a 2 (\ - cos 9) d9 2 . 

Therefore 

ds = V2flVl - cos 9 d9 
= 2a|sin y0| d9, 

since 1 - cos 6=2 sin 2 \d. We know that sin \6 > 0 for 0 < 6 < 2ir, so we 
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Pa 

Figure 16.26 
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can drop the absolute value signs and write 


2tt 

2a sin —0 dd 


= —4a cos — 0 
2 


2ir 


— 4a - (~4a) = 8 a. 


The symmetry of this curve about the horizontal axis tells us that we can also 
obtain the total length by integrating from 0 to 7r and multiplying by 2, 


= 2 r 

Jo 


2a sin ~r0 d6 = — 8 a cos ^-0 
2 2 


= 0 - (~ 8 a) = 8 a. 


As a matter of routine, we should accustom ourselves to simplifying the calcu¬ 
lation of integrals as much as possible by exploiting whatever symmetry is avail¬ 
able. 


The above formula for ds in polar coordinates can also be used to find areas 
of surfaces of revolution, as explained in Section 7.6. 

Example 2 Find the area of the surface generated when the lemniscate r 2 = 
la 2 cos 26 is revolved about the x-axis. 



Solution An element of arc length ds (Fig. 16.27) generates an element of sur¬ 
face area 


where 


so 


dA = 27 ry ds, 

y = r sin 6 and ds = V r 2 dd 2 + dr 2 , 


Figure 16.27 


dA = 2irr sin O'V r 2 dd 2 A dr 2 = 277 sin r A d6 2 + r 2 dr 2 . (3) 

From the equation of the curve we have 

r dr = -2a 2 sin 26 d6, 


so 


r 4 d6 2 + r 2 dr 2 = (4a 4 cos 2 26 + 4a 4 sin 2 2 6) d6 2 
= 4 a 4 d6 2 


and (3) becomes 


dA = 47 ra 2 sin 6 d6. 


The total surface area is twice the area of the right half, which is generated as 
ds moves along the part of the lemniscate in the first quadrant, that is, as 6 in¬ 
creases from 0 to tt/4. We therefore have 


A 



sin 6 dd = —87 to 2 cos 6 


77/4 

0 



47ra 2 (2 - V2). 
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When working with rectangular coordinates, we specify the direction of a curve 
y — f(x ) at a point by the angle a from the positive x-axis to the tangent line. 
However, in the case of a polar curve r = /(0), it is easier to work with the an¬ 
gle ip (psi) from the radius vector to the tangent line, as shown in Fig. 16.28. We 
see from the figure that a — 0 + ip, so ip = a — 0; and since tan a = dy/dx and 
tan 0 — y/x, the subtraction formula for the tangent gives 

tan ip = tan (a - 0) 

tan a — tan 0 
1 + tan a tan 0 

dy/dx — y/x 
1 + (dy/dx) • ( y/x) 



Figure 16.28 


x dy — y dx 
x dx + y dy' 


(4) 


The reason why ip is a convenient angle to use with polar coordinates is that (4) 
can be put into a very simple form. First, the fact that x 2 + y 2 — r 2 tells us that 
x dx -F y dy — r dr. Next, from (1) we obtain 

x dy — y dx = r 2 cos 2 0 d6 + r sin 6 cos 6 dr + r 2 sin 2 6 d6 — r sin 6 cos 6 dr 
= r 2 d6. 


By substituting these expressions into (4), we find that 

. r dO r 

,an ' /, = ^r = ^- (5) 

This formula is the basic tool for working with tangent lines to polar curves. 


Example 3 Find the angle ip for the cardioid r = a( 1 — cos 0). 


Solution This curve is shown in Fig. 16.29. The equation of the curve gives 


dr 

To 


0 , 


so 


tan i/j = 


r 

dr/dO 


a (1 - cos 0) 
a sin 0 


2 sin 2 \e 
2 sin cos \0 


= tan ^-0. 

2 

We therefore have ip = yO, and as 0 increases from 0 to 2i r, ip increases from 0 
to 77, as indicated in the figure. 














As another example of the use of the formula for tan ip, we consider an inter¬ 
esting curve called the exponential spiral. 

Example 4 Find the angle ip for the curve r = ae be , where a > 0 and b # 0. 

Solution If b > 0, we see that r increases as 6 increases, as shown in Fig. 16.30. 
Further, it is clear that r^«>as^oo and r—> 0 as 6 —» — ©o. The distinctive 
feature of this curve is that ip is constant, because 

, r ae bd 1 

tan dr/dO abe be b' 

This enables us to find ip in the form ip = tan -1 {Mb). If b < 0, the curve spirals 
in to the origin instead of outward as 6 increases. The curve r — ae be is some¬ 
times called the equiangular spiral because of the constancy of ip. 



The two main facts of this section, formulas (2) and (5), are easy to re¬ 
member by using Fig. 16.31 as a mnemonic device. In this figure we have an 
arc of length ds joining two points with polar coordinates r, 6 and r + dr, 6 + 
dd. The outer part of the figure is approximately a rectangle, and the “differen¬ 
tial triangle” on the right is approximately a right triangle with hypotenuse 
ds and with r dd and dr as the legs opposite and adjacent to the angle ip. The 
formulas 

ds 2 = r 2 dO 2 + dr 2 


and 


tan w = 


r dd 
dr 


are now self-evident from this triangle, by the theorem of Pythagoras and the 
right triangle definition of the tangent. Needless to say, this way of reasoning 
is not a proof, but it is very useful nevertheless. It is also a good example of 
the true Leibnizian spirit in calculus, in the sense repeatedly explained in 
Chapter 7. 
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PROBLEMS 

1 For the spiral of Archimedes r = a6 (0 > 0), show that 
0 = 45° when 0 = 1 radian, and also that ip —> 90° as 
the spiral winds on around the origin in the counter¬ 
clockwise direction. Sketch the curve and show the an¬ 
gle 0 for the direction 0 = 1 radian. 

2 If two curves r = f\(6) and r = j2(0) intersect at a point 
other than the origin for a common value of 0, show by 
examining a figure that the angle y between their tan¬ 
gents can be found from the formula 

tan ip 2 ~ tan 0 i 

tan y = -—;-;-—. 

1 + tan ip 2 tan \p\ 

Under what circumstances will the two curves intersect 
orthogonally (at right angles)? 

3 Sketch each of the following pairs of curves on a sin¬ 
gle figure and use the result of Problem 2 to show that 
they intersect orthogonally: 

(a) r = 2a sin 0, r = 2b cos 0; 

(b) r = a( 1 + cos 0), r = b( 1 — cos 0), except at the 
origin; 

(c) r = a/( 1 - cos 0), r = b/( 1 + cos 0); 

(d) r = a/( 1 - cos 0), r = a{\ - cos 0); 

(e) r 2 = 2 a 2 cos 20, r 2 = 2 b 2 sin 20, except at the ori¬ 
gin. 

4 Show that the spirals r = 0 and r = 1/0 intersect or¬ 
thogonally at 0 = 1. 

5 Find the area of the surface generated by revolving the 
cardioid r = a( 1 - cos 0) about the *-axis. 

6 The lemniscate r 2 = 2 a 2 cos 20 is revolved about the 
y-axis. Find the area of the surface of revolution gen¬ 
erated in this way. 

7 Consider the tangent at a point P on the spiral r = a6 
(0^0), and let the line OT which is perpendicular to 
OP at the origin O meet this tangent at T (Fig. 16.32). 
Show that the segment OT equals the circular arc ASP 
with center O which is drawn from the polar axis to the 
point P 

8 At what angle does the lemniscate r 2 = 2 a 2 cos 20 in¬ 
tersect the circle r — al 



9 


10 

11 


12 

13 

14 

15 

*16 

17 

18 


*19 


On the upper half of the parabola r = a/( 1 - cos 0), 
show that tan 0 = -tan y0, and conclude that 0 = 
it - {0. Show that this establishes the following re¬ 
flection property: The tangent at any point on the 
parabola makes equal angles with the horizontal 
through that point and the line from the origin through 
the point. 

Show that the part of the hyperbolic spiral rQ = 1 from 
0 = 7r/2 to 0 = oo has infinite length. 

Find the surface area generated by revolving the circle 
r = 2a cos 0 about the line 

(a) 0=0; 

(b) 0 = 77/2. 

Find the length of one turn of the spiral r = 0, from 0 = 
0 tO 0 = 277. 

If a curve r = /(0) has the property that 0is a constant 
=£ 7r/2, show that the curve must be the exponential spi¬ 
ral r = ae be . 

Use integration to find the circumference of the circle 
r = 2a cos 0. 

Show that if a point moves at constant speed along the 
exponential spiral r = ae be , then the radius r changes 
at a constant rate. 

If a curve r = /(0) has the property that 0 = y0, show 
that the curve must be the cardioid r = a( 1 — cos 0). 
Find the length of the exponential spiral r = e~ 9 from 
r = 1 to the origin. 

Sketch the exponential spiral r = ae b9 for the case in 
which a is positive and b is negative, and show that the 
arc length from 0 = 0 to 0 = is equal to the length 
of the part of the tangent at 0 = 0 that is cut off by the 
x- and y-axes. 

Show that the length L of the right loop of the lemni¬ 
scate r 2 = 2 a 2 cos 20 can be expressed in the form 



dO 

Vcos 20 


= V2 a l 


7T/4 

tt/ 4 yj" 


d0 


2 sin 2 0 


Introduce the new variable u = tan 0 and show that 


sin 2 0 = 


1 + M 2 


and 


dd = 


du 

1 + u 2 ' 


and that therefore 


L = 



du 

Vl - M 4 


t 


+ This is a special elliptic integral that played a large part in the in¬ 
vestigations of Gauss mentioned in a footnote of Section 16.2. See 
also Problem 16 in Appendix A.9. 
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16.5 

AREAS IN POLAR 
COORDINATES 



Q — —, so area = -k rs 
r 1 




Our problem here is to find the area A of the region bounded by a polar curve 
r —f(6) and two half-lines 6 = a and 6 = /3, as shown in Fig. 16.33. Our ap¬ 
proach is modeled on the “differential element of area” idea of Section 7.1. 

In working with areas in rectangular coordinates, we use thin rectangular strips 
and rely on the fact that the area of a rectangle equals length times width. Here 
we need the fact (Fig. 16.34) that the area of a sector of a circle of radius r and 
central angle 6 (measured in radians) is \r 2 6. In Fig. 16.33 our element of area 
dA is the area of the very thin sector with radius r and central angle d6, so 

dA = \r 2 dO. (1) 

In the manner of Section 7.1, we think of the total area A as the result of adding 
up these elements of area dA as our thin sector sweeps across the region, that is, 
as 6 increases from a to /3: 

A = J dA = r 2 dO. (2) 

Again, the essence of the process of integration is that we calculate the whole of 
a quantity by cutting it up into a great many convenient small pieces and then 
adding up these pieces. 

We shall give a more mathematically sophisticated approach to formula (2) in 
Remark 2. First, however, we illustrate its use in several examples. As students 
will observe in these examples, it is always essential in solving area problems to 
have a good idea of what the curve looks like, because the correct limits of in¬ 
tegration will be determined from the figure. 


Example 1 Use integration to find the area of the circle r = 2a cos 6. 


Solution The complete circle (Fig. 16.35) is swept out as 0 increases from 
-7t/2 to 7t/2. By symmetry we can integrate from 0 to 7t/2 and multiply by 2, 

fTT/2 1 „ Ctt/2 1 „ 

A = 2 — r 2 d6 = 2 — ■ 4 a 2 cos 2 0 dO 

Jo 2 Jo 2 

„ [' n/2 1 

= 4a 2 J o -j(l + cos 20) dO 


■■ 2a 2 ( 0 -I- — sin 20 


77/2 


Naturally, we expected this answer because our circle has radius a, but it is re¬ 
assuring to obtain a familiar result by a new method. 


Example 2 Find the total area enclosed by the lemniscate r 2 = 2 a 2 cos 20 (Fig. 
16.36). 

Solution By symmetry, we calculate the area of the first quadrant part and mul¬ 
tiply by 4: 


A 


= 4 


77/4 1 

-r 2 d0 = 4 


[ nl4 — • 2 a 2 cos 20 dO 
Jo 2 


_ f 773 

= 4 a 2 

Jo 


cos 2 0 dO = 2a 2 sin 20 


7T/4 

0 


= 2a 2 . 
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This problem provides a good illustration of the value of exploiting symmetry; 
for if we carelessly integrate all the way around from 0 to 2tt , forgetting that r 2 
is sometimes positive and sometimes negative, then our final answer turns out to 
be 0, which is obviously wrong. 


Example 3 Find the area inside the circle r = 6a cos 6 and outside the cardioid 
r = 2a{\ + cos 6 ). 

Solution By equating the r’s and solving for 0 , we see that the curves intersect 
in the first quadrant at 6 — 7t/3, as shown in Fig. 16.37. The indicated element 
of area is 



Figure 16.36 


dA — 2(^circle)^ ^Q 2 (^cardioid)^ d6 

~ Ti(^circle)^ — (^cardioid)^l d9 

= {[36 a 2 cos 2 9 - 4a 2 (\ + cos 6) 2 ] d6 
= 2a\8 cos 2 9 - 1 - 2 cos 6) d9. 

By symmetry, the area we seek is double the first quadrant area, so 


Ctt/3 

A = 2 J () 2a 2 (8 cos 2 9 - 1 - 2 cos 9) d9 

= 4 a 2 f 73 [4(1 + cos 26) — \ - 2 cos 6] d6 


= 4 a 2 30 + 2 sin 29 - 2 sin 6 


-tt/3 


= 4raz 2 



Figure 16.37 


Remark 1 The ideas of this section have an important application to the as¬ 
tronomy of the solar system. Consider a point P moving along a polar curve 
r = /(#)- We can think of P as a planet moving along its orbit, with the sun at 
the origin. If A is the area swept out by the radius OP from a fixed direction a 
to a variable direction 6, as shown in Fig. 16.38, then we have 

dA = \r 2 dO . 

If both A and 6 are thought of as functions of time t, then we see that 

dA 1 7 d6 
dt 2 r dt' 

The derivative dA/dt is, of course, the rate of change of the area A. Kepler’s sec¬ 
ond law of planetary motion states that a planet moves in such a way that the ra¬ 
dius joining the planet to the sun sweeps out area at a constant rate. This means 
that dA/dt is constant, which in turn means that 



Figure 16.38 


9 dO 

r 1 —— = a constant 
dt 


(3) 


for any given planet. Thus, for example, if a planet’s orbit takes it in twice as 
close to the origin, then its angular velocity dO/dt must increase by a factor of 4. 
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This fact has far-reaching implications which we shall examine more thoroughly 
in the last section of the next chapter. 


Remark 2 We briefly reconsider formula (2) for the area A shown in Fig. 16.33. 
Our purpose is to remind students of the point of view developed in Section 6.4, 
namely, that a definite integral is defined to be a limit of approximating sums. 
As usual, we begin with a subdivision of the interval of integration [a, /?]: 

a = 0 O < 0j < • • • < 6 n = p. 



For each k = 1, 2, . .. , n, let and M k be the minimum and maximum values 
of f(6) on the kth subinterval [6k- i, 0*] of length A0* = Ok — 6k- 1 - Also, let 
AAfc be the area within the curve r = f (6) corresponding to this subinterval. In 
Fig. 16.39 we show the area AA* squeezed between the areas of the inscribed 
sector with radius r = ra* and the circumscribed sector with radius r = Mk . We 
therefore have 

\rrik 2 A 0* < A A k ^ \M k 2 A d k . 

By adding these inequalities from k = 1 to k = n, we obtain 

M* 2 A0*. 

1 Z k— 1 Z 

because A is the sum of the AA*’s. We now vary the subdivision in the manner 
described in Section 6.4, so that max A0*—» 0. Then each of these sums ap¬ 
proaches the definite integral 



dO 


for any continuous function r = /(0), and since the area A is squeezed between 
the sums, we legitimately conclude that 


A =Ljf(v 2 M=Li r2d6 ’ 


which is (2). 


PROBLEMS 

1 Find the area enclosed by the cardioid r = a( 1 + cos 6). 

2 Find the area between the parabola r = 8/( 1 — cos 6) and 

the y-axis. 

3 Show that the area inside the first turn of the spiral r = 
ad , that is, for 0 ^ 6 < 27T, equals one-third the area of 
the circle that passes through the endpoint of this turn 
and has center at the origin.* 

4 Find the total area inside the rose r = sin 26. 

5 Find the total area inside the rose r = sin 3 6. 

6 Find the area inside the smaller loop of the lima 9 on r = 
1 + 2 cos 6. 


"This statement and the result of Problem 7 in Section 16.4 are the 
main theorems proved by Archimedes in his treatise On Spirals 
(Propositions 20 and 24). 


7 Find the area between the two loops of the lima 9 on r = 
1 + 2 cos 6. 

8 Find the area between the circle r = 2a cos 6 and the line 
? = ■* 

(a) by integration; 

(b) by elementary geometry. 

9 Find the area that lies inside both curves r = a cos 6 and 
r = a(l - cos 6). 

10 Find the area that lies inside both curves r = a and r 2 = 
2 a 2 cos 26. 

11 Use integration i n polar coordinates to show that the area 
of the rectangle bounded by jc = 0, y = 0, x = a, y = b 
is ab. 

12 Find the area common to the two circles r = 2a cos 6 
and r = 2b sin 6. 





ADDITIONAL PROBLEMS FOR CHAPTER 16 


583 


13 Show that the area between the cissoid r = 2a (sec 0 - 2AJT, where A e is the area of the elliptical orbit and T is the 

cos 0) and its asymptote x = 2a (see Problem 12 in Sec- time required for the planet to go once around its orbit, 

tion 16.2) is 3 times the area of the generating circle. 

14 In equation (3), show that the value of the constant is 


CHAPTER 16 REVIEW: CONCEPTS, FORMULAS 


Think through the following. 

1 Equations connecting rectangular and polar coordinates. 

2 Graphs of polar equations: circles, cardioids, lemniscates, 
spirals. 


3 Arc length formula: ds 2 = r 2 d6 2 + dr 2 . 

4 Tangent line formula: tan i p = r d6/dr . 

5 Area formula: dA = \r 2 d6. 


ADDITIONAL PROBLEMS FOR CHAPTER 16 


SECTION 16.2 

1 Transform the given rectangular equation into an equiv¬ 
alent polar equation: 

(a) y — Ax’, 

(b) 4x 2 + 9y 2 = 36; 

(c) x 2 + y 2 — 2x + Ay = 0; 

(d) 2x — 5 y = 3; 

(e) y 2 = Ax’, 

(f) x 2 + y 2 - Ay = V* 2 + y 2 ', 

(g) x 3 + y 3 = 12 xy. 

2 Transform the given polar equation into an equivalent 
rectangular equation: 

(a) r= -3; (b) 0 = 3tt/4; 

(c) r sin 0 = -5; (d) r = 2 sec 0; 

(e) r 2 = sin 20; (f) r = cos 20; 

(g) r = sin 30; (h) r = cos 30; 

(i) r 2 = sin 2 0 tan 0. 

3 Find all points of intersection of each pair of curves: 

(a) r sin 0 = a, r cos 0 = a’, 

(b) r = a( 1 + cos 0), r = a{ 1 — sin 0); 

(c) r = a cos 20, Ar cos 0 = V3 a’, 

(d) r sin 0 = 3, r = 6 sin 0; 

(e) r = 1 + cos 0, r 2 = \ cos 0; 

(f) r = a, r 2 = 2 a 2 cos 20; 

(g) r=a{ 1 + sin 0), r = 2a cos 0; 

(h) r cos 0 = 2, r sin 0 + 2 V3 = 0; 

(i) r = 2 sin 2 0, r = —2; 

(j) r cos 0 = 1, r = 2 cos 0 + 1; 

(k) r = 1 + cos 0, r = 3 cos 0; 

(l) r = a(l + cos 0), r = a(\ + sin 0); 

(m) r = a sin 20, r = a( 1 - cos 20). 

4 Diodes invented his cissoid to solve the classical Greek 
problem of doubling a cube, that is, constructing a sec¬ 
ond cube whose volume is twice that of a given cube. 
If OA in Problem 12 of Section 16.2 is the edge of the 
given cube, then the edge of the second cube must have 
length *V2 OA. Verify that the following construction 
produces such a length. Let D be a point on the posi¬ 


tive y-axis such that OD = 20A, let AD intersect the cissoid 
at E, and extend OE to intersect the asymptote x = 2a at E. 
Then (AF) 3 = 2(OA) 3 , so AF is the required length.* Hint: If 
E = (x, y), show that y 3 = 2x 3 . 

5 Verify that the following construction, using the con¬ 
choid discussed in Problem 15 of Section 16.2, trisects 
the given angle BOP (Fig. 16.40). Draw any line x = 
a, and let A be the point where it intersects OP. Form 



■"Hippocrates of Chios (see Section 6.2) reduced the problem of dou¬ 
bling a cube of edge a to the problem of constructing two mean pro¬ 
portionals x and y between a and 2a: 

a _ x _ y 

x y~ 2a 

From these equations we have x 2 = ay and xy = 2a 2 , and eliminat¬ 
ing y we find that x 3 = 2a 3 . Thus x is the edge of a cube that has 
twice the volume of the first cube. This value is the ^-coordinate of 
the point of intersection of the parabola x 2 = ay and the hyperbola 
xy = 2a 2 , so the problem of doubling a cube is solved if we allow 
ourselves to use these curves as tools. Historians of mathematics be¬ 
lieve that the conic sections may have arisen in just this way. 
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the conchoid determined by the numbers a and b, where 
b = 20A. If C is the point where the horizontal line 
through A intersects this conchoid, let D be the point 
where OC intersects the line x = a, so that DC = b = 
20A. Let a be the angle POC and /3 the angle BOC, so 
that the given angle BOP is a + /3. Then by the law of 
sines we have 

OA = AC 
sin /3 sin a’ 

so 

OA = 2 OA cos p 
sin P sin a 

and sin a = 2 sin P cos p = sin 2/3. Therefore a = 2/3 
and LBOP = 3/3, so /3 = \/LBOP and the line OC 
trisects the given angle. 

6 A general conchoid can be defined as follows. Let O be 
a fixed point and C a given curve. On the line OA from 
O to a point A on C, continue to a point P, where AP 
is a positive constant b. Then the locus of P is called 
the conchoid of C with pole O and constant b. If C is 
a straight line and O is any point not on C, we get the 
conchoid of Nicomedes (Problem 15 in Section 16.2) 
as a special case. Show that a conchoid of a circle r = 
a cos 0 with pole O at the origin and constant b is the 
limagon r = a cos 6 + b, so that conchoids include li- 
magons— and therefore cardioids—as special cases. 1 " 

SECTION 16.3 

7 Verify the validity of the following procedure for squar¬ 
ing a circle of a radius a by using the quadratrix de¬ 
fined in Problem 21 of Section 16.3: 

(a) By part (d) of the problem just mentioned, 

OG = OC 
OC ADC ’ 

so a segment whose length is \ the circumference 
of our circle can be constructed as the third pro¬ 
portional to the segments OG and OC. (In an equa¬ 
tion of the form dd = dtx, x is called the third pro¬ 
portional to the given segments of lengths c and d, 
and jc can be constructed by ruler and compass as 
indicated in Fig. 16.41.) 


f The limagon appears to have been named by the French mathemati¬ 
cian Roberval, who used it as an example in one of his writings in 
the 1630s and called it the “limagon de monsieur Pascal.” This refers 
to Etienne Pascal, father of the famous Blaise Pascal. Etienne was a 
friend and correspondent of Mersenne, which made it easy for his son 
to enter the elite intellectual circles of France at an early age. 



(b) By part (a), a segment can be constructed whose 
length b is \ the circumference of our circle, with 
ab the area of the circle. The side 5 of a square 
whose area is ab can now be constructed as the 
mean proportional to a and b. (In an equation of the 
form a/s = s/b, s is called the mean proportional to 
a and b, and can be constructed by ruler and com¬ 
pass as indicated in Fig. 16.42.) 



8 Verify the validity of the following procedure for squar¬ 
ing a circle of radius a by using the spiral of 
Archimedes: 

(a) draw the circle with center O and radius a, and su¬ 
perimpose the spiral r = a6 with constant of pro¬ 
portionality equal to the radius of this circle (Fig. 
16.43); 



Figure 16.43 
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(b) since OP and the arc AB both have length a 9 and 
are therefore equal, it follows that the radius OQ for 
9 = 7t/ 2 is a segment whose length is \ the cir¬ 
cumference of our circle; 

(c) the squaring of the circle is now completed exactly 
as in part (b) of Problem 7. 


SECTION 16.4 

9 For the hyperbolic spiral rd = a (9 > 0), show that ip = 
135° when 6 = 1 radian, and also that ip^> 90° as the 
spiral winds on around the origin in the counter clock¬ 
wise direction. Sketch the curve and show the angle ip 
for the direction 9 = 1 radian. 

10 If a point moves along a polar curve r =f(9) at a con¬ 
stant speed, and is also moving away from the origin at 
a constant speed, show that the curve must be the ex¬ 
ponential spiral r = ae be . 

*11 Show that the arc length of one leaf of each of the fol¬ 
lowing roses equals the total arc length of the corre¬ 
sponding ellipse (but do not try to evaluate the integrals 
involved, because it cannot be done): 

(a) r= 2 sin 2 9, x 2 + 4y 2 = 1; 

(b) r = 6 cos 3 9, x 2 + 9y 2 = 9. 

*12 The distances n and r 2 from the foci to any point on 
an ellipse satisfy the equation 


SECTION 16 .5 

*19 Suppose that a polar curve r = r(9), a < 9 ^ /3, also 
has a representation as a rectangular curve y = y(x), a ^ 
x < b, as shown in Fig. 16.44. (It is convenient here to 



denote each function and its dependent variable by the 
same letter.) From the point of view of rectangular co¬ 
ordinates, the area of the region OCD is 

Aqcd — Aqad + A AB cd ~ Aqbc 


=• \r\P) sin ft cos 



y dx 


j_ 

4 


r 2 (9) sin 29 


— {r 2 (a ) sin a cos a 


r i + r 2 — a constant. 

By differentiating both sides of this equation with re¬ 
spect to arc length s and interpreting the result in terms 
of differentials, show that the tangent at any point on 
the ellipse makes equal angles with the lines to the foci. 
13 Establish the reflection property of parabolas by an 
adaptation of the argument used in Problem 12. 


14 Consider the part o f the lemniscate r 2 = 2 a 2 cos 2 9 that 20 
lies in the first quadrant, and show that at any point on 

this curve the angle between the radial direction and the 21 

outward normal is 29. 

15 Find the angle at which the circles r — a and r— 22 

2a cos 9 intersect. 

16 Show that the length of an arc o f the exponential spiral 23 
r = ae be is proportional to the difference of the radii at 

its ends. 

17 Find the length of the spiral r = a9 2 from 9 = 0 to 9 = 24 

27r. Sketch the curve. 

18 Find the length of the curve r = a sin 3 \9 from 9 = 0 25 

to 9 = 37t/2. What is the total length of this curve? 


+ r{9) sin 9[r\9 ) cos 9 — r{9) sin 9] d9. 

B y integrating by parts at the right moment, show that 
this formula reduces to 

CL 3 1 

A ocd = l 2 r2 dO- 

Find the area inside the circle r = a and outside thecar- 
dioid r = a(l — cos 9). 

Find the area outside the circle r = 4 cos 9 and inside 
the lima$on r = 1+2 cos 9. 

Find the area outside the circle r = a and inside the cir¬ 
cle r = 2a cos 9. 

Show that the area inside the first turn of the exponen¬ 
tial spiral r = ae be is (r 2 2 - r\ 2 )/4b, where r\ is the ini¬ 
tial radius and r 2 is the terminal radius. 

Find the area inside one loop of the curve r 2 = 
a 2 sin 9. 

Find the total area outside the circle r = a and inside 
the lemniscate r 2 = 2 a 2 cos 29. 






PARAMETRIC 
EQUATIONS. 
VECTORS IN THE 

PLANE 


17.1 

PARAMETRIC 
EQUATIONS OF 
CURVES 


When we think of a curve as the path of a moving point, it is often more con¬ 
venient to study the curve by using two simple equations for x and y in terms of 
a third independent variable t, 

x=f(t) and y = g(t), (1) 

than by using a single more complicated equation of the form 


Fix, y) = 0. 


( 2 ) 


t = t 2 



Figure 17.1 


In physical problems we often consider a moving point, and t is understood to 
be the time measured from the moment at which the motion begins. The point P 
whose coordinates are x and y then traces out the curve as t traverses some def¬ 
inite interval, say fj < r < ^ This provides not only a description of the path on 
which the point moves, but also information about the direction of its motion and 
its location on the path for various values of t, as suggested in Fig. 17.1. The 
third variable in terms of which x and y are expressed is called a parameter (from 
the Greek para , meaning “together,” and meter, meaning “measure”), and equa¬ 
tions (1) are called parametric equations of the curve. If we want the rectangu¬ 
lar equation of the curve in the form (2), we must eliminate the parameter from 
equations (1). 



Example 1 Sketch the curve x = r 2 , y = f 3 and find its rectangular equation. 

Solution We can plot a few points by calculating x and y for several values of 
t, as indicated by the table in Fig. 17.2. A few calculations are worthwhile to give 
us something concrete to start with. However, it is more profitable to study how 
x and y vary as t varies, instead of merely plotting points. Here we see that as t 
increases from 0 to x and y both start at 0 and increase into positive values, 
but y increases faster than x. This means that for large f s the point P = (jc, y) 
moves away from the x-axis faster than from the y-axis, as shown. For negative 
f s x is still positive, but y is negative, so this part of the curve is a reflection 
about the x-axis of the upper part which we have just described. The general 
shape of the curve can be discovered from the behavior of the slope of the 
tangent dy/dx, which can easily be calculated as a function of t by dividing 
dy = 3 1 2 dt by dx = 21 dt : 
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dy _ 3t 2 dt _ 3_ 
dx 21 dt 2 


0 as t —> 0, 

oo as t oo, 

k — oo as t —> —oo. 


Finally, we notice from the parametric equations that the square of y equals the 
cube of x, so 

y 2 = x 3 or y = x 312 

is the rectangular equation of the curve. 


As we remarked earlier, the use of parametric equations is very natural if we 
think of a curve as the path of a moving point whose position depends on the 
time t measured from some convenient initial moment. 


Example 2 Let a projectile be fired from the origin at time t — 0 with an initial 
velocity of magnitude vq ft/s (or m/s) and direction given by the angle of eleva¬ 
tion a (Fig. 17.3), and assume that the only force acting on the projectile is the 
force of gravity. Discuss the subsequent motion. 


Solution We consider the x- and y-components of the acceleration separately. 
Since the force of gravity acts downward, we have 


= ^ = o 

dt 


and 


dv y 

dt 


~gy 



where g = 32 ft/s 2 (or 9.80 m/s 2 ) is the acceleration due to gravity. Therefore 

v x = c\ and v y = —gt + c 2 

for certain constants c\ and c 2 . But when t - 0, we have v x = cos a and v y - 
Vq sin a, and consequently 

dx , dy . 

v x = — = Vo cos a and ix = -f- = —gt + Vo sin a. 
dt dt 


Another integration yields 


x = {vq cos a)t + C'i and y = —\gt 2 + {vq sin a)t + C 4 . 


But x = y = 0 when t = 0, so c 3 = C 4 = 0 and 

* = (^0 cos a)t , y = ~{gt 2 + (^o sin a)t. (3) 


These are parametric equations for the path of the projectile. We can use equa¬ 
tions (3) to show that the projectile follows a parabolic path. To do this, we elim¬ 
inate the parameter by solving the first equation for t and substituting in the sec¬ 
ond: 


vq cos a 



Vq 2 cos 2 a 


+ (^ sin a) • 


JC 

vo cos a 


8 

2 Vq 2 cos 2 a 


x 2 + (tan a) jc. 
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The fact that y is a quadratic function of jc shows that the point (jc, y) moves on 
a parabola. Further properties of this motion are developed in Problem 13. 


In any motion problem like that discussed in Example 2, it is natural to use 
the time t as the parameter. However, in problems that are more concerned with 
geometry than with physics, the most convenient parameter is likely to have some 
geometric significance. In Example 1, for instance, it is perfectly possible to think 
of the parameter t as a pure variable, without any connotation at all. On the other 
hand, in this example we have y/x = t 3 /t 2 = t, so we can also think of t as the 
slope of the radial line from the origin to a variable point on the curve, and this 
certainly lends additional vividness to our conception of the way the curve is 
traced out as the parameter varies. Needless to say, there is nothing sacred about 
the letter t, and we are always free to use any letter we wish as a parameter. 



Example 3 Consider the circle shown in Fig. 17.4, with radius a and center at 
the origin. It is easy to see that 

x = a cos 0, y = a sin 0 (4) 

are parametric equations for this circle, where 0 is the indicated central angle. 
As 0 varies from 0 to 277, the point P = (jc, y) starts at (a, 0) and moves once 
around the circle in the counterclockwise direction. If we didn’t already know 
the rectangular equation of this circle, we could obtain it from the identity 
cos 2 0 + sin 2 0 = 1 , which yields 

—y + = 1 or x 2 + y 2 = a 2 . (5) 

a 1 a L 

But students should notice that this equation is a static thing, and in passing from 
(4) to (5) we lose our sense of the circle as a curve traversed by a moving point. 


Example 4 The ellipse 



shown in Fig. 17.5 can be parametrized as follows. Since 



there exists an angle 0 such that cos 0 = x/a and sin 0 = y/b, so 

x = a cos 0, y = b sin 0. 

As 0 varies from 0 to 277, the point P = ( x , y) starts at (a , 0) and moves once 
around the ellipse in the counterclockwise direction. Observe from the figure that 
0 is not the central angle of the point P = (jc, y)\ instead, it is the central angle 
of the points A and B on the two circles, one circumscribed about the ellipse and 
the other inscribed in the ellipse, and P is the intersection of the vertical line 
through A and the horizontal line through B. 
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Figure 17.5 


Example 5 The parabola x 2 = 4 py (Fig. 17.6) can be parametrized in many ways. 
One method is to use the slope of the tangent at (x, y ) as parameter, 



Since 


2x = 4p^j- or 

ax 


dy _ _x_ 
dx 2p ’ 


the parametric equations in this case are 

x = 2 pt, y = pt 2 . 


Another method is to use as parameter the number 



Figure 17.6 



which is the slope of the radial line to (. x , y). Here we have 
y — mx and x 2 = 4 py — 4pmx, 


so 


x = 4pm, y = 4pm 2 


are the parametric equations. In each case the entire parabola is traced out as the 
parameter increases from — oo to ©o. 


Our next example illustrates the fact that a parametric curve is often only a 
part of the corresponding rectangular curve. 

Example 6 Sketch the curve x = cos 2 (7r/2)r, y = sin 2 (7r/2)r and find its rec¬ 
tangular equation. 

Solution Since cos 2 (7r/2)r + sin 2 (tt/ 2)t = 1, the point P = (t, y) moves on the 
straight line x + y = 1 (Fig. 17.7). But neither x nor y can be negative, so the 
point is confined to that portion of this line that lies in the first quadrant. It is 



Figure 17.7 


x |v 
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easy to see that the point is at A = (1, 0) when t = 0; that it moves to B = 
(0, 1) as t increases from 0 to 1; that it moves back to A as t increases from 1 to 
2 ; and so on. 


Example 7 In Example 6 we discussed a parametric curve consisting of only 
part of a straight line. We now show that the parametric equations 

x = t - 1, y = 2r + 3 (6) 

represent all of a straight line. By eliminating t from these equations, that is, by 
multiplying the first by 2 and subtracting the second, we obtain 

2x — y = —5. (7) 


Thus all points ( x , y) satisfying ( 6 ) also satisfy (7), which is the equation of a 
straight line. Conversely, given a point ( x, y) satisfying (7), let t = 1 + jc [we ob¬ 
tain this by solving the first equation in ( 6 ) for t\. Then 

x = t- 1, 

and from (7) we have 


y — 2 x + 5 — 2(t — 1) + 5 — 2r + 3, 


so the point (jc, y) lies on the parametric curve ( 6 ), as we wished to show. 


Our previous ways of representing curves, by rectangular coordinates and by 
polar coordinates, are easy to fit into our present system of parametric represen¬ 
tation. Thus, if we have a curve y = fix), then we can write 

y = fix) and x = x, 

so that x itself is used as the parameter. Also, a curve that is given in polar co¬ 
ordinates by the polar equation r = F(6) can be viewed as a parametric curve 
with parameter 0. To see this, we use the transformation equations x = r cos 0 
and y = r sin 0 to write 

x = F{6) cos 0, y = F(6) sin 0. 

For example, the spiral of Archimedes r = aO becomes 
x = ad cos 0, y = a6 sin 0; 

and the cardioid r = a( 1 + cos 0) can be expressed as 

x = a {cos 0 + cos 2 0), y = a (sin 0 + sin 0 cos 0). 


PROBLEMS 


1 In each case, sketch the curve represented by the given 
parametric equations, describe the way the point (x, y) 
moves as t varies from large negative to large positive 
values, and find the rectangular equation: 

(a) x = 1 + r, y = 1 — r; 

(b) x = -1 + 2u y = 2 + At. 

2 If x and y are linear functions of t, 

x = *o + at, y =y 0 + bt , 


show that the graph is always a complete straight line 
unless both a — 0 and b = 0. Can every straight line be 
represented in this way? 

3 Sketch the graph of x = I - r 2 , y = 2 + t 2 . Describe 
the way the point ( x, y) moves as t varies from large 
negative to large positive values, and find the rectan¬ 
gular equation of the curve. 

For each of the following pairs of parametric equations (in 
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Problems 4-9), sketch the curve and find its rectangular equa¬ 
tion. 

4 x = 3 cos t, y = 2 sin t. 

5 x = 1 + f 2 , y = 3 - t. 

6 x = sin r, y = — 3 + 2 cos r. 

7 x = sec f, >’ = tan t. 

8 x = f 3 , y = 1 - f 2 . 

9 x = sin ?, y = cos It. 

10 Sketch the graph represented by 


and find its rectangular equation. Hint: Square and sub¬ 
tract. 

11 Are the parametric curves 



and 

x = e r + e~ r , y = e r — e~ ! 

identical? Explain. 

12 We know by Example 3 that the unit circle x 2 + y 2 = 
1 (Fig. 17.8) can be parametrized by the equations 

x = cos 6 , y = sin0, 0 < 6 < 2tt. 



A very different parametrization can be obtained by us¬ 
ing as the parameter the tangent of the angle \d shown 
in the figure. 


t - 


1 +x 


(a) By first eliminating y from this equation an< 
y 2 = 1, show that the equations 


1 -r 2 


2 1 


\+t 2 ' y 1 + r 2 ’ 


-oo < t 


parametrize the entire circle except for the point 

(- 1 , 0 ). 

(b) A point (x, y) in the plane such that both x and y 
are rational numbers is called a rational point. Show 
that for rational values of t the equations in (a) give 
all rational points on the unit circle, except the point 
(- 1 , 0 ).* 

13 Consider the motion of the projectile described in Ex¬ 
ample 2. 

(a) Show that the maximum height of the projectile is 

Vq - sin 2 a 

ymax — 

2 8 

(b) Show that the range R of the projectile, i.e., the dis¬ 
tance from the origin to the point where the pro¬ 
jectile reaches the x-axis on its descent, is given by 
the formula 


R = sin 2a. 

8 

(c) What angle of elevation a produces the maximum 
range? 

(d) Show that doubling the magnitude of the initial ve¬ 
locity multiplies both the maximum height and the 
range by a factor of 4. 

14 The witch is a bell-shaped curve that can be defined as 
follows. Consider the circle of radius a which is tan¬ 
gent to the x-axis at the origin (Fig. 17.9). The variable 
line OA through the origin intersects the line y = 2a at 
the point A and the circle at the point B. The point P is 
the intersection of the vertical line through A and the 
horizontal line through B, and the witch is the locus of 
P as OA varies. Find parametric equations for this curve 



*In general it is very difficult to find rational points on curves, and it 
is quite remarkable that we are able to do so for the case of the unit 
circle. For instance, it has been asserted that if n is any integer > 2, 
the only rational points on the curve x n + y n = I are those for which 
x = 0 or y — 0. This statement is called Fermat's last theorem. It has 
been proved for many particular values of n over the past 350 years, 
but in its full generality it remains to this day one of the most famous 
(and intractable) unsolved problems of mathematics. See the last foot¬ 
note in Section 1.4. 
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by using as parameter the angle 6 from the positive 
x-axis to the line OA. Also find its rectangular equa- 
tion. f 

15 The involute of a circle is the curve traced out by the 
point at the end of a thread as the thread is held taut 
and unwound from a fixed spool, as shown in Fig. 17.10. 
If the center of the spool is placed at the origin and its 
radius is a , and if the thread begins to unwind at the 
point A = (a, 0), find parametric equations for the in¬ 
volute by using the angle AOT shown in the figure as 
the parameter 6. 

*16 The folium of Descartes, shown in Fig 17.11, is the 
graph of the equation x 3 + y 3 = 3axy} 

(a) Introduce the parameter t = y/x and find paramet¬ 
ric equations for the curve. 

(b) Use the equations found in (a) to show that the line 
x + y + a = 0isan asymptote, by showing that* + 
y —> — a as t —> — 1. 

(c) The folium is clearly symmetric about the line y = 
x, because interchanging * and y leaves the equa¬ 
tion unaltered. Use this, together with the geomet¬ 
ric meaning of t and the results of (a) and (b), to 
verify as much as possible of the general nature of 


The witch is sometimes called the witch of Agnesi after the Italian 
mathematician Maria Agnesi (1718-1799), who referred to it in her 
book on calculus published in 1748. This curve had previously been 
called the versoria, a Latin word meaning “a guy rope attached to a 
sail,” but she apparently confused this word with a different Latin 
word, versiera, meaning a “witch,” and the name has stuck. 

The word folium means “leaf” in Latin. This curve was originally 
used by Descartes as a challenge to Fermat to find its tangent line at 
an arbitrary point. Fermat succeeded immediately, much to Descartes’s 
dismay. 



Figure 17.10 The involute of a circle. 


the curve as suggested by the figure. In particular, 
decide how various parts of the curve are traced out 
as t varies over various ranges of values. 



Figure 17.11 The folium of Descartes. 


17.2 

THE CYCLOID AND 
OTHER SIMILAR 
CURVES 


The cycloid is the curve traced out by a point on the circumference of a circle 
when the circle rolls along a straight line in its own plane, as shown in Fig. 17.12. 
We shall see that this curve has many remarkable geometric and physical prop¬ 
erties. 

The only convenient way of representing a cycloid is by means of parametric 
equations. We assume that the rolling circle has radius a and that it rolls along 
the x-axis, starting from a position in which the center of the circle is on the pos¬ 
itive y-axis. The curve is the locus of the point P on the circle which is located 
at the origin O when the center C is on the y-axis. The angle 0 in the figure is 
the angle through which the radius CP turns as the circle rolls to a new position. 
If x and y are the coordinates of P, then the rolling of the circle implies that OB = 
arc BP = aO , so x = OB — AB — OB — PQ = a6 — a sin 0 = a(0 — sin 6). 
Also, y = BC — QC — a — a cos 6 = a( 1 — cos 6). The cycloid therefore has the 
parametric representation 


* = a(0 — sin 6 ), 


y = a( 1 - cos 6). 


(1) 
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It is clear from Fig. 17.12 that y is a function of x, but it is also clear from equa¬ 
tions (1) that it is not possible to find a simple formula for this function. The cy¬ 
cloid is one of many curves for which the parametric equations are much sim¬ 
pler and easier to work with than the rectangular equation. 

From equations (1) we have 

, _ dy _ a sin 6 dO _ sin 6 
* V dx a( 1 — cos 0) d6 1 — cos 6 


2 sin {6 cos 
2 sin 2 \0 


cot y 6. 


( 2 ) 


We observe that the derivative y' is not defined for 6 = 0, ±277, ±477, etc. These 
values of 6 correspond to the points where the cycloid touches the x-axis; these 
points are called cusps. The tangent to the cycloid is vertical at the cusps. 

In the following examples we establish the main geometric properties of the 
cycloid. 


Example 1 Show that the area under one arch of the cycloid is three times the 
area of the rolling circle ( Torricelli's Theorem). 


Solution One arch is traced out as the circle turns through one complete revo¬ 
lution. The usual area integral can therefore be written as follows, using the pa¬ 
rameter 6 as the variable of integration: 


A = 



C 2n dx f l7T 

y~^dO = a(\ — cos 0)a(l 

atf Jn 


- cos 6) do 


= a 


2 



— cos 6) 2 d6 = a 2 



— 2 cos 0 + cos 2 6) d6 


= a 2 \ (1 + cos 2 6) dO = a 2 i d6 + a 2 f y(l ± cos 20) d6 = 377a 2 . 

J o J o J o 


Example 2 Show that the length of one arch of the cycloid is four times the di¬ 
ameter of the rolling circle (Wren's Theorem). 

Solution Since dx = a( 1 - cos 6) dO and dy = a sin 6 dO, the element of arc 
length ds is given by 












594 


PARAMETRIC EQUATIONS. VECTORS IN THE PLANE 


ds 2 = dx 2 + dy 2 = a 2 [( 1 - cos 0) 2 + sin 2 6] dd 2 

= 2a 2 [\ — cos 6] d6 2 = 4a 2 sin 2 yd d6 2 , 


SO 

ds = 2 a sin jd dd. 


The length of one arch is therefore 

f f 27r . i " 

2 IT 

L = \ ds = 2a sin id dd — —4 a cos id 

= 8a. 

J J 0 

0 


Example 3 Show that the tangent to the cycloid at the point P in Fig. 17.12 
passes through the top of the rolling circle. 


Solution The point at the top of the circle has coordinates (ad, 2a). The slope 
of the tangent at P is given by (2). The equation of the tangent at P is therefore 

y — a( 1 - cos 6) = S * n & (x — ad + a sin 6). 

1 - cos 6 


We substitute x = ad in this equation and solve for y, which gives 


y — a(\ — cos 6) + 


sin 6 
1 - cos 6 


a sin 6 = 


a( 1 - cos d) 2 + a sin 2 6 
1 - cos 6 


= 2a. 


This shows that the tangent at P does indeed pass through the point (ad, 2d) at 
the top of the circle. 


Galileo seems to have been the first to notice the cycloid and investigate its 
properties, in the early 1600s. He didn’t actually discover any of these proper¬ 
ties, but he gave the curve its name and recommended its study to his friends, 
including Mersenne in Paris. Mersenne informed Descartes and others about it, 
and in 1638 Descartes found a construction for the tangent which is equivalent 
to the property given in Example 3. In 1644 Galileo’s disciple Torricelli (who in¬ 
vented the barometer) published his discovery of the area under one arch. The 
length of one arch was discovered in 1658 by the great English architect Christo¬ 
pher Wren.* The list of famous men who have worked on the cycloid will be 
continued, but first we consider some other related curves. 

If a circle rolls on the inside of a fixed circle, the locus of a point on the rolling 
circle is called a hypocycloid. If a circle rolls on the outside of a fixed circle, the 
locus of a point on the rolling circle is called an epicycloid . t 

We show how to represent a hypocycloid parametrically. Let the fixed circle 
have radius a and the rolling circle radius b, where b < a. Let the fixed circle 
have its center at the origin (Fig. 17.13), and let the smaller rolling circle start 
in a position internally tangent to the fixed circle at the point A on the positive 


*Wren was an astronomer and a mathematician—in fact, Savilian Professor of Astronomy at Ox¬ 
ford—before the Great Fire of London in 1666 gave him his opportunity to build St. Paul’s Cathe¬ 
dral, as well as dozens of smaller churches throughout the city. 

t The distinction between these words is easy to remember because the Greek prefix hypo means un¬ 
der or beneath, as in “hypodermic,” and epi means on or above, as in “epicenter.” 
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x-axis. We consider the point P on the rolling circle that was initially at A. With 
0 and </> as shown in the figure, the rolling of the small circle implies that the 
arcs AB and BP are equal: a 6 = bcf). We can then see that the coordinates of P 
are 


* = (a — b) cos 0 + b cos (</> — 0), 
y = (a — b) sin 0 - b sin (<£ — 0). 



But <f> — 0 = [(a — b)/b] 6, so the parametric equations of the hypocycloid are 


x = (a — b) cos 0 + b cos a . ^ 0, 

b 

y = {a — b) sin 0 — b sin ~ 7 ~~~ 0. 


(3) 


The arc length along the fixed circle between successive cusps of the hypocy¬ 
cloid is 277 b. If 2.77(2 is an integral multiple of hrb, so that a/b is an integer n, 
then the hypocycloid has n cusps and the point P returns to A after the smaller 
circle rolls off its circumference n times on the fixed circle. We leave it to stu¬ 
dents to decide when P will return to A if a/b is a rational number but not an in¬ 
teger, for example, if alb = f. A discussion of the case in which a/b is irrational 
is beyond the scope of this book; it suffices to say that as the smaller circle rolls 
around and around indefinitely, the cusps of the resulting hypocycloid are evenly 
and densely distributed on the fixed circle.* 

The parametric equations of a hypocycloid of four cusps can be written in a 
very simple form by using some trigonometric identities. If a = \b, equations 
(3) become 

x — 3b cos 0 + b cos 30, y =3b sin 0 - b sin 30. 


*The curious reader will find additional information in Theorem 439 of G. H. Hardy and E. M. 
Wright, Introduction to the Theory of Numbers (Oxford, 1954); or in Theorem 6.3 of I. Niven, Irra¬ 
tional Numbers (Wiley, 1956). 
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cos 3 6 = cos (26 + d) = cos 26 cos 6 - sin 26 sin 6 
= (2 cos 2 6 - 1) cos 6 - 2 sin 2 6 cos 6 
= [2 cos 2 6 - 1 - 2(1 - cos 2 0)] cos 6 
= 4 cos 3 6 - 3 cos 6, 
and a similar calculation yields 

sin 30 = 3 sin 6 - 4 sin 3 6 . 


Our parametric equations therefore become 

x = 4b cos 3 6 = a cos 3 6 , y = 40 sin 3 6 = a sin 3 6. (4) 


From these equations it is easy to obtain the corresponding rectangular equation, 


^ 2/3 + ^ 2/3 = ^ 2/3 


(5) 



Because of its appearance (Fig. 17.14), a hypocycloid of four cusps is often called 
an astroid. 

Example 4 Consider the tangent to the astroid at a point P in the first quadrant. 
Show that the part of this tangent which is cut off by the coordinate axes has 
constant length, independent of the position of P. 

Solution By equations (4), the slope of the tangent is 

, _ dy _ 3 a sin 2 6 cos 6 d6 _ _ 

^ dx -3a cos 2 6 sin 6 d6 tan 

so the equation of the tangent is 

y — a sin 3 6 = -tan 6 (x — a cos 3 6). 


We find the x-intercept by putting y = 0 and solving for x , 
x = a cos 3 6 + a sin 2 6 cos 6- a cos 6. 


Similarly, the y-intercept is y = a sin 6. The length of the part of the tangent cut 
off by the axes is therefore 

V a 2 cos 2 6 + a 2 sin 2 6 = a , 

which is constant. 


y 



Figure 17.15 


We now return to the cycloid discussed earlier, and reflect both it and the 
y-axis about the x-axis, as shown in Fig. 17.15. The parametric equations (1) are 
still valid, and the resulting curve has several interesting physical properties, 
which we now describe and analyze. 

In 1696 John Bernoulli conceived and solved the now famous brachistochrone 
problem. He published the problem (but not the solution) as a challenge to other 
mathematicians of the time. The problem is this: Among all smooth curves in a 
vertical plane that join a given point Pq to a given lower point P\ not directly be¬ 
low it, find that particular curve along which a particle will slide down from Pq 
to P i in the shortest possible time.* We can think of the particle as a bead of 


*The word “brachistochrone” comes from two Greek words meaning “shortest time.’ 
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mass m sliding down an ideal frictionless wire, with the downward force of grav¬ 
ity mg as the only force acting on the bead. 

If we assume that the points P 0 and P\ lie at the origin and at (jci, y\) in the 
first quadrant, as shown in Fig. 17.16, then Bernoulli’s problem can be stated in 
mathematical language as follows. The bead is released from rest at P 0 , so its 
initial velocity and initial kinetic energy are zero. The work done by gravity in 
pulling it down from the origin to an arbitrary point P = (x, y) is mgy. This must 
equal the increase in the kinetic energy of the bead as it slides down the wire to 
this point, so 

\mv 2 = mgy , 

and therefore 




(6) 


This can be written as 


_ ds _ \/dx 2 + dy 2 _ V1 + (dy/dx) 2 dx ^ 

V2 ~gy V2gy V2gy 

The total time T\ required for the bead to slide down the wire from P 0 to P\ will 
depend on the shape of the wire as specified by its equation y = f(x)\ it is given 

The brachistochrone problem therefore amounts to this: to find the particular 
curve y = f(x) that passes through P 0 and P\ and minimizes the value of the in¬ 
tegral (8). 

Since the straight line joining P 0 and P\ is clearly the shortest path, we might 
guess that this line also yields the shortest time. However, a moment’s consider¬ 
ation of the possibilities will make us more skeptical about this conjecture. There 
might be an advantage in having the bead slide down more steeply at first, thereby 
increasing its speed more quickly at the beginning of the motion; for with a faster 
start, it is reasonable to suppose that the bead might reach P\ in a shorter time, 
even though it travels over a longer path. And this is the way it turns out: The 
brachistochrone curve is an arc of a cycloid through P 0 and P\ with a cusp at the 
origin. 
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B 


Figure 17.17 Beads released on the 
cycloidal wire at O, A, C will reach B 
in the same amount of time. 


Leibniz and Newton, as well as John Bernoulli and his older brother James, 
solved the problem.* John’s solution, which is very ingenious but rather special¬ 
ized in the methods it uses, is given in the Appendix at the end of this chapter. 
The cycloid was well known to all these men through the earlier work of the 
great Dutch scientist Huygens on pendulum clocks (see below). When John found 
that the cycloid is also the solution of his brachistochrone problem, he was as¬ 
tounded and delighted. He wrote: “With justice we admire Huygens because he 
first discovered that a heavy particle slides down to the bottom of a cycloid in 
the same time, no matter where it starts. But you [his readers] will be petrified 
with astonishment when I say that this very same cycloid, the tautochrone of 
Huygens, is also the brachistochrone we are seeking.”* 

Huygens was a profound student of the theory of the pendulum, and in fact 
was the inventor of the pendulum clock. He was very well aware of the theoret¬ 
ical flaw in such a clock, which is due to the fact that the period of oscillation 
of a pendulum is not strictly independent of the amplitude of the swing.* We can 
express this flaw in another way by saying that if a bead is released on a fric¬ 
tionless circular wire in a vertical plane, then the time the bead takes to slide 
down to the bottom will depend on the height of the starting point. Huygens won¬ 
dered what would happen if the circular wire were replaced by one having the 
shape of an inverted cycloidal arch. But he did more than merely wonder, for he 
then went on to make the remarkable discovery referred to in the passage previ¬ 
ously quoted, that for a wire of this shape the bead will slide down from any 
point to the bottom in exactly the same time, no matter where it is released (Fig. 
17.17). This is the tautochrone (“equal time”) property of the cycloid, and we 
now prove it by using the formulas given above. 

If we write (8) in the equivalent form 

^ _ f I dx 2 + dy 2 

T ' ~ J \ Igy 


and substitute equations (1) into this, we obtain 

= f 0 ' 1 2« 2 (1 - cos 6) 

1 j 0 V 2*g(l - cos 6) 


dd= 0 



as the time required for the bead to slide down a cycloidal wire from P 0 to P j. 
The time needed for the bead to reach the bottom of this wire is the value of T\ 
when 6\ = i r, namely, ir^/a/g. Huygens’ tautochrone property amounts to the 
statement that the bead will reach the bottom in exactly the same time if it starts 
at any intermediate point (to, yo)- To prove this, we replace (6) by 

= V2*O->> 0 ). 


*Newton published his solution anonymously. When John Bernoulli saw it, he wryly remarked, “I 
recognize the lion by his track.” 

f For an English translation of Bernoulli’s writings on this subject, see pp. 644-655 of D. E. Smith, 
A Source Book in Mathematics (Macmillan, 1929). Bernoulli’s vivid, enthusiastic, personal style is 
in sharp contrast to the dead, gray, impersonal style of most of the writing in scientific journals nowa¬ 
days. 

*See the remark about the “circular error” in Example 3 in Section 9.6. 
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The total time required for the bead to slide down to the bottom is therefore 


, _ r ir / 2<3 2 3 (1 - cos 0) ~ 

j 0o V 2<3g(COS 00 “ cos 0) 


d0 = 



1 - cos e de 
g J e Q y cos 0 O ~ cos 0 


cin rrf) rift 



where the last step makes use of the trigonometric identities 2 sin 2 0 = 1 — 
cos 20 and cos 20 = cos 2 0 — sin 2 0 = 2 cos 2 0 — 1. If we now use the substi¬ 
tution 


cos {0 1 sin {0 d0 

-:—, du = - — -j-, 

cos 2 0o 2 cos 20 o 


then the integral (9) becomes 


T = 



du 

Vi - « 2 




This shows that T has the same value as before and i s therefore independent of 
the starting point, and the argument is complete.* 

Once Huygens established the tautochrone property of the cycloid, a further 
problem presented itself: How could he arrange for a pendulum in a clock to 
move along a cycloidal, rather than a circular, path? Here he made a further beau¬ 
tiful discovery. If we suspend from the point P at the cusp between two equal 
inverted cycloidal semiarches a flexible pendulum whose length equals the length 
of one of the semiarches (Fig. 17.18), then the bob will draw up as it swings to 
the side in such a way that its path is another cycloid. 1 " 


P 



Figure 17.18 A flexible pendulum 
constrained by cycloidal jaws swings 
along another cycloid. 


■"Instead of a “frictionless” bead sliding down a cycloidal wire, we can use a “frictionless” steel ball 
rolling down a cycloidal channel. A piece of eighteenth century furniture built to demonstrate this 
can be seen in the Science Museum in Florence, Italy. The fine woodworker John H. Lewis of Col¬ 
orado Springs has built a work of art based on this principle, with two parallel cycloidal channels 
for two balls; when released from different positions at the same moment, the balls roll to the bot¬ 
tom in the same time. 

f For a proof of this statement, see Section B .23 of the author’s Calculus Gems (McGraw-Hill, 1992). 


PROBLEMS 


1 Find the rectangular equation of the cycloid by elimi¬ 
nating 0 from the parametric equations (1). Observe how 
hopeless it is to try to solve this for y as a simple func¬ 
tion of x. 

2 Show that for the cycloid (1) the second derivative is 
given by y" = dy'/dx = ~a/y 2 . Observe that this fact im¬ 
plies that the cycloid is concave down between the cusps, 
as shown in Fig. 17.12. 

3 Use the equation of the normal to the cycloid at P (in 
Fig. 17.12) to show that this normal passes through the 
point B at the bottom of the rolling circle. Also, obtain 


this conclusion from the result of Example 3 by using 
elementary geometry. 

4 Assume that the circle in Fig. 17.12 rolls to the right 
along the jc-axis at a constant speed, with the center C 
moving at Vo units per second, (a) Find the rates of change 
of the coordinates x and y of the point P. (b) What is the 
greatest rate of increase of x, and where is P when this 
is attained? (c) What is the greatest rate of increase of y, 
and for what value of 0 is this attained? 

5 If a polygon ABCD rolls (awkwardly) on a straight line 
A'D\ as shown in Fig. 17.19, then the point A will trace 
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out in succession several arcs of circles with centers B', 
C\ D'. The tangent to any such arc is evidently perpen¬ 
dicular to the line joining the point of tangency to the 
corresponding center. Therefore, if the rolling circle that 
generates a cycloid is thought of as a polygon with an 
infinite number of sides, then the tangent to the cycloid 
at any point is the line perpendicular to the line joining 
the point of tangency to the bottom of the rolling circle. 
This is Descartes’s method for finding the tangent at any 
point of a cycloid. Verify that it is correct. 



Figure 17.19 

6 Find the area inside the astroid (5). 

7 Find the total length of the astroid (5). 

8 Find the area of the surface generated by revolving the 
astroid (5) about the x-axis. 


9 Show that the hypocycloid of two cusps, with a = 2b, \s 
simply the diameter of the fixed circle that lies along the 
jc-axis. In this case, if the center C of the rolling circle 
moves around with constant angular velocity co, so that 
dQ/dt = co, show that P moves back and forth on the 
x-axis with simple harmonic motion (Section 9.6) of pe¬ 
riod IttIco and maximum speed aco. 

10 If the astroid (4) i s generated b y the small circle rolling 
around counterclockwise with constant angular velocity 
co, find the position of the point P in the first quadrant 
for which y is increasing most rapidly. 

11 The hypocycloid of three cusps, with a = 3b, is called a 
deltoid. Sketch this curve, find its parametric equations, 
and find its total length. 

12 Find parametric equations for the epicycloid generated 
by a circle of radius b rolling on the outside of a fixed 
circle of radius a. Use a figure similar to Fig. 17.13, 
where the fixed circle has its center at the origin and the 
point P is initially at ( a, 0). 

13 Show that the equations in Problem 12 can be obtained 
from equations (3) in the text by replacing b by —b. 

14 The epicycloid of two cusps, with a = 2b, is called a 
nephroid (meaning “kidney-shaped”). Sketch this curve, 
find its parametric equations, and calculate its total 
length. 


17.3 

VECTOR ALGEBRA. 
THE UNIT VECTORS i 
AND j 


A physical quantity such as mass, temperature, or kinetic energy is completely 
determined by a single real number that specifies its magnitude. These are called 
scalar quantities, or simply scalars. In contrast to this, other entities called vec¬ 
tor quantities, or vectors, possess both magnitude and direction. As examples we 
mention velocities, forces, and displacements. 



Example 1 Let us briefly consider the case of velocity. When we discuss a point 
moving along a straight line, we can specify its position by means of a coordi¬ 
nate, which may be positive or negative, and the velocity of the moving point is 
the derivative of this coordinate with respect to time, that is, the rate of change 
of position. Direction is certainly important in such a discussion, but in this sim¬ 
ple one-dimensional case all questions about direction are easy to handle by us¬ 
ing positive and negative numbers. 

However, to specify the velocity of a point moving along a curved path in the 
plane, it is essential to give both the speed of the point (the rate at which it tra¬ 
verses distance) and the direction of its motion. This combination of two ingre¬ 
dients is the velocity vector, or simply the velocity, of the moving point. It is nat¬ 
ural to represent this vector (see Fig. 17.20) by an arrow or directed line segment 
v whose tail is placed at the current position of the point, whose length is the 
speed in some agreed system of measurement, and whose direction is the direc¬ 
tion of motion. 


Figure 17.20 
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Example 2 A force applied to an object is also a vector quantity, whose mag¬ 
nitude is the strength of the force and whose direction is the direction in which 
the force acts. For instance, the gravitational force F exerted by the earth on a 
circling artificial satellite (Fig. 17.21) is directed toward the center of the earth 
and its magnitude is proportional to 1/r 2 , where r is the distance from the satel¬ 
lite to the center of the earth. 


From the mathematical point of view, we don’t merely represent a vector by 
a directed line segment; we say that a vector is a directed line segment. This frees 
us to develop the algebra of vectors independently of any particular physical in¬ 
terpretation. 

As we have indicated, vectors are often denoted in print by boldface type. A 
good substitute for this in the case of handwritten work is to use letters with ar¬ 
rows over them. Thus, v and v denote the same vector. Also, if a vector extends 
from a point P to a point Q, we can place an arrowhead at Q and denote the vec¬ 
tor by PQ (Fig. 17.22). We then call P the tail or initial point and Q the head or 
terminal point of the vector. The vector PQ can be thought of as representing the 
displacement of a point along the line segment from P to Q, that is, the path 
taken by a point as it moves from P to Q. Such vectors describe the relative po¬ 
sitions of points. The length or magnitude of a vector PQ is denoted by the sym¬ 
bol \PQ[, this notation is used because the length of a vector is in many ways 
similar to the absolute value of a real number. 

Two vectors PQ and RS are said to be equal, and we write PQ = RS, if they 
have the same length and direction (Fig. 17.22). This definition of equality en¬ 
ables us to move a vector from one position to another without changing it, as 
long as its length and direction are unaltered. Thus, the vectors shown in Fig. 
17.23 are all equal to each other; in other words, they are the same vector in dif¬ 
ferent positions. The position vector of a point P in the coordinate plane is the 
vector OP from the origin O to the point P (Fig. 17.24). Such vectors describe 
the positions of points relative to the origin. As suggested in Fig. 17.23, any vec¬ 
tor A can be placed with its tail at the origin, and thereby becomes the position 
vector of the point P that lies at its head. 

We shall discuss two algebraic operations on vectors. The first operation is 
that of adding two vectors to get another vector, and the second is that of mul¬ 
tiplying a vector by a number to get another vector. In any discussion involving 
vectors, it is customary to refer to numbers as scalars, and this second operation 
is usually called scalar multiplication. 

First, addition. Suppose a vector A = PQ represents the displacement of a point 
along the line segment from P to Q. As shown in Fig. 17.25, when the dis- 




Figure 17.22 PQ = R? 



Figure 17.23 The same vector in 
different positions. 
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Figure 17.26 The commutative and 
associative laws. 





placement A = PQ is followed by a displacement B = QR, the final result is 
equivalent to the single displacement PR. It is therefore natural to think of PR 
as the sum of PQ and QR , and to write 

PR = PQ + QR. 

This suggests the definition we adopt for vector addition: If A and B are any two 
vectors, we add them as shown in the figure, by placing the tail of B at the head 
of A; the vector from the tail of A to the head of B is then written A + B and 
called the sum of A and B. Figure 17.26 shows that addition is commutative and 
associative, 

A + B = B + A and A + (B + C) = (A + B) + C. 

The associative law enables us to omit parentheses, writing A + B + C for A + 
(B + C). 

These ideas suggest another equivalent way to find the sum of two vectors A 
and B. If we place their tails together, as in Fig. 17.27, and form the parallelo¬ 
gram with A and B as adjacent sides, then A + B is the vector from the com¬ 
mon tail to the opposite vertex. This shows that our definition of addition is well 
suited to working with forces in physics; for if A and B are interpreted as two 
forces acting at their common tail, then it is known from experiment that A + B 
is the resultant force , that is, the single force that produces the same effect as the 
two combined forces. This is called the parallelogram rule , for the addition of 
forces and also for vector addition. 



Figure 17.28 


Example 3 Velocities are also combined by the parallelogram rule. For instance, 
a man in a canoe wishes to paddle across a river to the point on the other bank 
directly opposite to his starting point (Fig. 17.28). The river flows at 3 mi/h, and 
he can paddle at 6 mi/h. In what direction should he aim his canoe? 

Solution His actual velocity is the vector sum of the velocity of the water and 
his velocity relative to the water. For this sum to be perpendicular to the bank, 
he must aim his canoe upstream at an angle 6 for which sin 6 = f = i, so 
6= 30°. 


Now for scalar multiplication. If we add a vector A to itself, we obtain a vec¬ 
tor in the same direction but twice as long, and it is natural to write this as A + 
A = 2A. By a natural extension, if c is any real number, then cA is defined to 
be the vector which is \c\ times as long as A, in the same direction as A if c is 
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positive and in the opposite direction if c is negative (Fig. 17.29). A vector of 
zero length is denoted by 0 and called the zero vector ; this vector has no direc¬ 
tion. Evidently 1 • A = A and 0 • A = 0. The properties 

c(d\) = (cd) A, 

(c + d) A = cA + d \, 



c(A + B) = cA + cB 


are valid and easy to establish, but we shall not pause to discuss them in detail. 
It is worth noting, however, that a proof of the last property for the case c > 0 
is implicit in Fig. 17.30. Also, we agree that the factor c can be written on either 
side of the vector, cA = Ac; we will not employ this clumsy usage very often, 
but it is occasionally convenient. 

The vector ( - 1) • B is written -B; it is evidently a vector equal in length to 
B but having the opposite direction. Just as in elementary algebra, A + (-B) is 
written A - B. There is a simple geometric construction for A — B, resulting 
from the fact that A - B is what must be added to B to give A: When A and B 
are placed so that their tails coincide, then A — B is the vector from the head of 
B to the head of A (Fig. 17.31). 

Since the laws governing addition and scalar multiplication of vectors are iden¬ 
tical with those that we know from elementary algebra, we are justified in using 
the familiar rules of algebra to solve linear equations involving vectors. The fol¬ 
lowing examples illustrate the efficiency of these procedures for solving certain 
types of geometric problems. 




Figure 17.31 Subtraction. 


Example 4 In Fig. 17.32, the ratio of the segment AP to the segment AB is r, 
where 0 < t < 1 . Express the vector R in terms of A, B, and t. 

Solution The vector AB is B - A, and the vector AP is r(B - A). Since 
R = A + AP, we have 

R = A + r(B - A) = (1 - r)A + rB. 

In particular, if P is the midpoint of AB so that t = -j, then 

R = = j(A + B). 


A 



Example 5 Use vector methods to show that the three medians of any triangle 
intersect at a point which is two-thirds of the way from each vertex to the mid¬ 
point of the opposite side.* 

Solution Let A, B, C be the vertices of a triangle (Fig. 17.33), and let A, B, C 
be the vectors from an outside point O to these vertices. If M is the midpoint of 
BC, then OM = y(B + C),AM = OM - A = i(B + C) - A, and if Pis the point 
two-thirds of the way from A to M, then we have 

OP = A + §AM = A + |[y(B + C) - A] 

= yA + y(B + C) = y(A + B + C). (1) 

"Recall that a median of a triangle is a segment drawn from a vertex to the midpoint of the opposite 
side. 



Figure 17.33 
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Similarly, if N is the midpoint of AC and Q is two-thirds of the way from B to 
N, then we see that 

OQ = B + f&Y = B + f(CW - B) 

= B + M(A + C) - B] = {(A + B + C). (2) 

Comparison of (1) and (2) shows that the two points P and Q coincide, and in the 
same way we obtain the same point yet again if we go two-thirds of the way from 
C to the midpoint of AB. This completes the proof. We can also draw our con¬ 
clusion more elegantly by observing that since (1) is symmetric in the three vectors 
A, B, C, we clearly get the same point no matter which midpoint we start with. 



A vector of length 1 is called a unit lector. It is easy to see that if we divide 
any nonzero vector A by its own length, we obtain a unit vector A/|A| in the same 
direction. This simple fact is surprisingly useful. 

When we are working with vectors in the coordinate plane, it is often conve¬ 
nient to use the standard unit vectors i and j; as shown in Fig. 17.34, i points in 
the direction of the positive x-axis and j points in the direction of the positive 
y-axis. We have seen that any vector A in the xy-plane can be placed with its tail 
at the origin, and in this way becomes the position vector OP of the point P at 
its head. If P has coordinates a \ and a 2 , then the vectors a \i and a 2 j run from the 
origin to the points a\ and a 2 on the axes, and by the parallelogram rule we have 

\ = aii-^a 2 y (3) 

The number a\ in (3) is called the x-component or i -component of the vector A, 
and a 2 is called its y-component or ycomponent. These components are scalars, 
and should be distinguished from the vector components a \i and a 2 j. By the 
Pythagorean theorem, we clearly have 

| A | = V^i * i 2 3 + a 2 2 . 


Formula (3) tells us that every vector in the plane is a linear combination of 

i and j. The value of this formula is based on the fact that such linear combina¬ 
tions can be manipulated by the ordinary rules of algebra. Thus, if 

A = a\\ + and B = b\\ + by], 

then 

A + B = (fl]i + a 2 j) + (b\\ + b 2 j) 

= (a, + b\)i + (a 2 + b 2 ) j. 


Also, 


cA = c{a \i + a 2 j) = (caQi + (ca 2 ) j. 


Example 6 If A = 3i + 4j and B = 2i — 5j, find |A| and express 3A — 4B in 
terms of i and j. 

Clearly, |A| = V9 + 16 = 5 and 

3 A - 4B = 3(3i + 4j) - 4(2i - 5j) 

= i + 32j. 


Solution 
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PROBLEMS 

8 Find a unit vector which, if its tail is placed at the point 
(4, 4) on x 2 = 4y, is normal to the curve and points to¬ 
ward the positive y-axis. 

9 If A is a nonzero vector, we know that A/|A| is a unit 
vector with the same direction as A. Use this fact to write 
down a vector that bisects the angle between two nonzero 
vectors A and B whose tails coincide. 

10 Three vectors are drawn from the vertices of a triangle 
to the midpoints of the opposite sides. Show that the sum 
of these vectors is zero. 

11 Use vector methods to show that the diagonals of a par¬ 
allelogram bisect each other. 

12 Use vector methods to show that a line from a vertex of 
a parallelogram to the midpoint of a nonadjacent side tri¬ 
sects a diagonal. 

13 Solve the canoe problem in Example 3 when the current 
and canoe speeds are 2 and 2N/2 = 2.8 mi/h, respec¬ 
tively. 

14 If the velocity of the wind is v w and an airplane flies with 
velocity \ a relative to the air, then the velocity of the 
plane relative to the ground is 

Vg = v w + V fl . 

The vectors \ a and \ g are called the apparent velocity 
and the true velocity, respectively. 

(a) If the wind is blowing from the northeast at 60 mi/h 
and the pilot wishes to fly straight east at 600 mi/h, 
what should be the plane’s apparent velocity? 

(b) Repeat part (a) if the pilot wishes to fly southeast at 
600 mi/h. 


In Section 17.3 we became acquainted with the algebra of vectors. In the rest of 
this chapter we shall be interested in problems of motion, and this requires us 
to work with the calculus of vectors. When vectors and calculus are allowed 
to interact with each other, the result is a mathematical discipline of great 
power and efficiency for studying multidimensional problems of geometry and 
physics. This vector calculus—usually called vector analysis —is one of the 
major topics of advanced courses in calculus. In this chapter we can only intro¬ 
duce the subject and discuss a few of the classic applications, culminating in a 
treatment of Kepler’s laws of planetary motion and Newton’s law of universal 
gravitation. 

We begin by pointing out the connection between vectors and the parametric 
equations of curves discussed in the first two sections of this chapter. 

Suppose a point P = (jc, y) moves along a curve in the xy-plane, and sup¬ 
pose further that we know its position at any time t (Fig. 17.35). This means 
that the coordinates x and y are known as functions of the scalar variable t, 
so that 

x — x(t) and y — y(f). 


17.4 

DERIVATIVES OF 
VECTOR FUNCTIONS. 
VELOCITY AND 
ACCELERATION 



1 For each of the following pairs of vectors A and B, find 
|A|, 3A - 5B, and 6A + B: 

(a) A = i - 3j, B = — 2i + 5j; 

(b) A = -7i + 2j, B = 3i + 2j; 

(c) A = —6j, B = -2i + 3j; 

(d) A = 3i + 5j, B = 2i - 8j. 

2 For each of the following pairs of points P and Q , find 
the vector PQ in terms of i and j: 

(a) P = ( — 5, 0), Q = (1, 3); 

(b) P = (—\, —4), Q = (—2, 3); 

(c) P = (1, —5), Q = (6, 4); 

(d) F = (0, 2), Q= (3, -5). 

3 For any three points P, Q, R in the plane, we have 

PQ + QR + RP = 0. 

Why? Verify this for the special case P = (2, -4), Q = 
(—3, 5), R = (—4, 0) by expressing each vector in terms 
i and j and carrying out the addition. 

4 Find a vector in the same direction as 6i — 2j that has 
(a) three times its length; (b) half its length. 

5 For each of the following vectors A, find two unit vec¬ 
tors parallel to A: 

(a) A = 3i - 4j; 

(b) A = —5i + 12j; 

(c) A = 5i — 7j; 

(d) A = 24i-7j. 

6 Find a vector of length 3 which has (a) the same direc¬ 
tion as 5i — 2j; (b) the opposite direction to 4i + 5j. 

7 Find two vectors of length 26 and slope 
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These are parametric equations for the path in terms of the time parameter t* A 
more concise description of the motion is obtained by using the position vector 
of the moving point, 

R = OP = x(t)i + yWj- 

We can emphasize that R is a vector function of t by writing R = R(r). Thus the 
study of a pair of parametric equations is equivalent to the study of a single vec¬ 
tor function, and, as we shall see, the latter is often much more effective at re¬ 
vealing the essence of what is going on. 

Just as in ordinary calculus, R(r) is said to be continuous at t = r 0 if 

lim R(r) = R(r 0 ), (1) 

which means that |R(r) - R(r 0 )| can be made as small as we please by taking t 
sufficiently close to to. It follows easily from (1) that 

R(t) = x(t)\ + y(t)i (2) 


is continuous if and only if x(t) and y(t) are both continuous. 

We define the derivative of the vector function R (t) exactly as might be ex¬ 
pected. When t changes to f + Ar, the change in R is AR = R (t + A t) - R(r), 
and the derivative of R(r) with respect to t is defined as the limit 


d R 

~T — hm 
dt 0 


AR 

A t ' 


(3) 


That is, we divide the vector AR by A t and then find the limit of the new vector 
AR/A t as A t —> 0. This vector will approach a limit if and only if its head ap¬ 
proaches a limiting position, and this happens if and only if each of its compo¬ 
nents approaches a limit. It is clear that in terms of components we have 

AR = R(r + A/) - RQ) 

A t At 

_ x(t + At) - x(t) . y(t + At) - y(t) . 

At * At J 


Thus, if the definition (3) is applied to (2), we see at once that R(t) is differen¬ 
tiable if and only if ;c(r) and y(t) are, and in this case 


JR _ dx . dy . 
dt dt dt ^ 


(4) 


As in ordinary calculus, we often write R \t) for dR/dt and R ”(t) for d 2 R/dt 2 . 

Several familiar differentiation rules can now be extended to vector functions. 
One of the most important is this: If a vector function is multiplied by a scalar 
function, and if both can be differentiated, then their product can be differenti¬ 
ated according to the rule 


d , dR du 

— (mR) = u —— + R —. 
dt dt dt 


*In Section 17.1 we wrote the parametric equations of a curve as x = f{t), y = g(/). However, it is 
more direct and convenient to use the same letter for the function as for the dependent variable, as 
we do here. 
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This is proved in just the same way as the product rule for two scalar functions. 
Also, the rule for the sum of two vector functions is just what we expect, the de¬ 
rivative of a constant vector function is the vector 0, and the chain rule is valid. 

It is important to understand the meaning of the derivative dR/dt as a vector, 
and not merely in terms of its components as given by (4). To do this, we fol¬ 
low the geometric meaning of the various steps expressed in the definition (3). 
First, the change A t of the independent variable t carries the position vector from 
R(r) to R (t + Ar), as shown in Fig. 17.36. The vector AR = R(r + A t) — R(r) is 
directed along the chord from the head of R(r) to the head of R (t + Ar). Divid¬ 
ing AR by the scalar A t changes its length and produces another vector AR/A t 
parallel to AR. Since the limiting direction of the chord as Ar —» 0 is the direc¬ 
tion of the tangent, the derivative dR/dt is tangent to the path at the head of R. 
As we know, every vector can be thought of as having its tail at the origin, but 
in Fig. 17.36 we place the tail of dR/dt at the head of R in order better to visu¬ 
alize what is happening. 

To interpret the length of the vector dR/dt, let 5 be the length of the curve from 
a fixed point Pq given by t = to to the variable point P given by t, where t > fo 
By (4) we have 

^/R _ // z/jc \ 2 fdy\ 2 _ S/dx 2 + dy 2 _ ds_ 

dt ~\l(dt) \dtj~ dt ~~dt () 



Since t is time in the present discussion, the derivative ds/dt is the rate at which 
the moving point P traverses distance, that is, its speed. 

These observations tell us that the vector dRJdt has as its direction and length 
the direction and speed of our moving point. It is therefore natural to adopt the 
following formal definitions. Just as in the case of one-dimensional motion, we 
define the velocity v of a moving point as the rate of change of its position, 

dR 


and the speed v as the magnitude of the velocity, 


v 


d R 

dt 


Example 1 If R = (4 cos 2t)\ + (3 sin 2r)j, find the path of the moving point, 
the velocity v, and the points on the path where the speed v is greatest and least. 


Solution The curve has parametric equations x = 4 cos 2t, y = 3 sin 2 r, so the 
path is the ellipse shown in Fig. 17.37, 



The point P = (x, y) moves around this ellipse in the counterclockwise direction, 
as indicated by the arrows in the figure. The velocity is 

v = (—8 sin 2r)i + (6 cos 2r)j, (6) 

and the speed is 

V = |v| = (64 sin 2 It + 36 cos 2 2 t) m = (28 sin 2 2t + 36) l/2 . 



Figure 17.37 
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It is clear from this formula for v that the smallest speed is 6, and this occurs 
when sin It = 0, and by the given formula for R, this happens when P is at ei¬ 
ther end of the major axis. The greatest speed is 8, and this occurs when sin It = 
1, so that cos It = 0, that is, at either end of the minor axis. 


Just as the velocity v of our moving point is the rate of change of its position, 
the acceleration a is the rate of change of its velocity, 

= dx d 2 R 

3 dt dt 2 ' 


Thus, our present concepts of velocity and acceleration are direct extensions of 
more limited versions of these concepts from our earlier studies of one-dimen¬ 
sional motion. 

If the moving point P is the location of a moving physical object, and can 
therefore be thought of as a particle of mass m moving under the action of an 
applied force F, then Newton's second law of motion states that 

F = ma. (7) 



This vector form of Newton’s law shows that the force and acceleration vectors 
both have the same direction. Since we visualize the force F as being applied to 
the particle, so that its tail is at P, it is customary also to place the tail of a at P, 
as shown in Fig. 17.38. Both F and a usually point toward the concave side of 
the curve, but in exceptional cases they may be tangent to the curve. 

Example 1 (continued) To find the acceleration a of the motion given by R = 
(4 cos 2t)\ + (3 sin 2r)j, we have only to differentiate the velocity (6) with re¬ 
spect to t, 

dx 

a = —— = (— 16 cos 2r)i + (-12 sin 2t)\. 
dt 

Since this can be written in the form 

a = -4[(4 cos 2t)\ + (3 sin 2r)j] 

= -4R, 


the acceleration vector is always directed toward the center of the elliptical path. 



A simple but important situation is that in which a particle travels at constant 
speed around a circular path. 

Example 2 Uniform circular motion. A particle of mass m moves counter¬ 
clockwise around the circle x 2 + y 2 = r 2 with constant speed u Find the accel¬ 
eration of the particle and the force needed to produce this motion. 

Solution By using the notation in Fig. 17.39, the path can be written as 

R = (r cos 6 )i + (r sin 0) j, (8) 

with 6 as the parameter. Since s = rQ, we have 

v _ ds_ _ r d0^ 


Figure 17.39 
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and therefore dOldt = vlr . This enables us to find the velocity and acceleration 
from (8) by using the chain rule: 

dR _ d R dd 
V “ dt ~ de ' dt 

= [(— r sin 0)i + (r cos 0)j] • — 

r 

= ^ [(—sin 0)i + (cos 0)j]; 
and 

_ dv_ _ dv_ d0_ 
dt dd dt 

= v [(-cos 0)i + (—sin 0)j] ■ y 
ip- 

=- [(cos 0)i + (sin 0)j]. 

r 

By multiplying and dividing by r, it is easy to see that a = ~(v 2 /r 2 ) R. This tells 
us that the acceleration vector a points in toward the center of the circle and has 
magnitude 


By Newton’s law (7), the force F needed to produce this motion must point to¬ 
ward the center of the circle and have constant magnitude mv 2 /r. Such a force is 
called a centripetal force. 


It is obvious by now that the time t is a parameter of fundamental importance 
for studying the motion of a point P along a curved path. Another important pa¬ 
rameter is the arc length s, measured along the curve from a fixed point P 0 t° P* 
as shown in Fig. 17.40. We now consider R as a function of 5 and examine the 
meaning of the derivative dR/ds. If P moves along the curve to Q when 5 changes 
to 5 + As, then 

AR _ PQ 
A s A s 

is a vector in the direction of the chord from P to Q whose length is 

PQ _ chord 
As arc 

When As —> 0, the direction of the chord approaches the direction of the tangent 
and the ratio of the chord to the arc approaches 1. Therefore the vector T, which 
is defined by 

T = = lim 4^-’ 

ds As—>o As 



is a vector of unit length which is tangent to the curve at P and points in the di¬ 
rection of increasing s. T is called the unit tangent vector. 

To clarify our first use of T, we recall that the formulas for the velocity v and 
acceleration a in terms of their components are 
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dx dy . , 

V= * ,+ * J and 


d 1 2 x . d 2 y . 

-d?' + l? y 


(9) 


These formulas are convenient for calculation, but they don’t contribute much to 
our intuitive understanding of the nature of the vectors v and a. However, the 
chain rule enables us to write the velocity v in the form 


dR _ dR ds _ ^ ds 
dt ds dt dt 


( 10 ) 


For the purpose of conveying insight, this formula is much superior to the first 
of formulas (9), because in (10) the direction of v is given by T and its magni¬ 
tude is given by ds/dt, and the meaning of each is visible at a glance. Our main 
aim in the next two sections is to obtain a corresponding formula for the accel¬ 
eration a. 


Remark In most of our work we restrict ourselves to parametrized curves R = 
R(r) that are smooth , in the sense that the derivative R'(r) is continuous and 
nonzero at every point. In principle, the continuity of R'(f) enables us to find s 
as a function of t from formula (5), 



And since 


^ = |R'Wl > 0, 

the function s = s(t) is strictly increasing and therefore has an inverse function 
t = t(s). This permits us to introduce s as a parameter for the curve, 

R = R(r) = R [t(s)l 

However, in most cases it is difficult or impossible to carry out these calcula¬ 
tions. The integral for s may be hard to evaluate; and even if s = s(t) is known 
explicitly, it may be hard to find the inverse function t = t(s). Fortunately these 
difficulties are not a serious obstacle, because there is seldom any real need to 
have R expressed explicitly as a function of s. It is the idea of using arc length 
as a parameter that is important for understanding motion along curves—as in 
the preceding paragraph—and not the actual act of doing so in specific prob¬ 
lems. 


PROBLEMS 


1 Give a geometric description of the locus of the head of 
R if R = A + fB, where neither A nor B is 0 and B is 
not parallel to A. Draw a sketch. 

2 What is the locus of the head of R if R = at\ + b(\ - 
r)j, where a and b are nonzero constants? 

3 Show that the locus of the head of R = ri + ( mt + b) j is 
the line y = mx + b. 

4 What is the locus of the head of R = (r + 1 )i + (r 2 + 
2 t + 3)j? 


In Problems 5-9, R is the position of a moving point at time 
t. In each case compute the velocity, acceleration, and speed. 

5 R = (r 2 + l)i + (f — l)j. 

6 R = t 2 \ + r 3 4 j. 

7 R = ri + (/ 3 - 30 j- 

8 R = (cos 2r)i + (sin r)j. 

9 R = (tan t) i + (sec f)j. 

10 If the position vector of a moving particle is R = 
(i a cos kt) i + (b sin kt) j, where a, b, k are positive con- 
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stants, then the particle moves on the ellipse x 2 ta 2 + 
y 2 fb 2 = 1. Show that a = -£ 2 R and describe the force 
F that produces such a motion. 

11 If the acceleration of a moving particle is a = aj, where 
a is a constant, find R by two successive integrations 


with respect to t and show that the path is a parabola, a 
straight line, or a single point. 

12 If a moving particle is acted on by no force, so that a = 
0, show that the particle moves with constant speed along 
a straight line. This is Newton's first law of motion. 


In Section 17.4 we expressed the velocity v of our moving point P in terms of 
the unit tangent vector T shown in Fig. 17.41, where T was obtained as the de¬ 
rivative of the position vector R with respect to arc length s, 


T = 


dR 

ds 


17.5 

CURVATURE AND THE 
UNIT NORMAL VECTOR 


As our first step toward the general acceleration formula derived in Section 17.6, 
we must now analyze the derivative of T with respect to s, and this requires us 
to examine the purely geometric concept of the “curvature” of a curve. 

If we consult our intuitive feelings about the notion of curvature, most of us 
will agree that a straight line does not curve at all; that is, it has zero curvature. 
Also, a circle has the same curvature at every point, and a small circle has greater 
curvature than a large one, as suggested in Fig. 17.42. In the case of a nonuni¬ 
form curve like the one on the right, the curvature ought to be smaller where the 
curve is relatively straight and larger where the curve bends more sharply. 

These opinions are based on the idea that curvature at a point ought to mea¬ 
sure how rapidly the direction of a curve is changing at that point with respect 
to distance along the curve. Since direction is specified by the angle </> from the 
x-axis to the tangent line (Fig. 17.41), we consider this angle as a function of the 
arc length s and define the curimture k to be the rate of change of </> with respect 
to s, 



k 


d$ 
ds * 


( 1 ) 


The curvature can be either positive or negative, and it may be zero in certain 
cases. Since k > 0 means that </> is increasing as s increases, it is clear that this 
means that the curve turns away to the left of the tangent as we move along the 
curve in the positive direction. Similarly, k < 0 means the curve turns away to 
the right of the tangent. 

It is obvious from the definition (1) that the curvature of a straight line is zero, 
since (J> does not change as we move along the line. In the case of a circle of ra¬ 
dius a (Fig. 17.43), we have 





Figure 17.42 The meaning of 
curvature. 




612 


PARAMETRIC EQUATIONS. VECTORS IN THE PLANE 



<t>=0 + f. 


and by using the fact that 0 = s/a , we easily see that the curvature is 

k d<j) = dO = 1 
ds ds a 


(2) 


Since the curvature of a circle is clearly constant, we can also obtain the result 
(2) by observing that a complete revolution amounts to a change in direction of 
27 t radians over a curve of length lira, so 

£ — _ 277 _ 1 

ds ILttci a 



We note in passing that formula (2) shows that smaller circles have larger cur¬ 
vatures, as indicated in Fig. 17.42. 

Apart from these very simple cases, the actual calculation of the curvature is 
carried out by various rather complicated formulas, depending on how the curve 
is defined 

The simplest situation is that in which the curve is the graph of a function y = 
f(x). Since tan </> = dy/dx, we have 


(f> = tan 


-i 


and 


d<j> = 


d 2 x/dx 2 


dx. 


dx 1 + {dy/dx) 2 

Also, the expression ds = \/ dx 2 + dy 2 for the differential of arc length gives 


ds = 



1 + 



(3) 


On dividing dcf) by ds, we see that in this case the curvature is given by the formula 


dcf) _ d 2 y/dx 2 
ds [1 + {dy/dx) 2 ] V2 ' 


(4) 


In Section 4.2 we used the sign of the second derivative d 2 y/dx 2 to find out which 
direction the curve is bending, concave up or concave down. Formula (4) gives 
us this information and much more—it tells us precisely how much the curve is 
bending. 


Example 1 Show that the curvature of the parabola y = x 2 is greatest at the 
vertex. 


Solution We are familiar enough with the general shape of parabolas (Fig. 
17.44) to accept this statement without difficulty, because the curve visibly flat¬ 
tens out as |jc| —» To verify it by calculation, we use the fact that dy/dx = 2x 
and d 2 y/dx 2 = 2 to write 

_ d 2 y/dx 2 _ 2 

k = [1 + (dy/dx) 2 ] 3n = (1 + 4jc 2 ) 3/2 ' 

It is clear that this quantity has its greatest value when x = 0, which is at the ver¬ 
tex, and also that A: —> 0 as |jc| —> *». To illustrate how quickly the curve flattens 
out as we move away from the vertex, we notice that at the point (2, 4)—which 
is fairly close to the vertex—we have 

2 2 _ 1 
(1 + 16) 3/2 16 3/2 ~ 32’ 


Figure 17.44 
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Thus, at this point the parabola is flatter than a circle of radius 32, which is quite 
surprising. 


If a curve is defined by parametric equations jc = x(t) and y = y(t), then 
its curvature is computed from a slightly different formula. This time we start 
with 


cf) = tan 1 


dy/dt 

dx/dt 


(5) 


and 



( 6 ) 


The calculations leading to the curvature formula are a bit more complicated be¬ 
cause of the quotient in (5), and the result is 

_ d(f> _ Q dx/dt)(d 2 y/dt 2 ) — ( dy/dt)(d 2 x/dt 2 ) 
ds [{dx/dt) 2 + (dy/dt) 2 ] 312 

K *') 2 + (/) 2 ] 3/2 ‘ u> 


Students will notice that (7) includes (4) as a special case, when a curve y = f(x) 
is thought of as a parametric curve x = x, y = f(x), with x replacing t as the pa¬ 
rameter. 

There is a slight difficulty with signs that should be mentioned. By choosing 
the positive square root in both (3) and (6), we are assuming that the direction 
of increasing arc length 5 is the same as the direction in which the parameter in¬ 
creases. If this is not the case in applying (4) or (7) to a specific problem, then 
it is necessary to change the sign to get the actual curvature. 


Example 2 Show that a circle of radius a has curvature 1 la by using the para¬ 
metric equations x = a cos 0, y = a sin 6. 


Solution We apply formula (7) with the understanding that primes denote de¬ 
rivatives with respect to 6 . First we calculate 

x' = -a sin 6, y — a cos 0 , 

x" = —a cos 6 , y” = —a sin 6 . 

Formula (7) now gives 

_ a 2 sin 2 6 + a 2 cos 2 6 _ 1 

(a 2 sin 2 6 + a 2 cos 2 0) 3/2 a ’ 


as expected. 


Now that we understand the concept of curvature, we are ready to deal quickly 
with the main problem of this section, which is to analyze the derivative of the 
unit tangent vector T with respect to 5. 

We begin by observing that in terms of the slope angle 0 (Fig. 17.45) we have 

T = i cos 4> + j sin </>, 
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SO 


—— = -i sin (f) + j cos </>. 
d(p 


The derivative (8) is clearly a unit vector, because its length is 


d T 

clef) 


= Vsin 2 cj) + cos 2 <f> = 1. 


Also, it is perpendicular to T, because its slope is 

COS (ft _ 1 

-sin cf) tan </>’ 


(8) 


which is the negative reciprocal of the slope of T. In fact, the derivative (8) is 
the unit normal vector N shown in the figure, 



(9) 


where N is obtained by rotating T through an angle 7 t/ 2 in the counterclockwise 
direction. This is established by comparing (8) with 

N = i cos ^<f> + + j sin = — i sin </> + j cos </>. 

By using the chain rule together with (1) and (9), we now easily obtain the main 
result of this section, 

dT_ _ dJ_ d<f_ _ 
ds d4> ds 


It should be clear at this stage why it was necessary to discuss curvature in 
analyzing the meaning of dT/ds : Since T has constant length, only its direction 
changes as s varies, and this is what brings us to the curvature. We also point out 
that regardless of whether k is positive or negative, N k always points toward the 
concave side of the curve. 



Figure 17.46 The circle of curvature. 


Remark Let P be a point on a curve at which the curvature k is not zero, and 
draw the normal toward the concave side of the curve, as shown in Fig. 17.46. 
Every circle through P whose center lies on this normal will be tangent to the 
curve at P. That particular circle whose curvature is equal to |/:| is called the cir¬ 
cle of curvature . Also, the center C of this circle is called the center of curva¬ 
ture, and its radius r is called the radius of curvature. We know from (2) that in 
the case of a circle, the radius is the reciprocal of the curvature, so the radius of 
curvature is given by the formula 

1 = [1 + (dy/dx) 2 ] 3 ' 2 
r | A: | \d 2 y/dx 2 \ 

if the curve is the graph of a function y = /(;t). A similar formula holds for a 
parametric curve. As P moves along the given curve, the locus of the corre¬ 
sponding center of curvature C is called the evolute of the given curve.* 


*Some remarkable applications of the theory of evolutes to cycloids are given in Section B.23 of the 
author’s Calculus Gems (McGraw-Hill, 1992). 
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PROBLEMS 


1 Find the curvature of the given curve as a function of 
jc or t: 

(a) y = Vx; 

(b) y = In sec x ; 

(c) y = x + —; 

Jt 

(d) x = e r sin f, y = e r cos V, 

(e) x = t 2 , y = In t. 

2 Find the radius of curvature of the given curve at the 
given point: 

(a) x = r 2 , y = t 3 at t = 2; 

(b) x = e', y = e~ l at t = 0; 

(c) y = — at (1, 1); 

(d) jc = tan t, y = cot t at / = tt/4. 

3 In each case find the largest value (if any) of the cur¬ 
vature: 

(a) y = sin x; 

(b) y = yx 3 ; 

(c) y = In x. 

4 Carry out the details of establishing the parametric cur¬ 
vature formula (7). 

5 Find the curvature of the circle x 2 + y 2 = a 2 by apply¬ 
ing formula (4) separately to y = \ a 2 - x 2 and y = 
— Va 2 - x 2 . What difficulty arises, and how can it be 
fixed? 

6 For the curve y = e x , find the radius of curvature and 
the equation of the circle of curvature at the point (0, 
1). Sketch the curve and this circle. Use the equation of 
the circle to calculate the values of dy/dx and d 2 y/dx 2 
at the point (0, 1), and verify that these derivatives have 
the same values there as the corresponding derivatives 
of y = e x . 

7 At what point on the curve y = e x is the radius of cur¬ 
vature smallest? What is this smallest radius? 

8 We know that if y = f(x) is a straight line, then k = 0. 
Show, conversely, that if k = 0, then y =f(x) is a 
straight line. 


9 Find the largest value of the radius of curvature on the 
first quadrant part of the hypocycloid of four cusps x = 
a cos 3 6, y = a sin 3 6. Where does the radius have this 
largest value? 

10 By Problem 15 in Section 17.1, the equations 

x = cos 6+6 sin 6 , 
y = sin 6 - 6 cos 6 

represent the involute of a circle of radius 1. Find the 
curvature at any point. 

11 Find the radius of curvature of the cycloid 

x = a(6 — sin 6), 
y = a{\ — cos 6) 

at any point. 

12 (a) Sketch the ellipse x = a cos 6, y = b sin 6 , where 

0 < b < a, and find its curvature k at an arbitrary 
point. 

(b) Without calculating dk/d6 , use the formula in (a) to 
show that k has its largest values at the ends of the 
major axis and its smallest values at the ends of the 
minor axis. Show that these values are alb 2 and bla 2 , 
respectively. Notice that if b = a, then we have a 
circle, and both of these formulas give k = \/a, as 
they should. 1 " 

*13 Let a be a positive number and consider the curve y = 
x a for x > 0. Show that the curvature approaches a fi¬ 
nite limit as x —» 0 if a < \ or a = 1 or a > 2, and only 
in these cases. 


^or curves in general, a point where the curvature has a maximum 
or minimum value is called a vertex. By problem 12, an ellipse has 
four vertices. An ellipse is a special case of an oval, which is a con¬ 
vex closed curve whose parametric equations x = x(r), y = y(t) have 
continuous second derivatives. The famous four vertex theorem of dif¬ 
ferential geometry states that every oval has at least four vertices. 


Consider a moving particle whose position at time t is given by the parametric 
equations x = x(t) and y = y(t). The position vector of this particle is R = xi + 
yj, and its velocity and acceleration are 


dR dx . dy . d\ d 2 x . d 2 y . 

v = —— = —1 + -r- j, a = —— = —pr -1 + —1 
dt dt dt J dt dt 2 dt 2 


( 1 ) 


Unfortunately, the i- and j-components of these vectors have no physical mean¬ 
ing, because they depend on the coordinate system, and the choice of the coor¬ 
dinate system is arbitrary; it is not determined by the intrinsic nature of the mo¬ 
tion itself. However, we saw in Section 17.4 that the velocity can also be written 
as 
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Figure 17.47 



or 


rp dS 

v = T —, 
dt 


( 2 ) 


where T is the unit tangent vector (Fig. 17.47). This expression for the velocity 
does have physical meaning, because T gives the direction of the motion and 
dsldt gives its magnitude, the speed. 

To obtain a similar revealing expression for the acceleration, we differentiate 
(2) with respect to t, 


d\ _ d 2 s ds_ dT 
dt dt 2 dt dt ’ 


(3) 


By Section 17.5 we know that 


dl _ dJ_ ds_ _ dT dcf) ds 
dt ds dt d(f) ds dt 



(4) 



where k is the curvature and N is the unit normal vector shown in Fig. 17.48. 
When (4) is substituted in (3) we get our fundamental result, 


a 



+ m 



(5) 


This is an important equation in mechanics. The vectors T and N serve as refer¬ 
ence unit vectors much like i and j. They enable us to resolve the acceleration 
into two “natural” components, in the direction of the motion and normal to this 
direction, in contrast to the arbitrary components given by the second of equa¬ 
tions (1). The tangential component, d 2 s!dt 2 , is simply the derivative of the speed 
v = dsldt of the particle along its path. The normal component, k(ds/dt) 2 = kv 2 , 
has magnitude 

I k\v 2 = d (6) 

r 


where r is the radius of curvature. It is clear from (5) that when k =£ 0 and the 
particle is actually moving, the acceleration is always directed toward the con¬ 
cave side of the curve. 
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Of course, the great importance of the acceleration lies in the fact that when 
a particle of mass m is acted on by a force F, it moves in accordance with New¬ 
ton’s second law of motion F = ma. The vectors F and a therefore have the same 
direction, as shown in the figure, and this fits with our intuitive understanding 
that when a force changes the direction of a moving particle, it pulls the parti¬ 
cle away from the direction of the tangent toward the concave side of the path. 

Since the curvature k is available whenever the curve is given in parametric 
form, the tangential and normal components of acceleration can be calculated 
from (5). However, it is often more efficient to use the following procedure. The 
acceleration vector is the same whether it is expressed in terms of i- and j -com¬ 
ponents or tangential and normal components, so 

a = a x i + a v j = a t T + a n N, 

where the a's have the obvious meanings. This tells us that 

|a| 2 = a x 2 + a,, 2 = a xt 2 + a n 2 , 


so 

a n = V|a | 2 - a t 2 

where a t = d 2 s/dt 2 . 


(7) 


Example 1 If a particle moves along the curve whose parametric equations are 
jc = cos t + t sin t, y = sin t — t cos t, 
find the tangential and normal components of acceleration. 


Solution The position vector is 

R = (cos t + t sin r)i + (sin t — t cos r)j. 


so 


= 

dt 

= (—sin t + t cos t + sin r)i + (cos t + t sin t — cos r)j 
= (r cos t )i + (r sin t )i 


and 


a = — = (— t sin t + cos t) i + ( t cos t + sin r)j. 

The speed v — ds/dt is given by 

^ = |v| = V(f cos t ) 2 + (t sin t ) 2 = t, 

so the tangential component of acceleration is 

d 2 s d 

ai = lF = T, , = L 

The normal component of acceleration can be computed directly, by finding k 
and using a n = k(ds/dt) 2 . However, it is easier to use (7), which gives 
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a„ = V|a | 2 - a, 2 

= V( — t sin r + cos /) 2 + (/ cos r + sin 0 2 — 1 = >■ 


m 



Example 2 If a particle of mass m moves around a circular path of radius r with 
constant speed v = ds/dt , then d 2 s/dt 2 is zero. By equations (5) and (6), the ac¬ 
celeration is directed toward the center of the circle and has magnitude v 2 /r . Fur¬ 
ther, the centripetal force acting on the particle has magnitude 


F i 


mv 2 

r 


( 8 ) 


Thus, for an automobile going around a given unbanked curve, it takes four times 
as much normal force between the tires and the road to “hold the road” at 60 
mi/h as at 30 mi/h, and the required force is doubled again if the radius is halved. 
These are the results about uniform circular motion that we obtained in Exam¬ 
ple 2 of Section 17.4. 

Now suppose that our particle is an artificial satellite in a circular orbit around 
the earth, as shown in Fig. 17.49. If M is the mass of the earth, then Newton’s 
law of gravitation tells us that the force of attraction which the earth exerts on 
the satellite has magnitude 


where r is the distance from the satellite to the center of the earth and G is a con¬ 
stant of proportionality called the constant of gravitation. We know that the weight 
of the satellite is the force which gravity exerts on it at the surface of the earth, 
and this is mg. Therefore, if R denotes the radius of the earth, then Fi = mg when 
r = R, so (9) tells us that 

mg = G or GM = gR 2 . 

A 


This enables us to write (9) in the more convenient form 


F 2 = 



( 10 ) 


For a satellite in stable circular orbit, the centripetal force is precisely equal to 
the gravitational force, so F\ = Fi and 


This formula gives the speed at which a satellite must move in order to maintain 
a circular orbit at a specified distance r from the center of the earth. 

We make two observations about formula (11). First, if our satellite is mov¬ 
ing in a circular orbit at a relatively low altitude above the surface of the earth, 
then r = R and v = Vg/?. This orbital speed is approximately 5 mi/s, which 
should be compared with the escape speed of V2 gR or 7 mi/s that we calculated 
in Example 3 of Section 5.5. 

Second, we consider a communications relay satellite that is placed in a cir¬ 
cular orbit around the earth and has a period of revolution of T = 24 hours. This 
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is a so-called synchronous orbit, in which the satellite moves with the turning 
earth and appears to hang motionless in the sky. If r is the radius of this orbit, 
then the orbital speed is v = lirrlT, and when this is substituted in (11) we find 
that 

r 3 = i^H 

47 T 2 

With suitable adjustments applied to the values g = 32 ft/s 2 , R = 4000 mi, and 
T = 24 h, we easily find that r is approximately 26,000 mi, which means that 
the satellite must be about 22,000 mi above the surface of the earth. Such satel¬ 
lites were first conceived in 1945 by the famous science fiction writer Arthur C. 
Clarke, and are the crucial links in our present-day worldwide television com¬ 
munications. 

Of course, the ideas discussed in this example assume a circular path, which 
is approximately true for some satellites. We shall give a detailed treatment of 
elliptical orbits in Section 17.7. 


PROBLEMS 

1 In Example 1, find a n the other way, by first calculating 
k and then using a n = k(ds/dt) 2 . 

In Problems 2-7, find the velocity and acceleration vectors, 
then find the speed and the tangential and normal components 
of the acceleration. 

2 R = (2 1 - 5)i + (r 2 + 3)j. 

3 R — a cos cot i + a sin cot j, where a and co are positive 
constants. 

4 R = cos t 2 i + sin t 2 j. 

5 R = e l cos t i + e* sin t j. 

6 R = t cos t i + t sin t j. 

7 R = 2 In it 2 + 1 )i + (2t - 4 tan -1 r)j. 

In Problems 8 and 9, find the normal component of the ac¬ 
celeration for the given values of t. 

8 R = a cos t i + b sin t j; t = 0, 7t/2. 

9 R = t i + sin t j; t = 0, tt/2. 


10 Deduce from equation (5) that 

(a) the path of a moving particle will be a straight line 
if the normal component of acceleration is zero; 

(b) if the speed of a moving particle is constant, then the 
force is always directed along the normal; 

(c) if the force acting on a moving particle is always di¬ 
rected along the normal, then the speed is constant. 

11 A road has the shape of the parabola 120y = x 2 . A truck 
is loaded in such a way that it will tip over if the normal 
component of its acceleration exceeds 30. What speeds 
will guarantee disaster for the truck as it swings around 
the vertex of the parabola? 

12 If a particle moves along a path whose curvature k is 
never zero, how must the speed be adjusted if the nor¬ 
mal component of the acceleration is to be held at a con¬ 
stant magnitude? 


As we know, Isaac Newton conceived the basic ideas of calculus in the years 
1665 and 1666 (at age 22 and 23) for the purpose of helping him to understand 
the movements of the planets against the background of the fixed stars. In order 
to appreciate what was involved in this achievement, we briefly recall the main 
stages in the development of astronomical thinking up to his time. 

The ancient Greeks constructed an elaborate mathematical model to account 
for the complicated movements of the sun, moon, and planets as viewed from 
the earth. A combination of uniform circular motions was used to describe the 
motion of each body about the earth. It was very natural for them—as it is for 
all people—to adopt the geocentric point of view that the earth is fixed at the 
center of the universe and everything else moves around it. Also, they were in- 
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fluenced by the semimystical Pythagorean belief that nothing but motion at con¬ 
stant speed in a perfect circle is worthy of a celestial body. 

In this Greek model, each planet P moves uniformly around a small circle 
(called an epicycle) with center C, and at the same time C moves uniformly 
around a larger circle centered at the earth E, as shown in Fig. 17.50. The radius 
of each circle and the angular speeds of P and C around the centers C and E are 
chosen to match the observed motion of the planet as closely as possible. This 
theory of epicycles was given its definitive form in Ptolemy’s massive treatise 
Almagest in the second century A.D., and the theory itself is called the Ptole¬ 
maic system . 

The next great step forward was taken by the Polish astronomer Copernicus. 
Shortly before his death in 1543, when he was presumably almost beyond the 
reach of a wrathful Church, he at last allowed the publication of his heretical 
book. On the Revolution of the Celestial Spheres. This work changed the Ptole¬ 
maic point of view by placing the sun, instead of the earth, at the center of each 
primary circle. Nevertheless, this heliocentric system was of much greater cul¬ 
tural than scientific importance. It enlarged the consciousness of many educated 
Europeans by giving them a better understanding of their place in the scheme of 
things, but it also kept the clumsy machinery of Ptolemy’s circles whose centers 
move around on other circles. 

It was Johannes Kepler (1571-1630) who finally eliminated this jumble of cir¬ 
cles. Kepler was the assistant of the wealthy Danish astronomer Tycho Brahe, 
and when Brahe died in 1601, Kepler inherited the great masses of raw data they 
had accumulated on the positions of the planets at various times. Kepler worked 
incessantly on this material for 20 years, and at last succeeded in distilling from 
it his three beautifully simple laws of planetary motion, which were the climax 
of thousands of years of purely observational astronomy: 

1 The orbit of each planet is an ellipse with the sun at one focus. 

2 The line segment joining a planet to the sun sweeps out equal areas in equal 
times. See Fig. 17.51. 

3 The square of the period of revolution of a planet is proportional to the cube 
of the semimajor axis of the planet’s elliptical orbit. That is, if T is the time 
required for a planet to make one complete revolution about the sun and a is 
the semimajor axis shown in the figure, then the ratio T 1 2 /« 3 is the same for 
all planets in the solar system. 



Figure 17.51 Kepler’s second law. 
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From Kepler’s point of view, these were empirical statements that fitted the data, 
and he had no idea of why they might be true or how they might be related to 
one another. In short, there was no theory to provide a context within which they 
could be understood. 

Newton created such a theory. In the 1660s he discovered how to derive the 
inverse square law from Kepler’s laws by mathematical reasoning, and also how 
to derive Kepler’s laws from the inverse square law. We recall that Newton’s in¬ 
verse square law of universal gravitation states that any two particles of matter 
in the universe attract each other with a force directed along the line between 
them and of magnitude 



where M and m are the masses of the particles, r is the distance between them, 
and G is a constant of nature called the gravitational constant. With this simple, 
clean, clear law as the unifying principle of his thinking, Newton published his 
theory of gravitation in 1687 in his Principia Mathematica. In this one book— 
perhaps the greatest of all scientific treatises—his success in using mathemati¬ 
cal methods to explain the most diverse natural phenomena was so profound and 
far-reaching that he essentially created the sciences of physics and astronomy 
where only a handful of disconnected observations and simple inferences had ex¬ 
isted before. These achievements launched the modem age of science and tech¬ 
nology and radically altered the direction of human history. 

We now derive Kepler’s laws of planetary motion from Newton’s law of grav¬ 
itation, and to this end we discuss the motion of a small particle of mass m (a 
planet) under the attraction of a fixed large particle of mass M (the sun). 

For problems involving a moving particle in which the force acting on it is al¬ 
ways directed along the line from the particle to a fixed point, it is usually sim¬ 
plest to resolve the velocity, acceleration, and force into components along and 
perpendicular to this line. We therefore place the fixed particle M at the origin 
of a polar coordinate system (Fig. 17.52) and express the position vector of the 
moving particle m in the form 

R = m„ (2) 

where u r is the unit vector in the direction of R. It is clear that 

u r = i cos 6 + j sin 9 , (3) 



Figure 17.52 
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and also that the corresponding unit vector Up, perpendicular to u r in the direc¬ 
tion of increasing 0 , is given by 

I!# = -i sin 6 + j cos 6. (4) 


It is easy to see by componentwise differentiation that 


du r 

~d0 


= Ufl 


and 


due 

d6 


(5) 


Thus, differentiating u r and with respect to 0 has the effect of rotating these 
vectors 90° in the counterclockwise direction. We shall need the derivatives of 
u r and it# with respect to the time t. By means of the chain rule we at once ob¬ 
tain the formulas 


du r _ du r dO _ d0_ dup _ du e d6 _ _ d6_ 

~dt~ ~~dd ~dt ~ Ue ~dt an dt ~ de dt ~ ~ Ur dt’ 


(6) 


which are essential for computing the velocity and acceleration vectors v and a. 
Direct calculation from (2) now yields 


v 


JR du r dr 
dt r dt + U/ ‘ dt 


d6 dr 

r ^ u ° + i; Ur 


(7) 



d\ dr d6 d 2 6 d6 dup d 2 r dr du r 

dt dt dt dt 2 dt dt dt 2 dt dt 


and by keeping formulas (6) in mind and rearranging, the latter equation can be 
written in the form 


d 2 0 

dt 2 


dr^dOy 
dt dt , 


\ 

d 2 r , 

(doy 1 

1 U» + 

_dt 2 



U r 


If the force F acting on m is written as 

F = FpUp + F r u r 


( 8 ) 

(9) 


then, from (8) and (9) and Newton’s second law of motion ma = F, we get 


d 2 0 dr d0\ 

r i? +2 ^^ =F ° 


and 


\d 2 r , 

(dev 

dt 2 1 

(*J. 


= F, 


( 10 ) 


These differential equations govern the motion of the particle m and are called 
the equations of motion; they are valid regardless of the nature of the force F. 
Our next task is to extract the desired conclusions from these equations by mak¬ 
ing suitable assumptions about the direction and magnitude of F. 


CENTRAL FORCES AND KEPLER’S SECOND LAW 


F is called a central force if it has no component perpendicular to R, that is, if 
Fff = 0. Under this assumption the first of equations (10) becomes 


d 2 6 dr dO 


= 0 . 


On multiplying through by r, we obtain 

, d 2 0 
K dt 2 


„ dr dO „ 
+ 2r-- = 0 
dt dt 
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or 


so 



(ID 


for some constant h. We shall assume that h is positive, or equivalently that d6/dt 
is positive, which evidently means that m is moving around the origin in a coun¬ 
terclockwise direction. 

If A = A(t) is the area swept out by R from some fixed position of reference 
so that dA = \r 2 dQ, then (11) implies that 



dt = \h dt. 


On integrating this from t\ to ti , we get 


A(t2) ~ A(/i) — \h(t2 ~ t\). 


( 12 ) 


This yields Kepler’s second law: The line segment joining the sun to a planet 
sweeps out equal areas in equal intervals of time. 


CENTRAL GRAVITATIONAL FORCES AND KEPLER’S FIRST LAW 


We now specialize even further, and assume that F is a central attractive force 
whose magnitude is given by the inverse square law (1), so that 


F r = 



(13) 


If we write (13) in the slightly simpler form 



where k = GM, then the second of equations (10) becomes 

d 2 r _ I d6\ 2 _ k 
dt 2 r \dt) r 2 ' 


(14) 


The next step in this line of thought is difficult to motivate, because it involves 
considerable technical ingenuity, but we will try. Our purpose is to use the dif¬ 
ferential equation (14) to obtain the equation of the orbit in the polar form r = 
f(d ), so we want to eliminate t from (14) and consider 6 as the independent vari¬ 
able. Also, we want r to be the dependent variable, but if (11) is used to put (14) 
in the form 



d 2 r h 2 _ _k L 
dt 2 r 3 “ r 2 


(15) 


then the presence of powers of Mr suggests that it might be temporarily conve¬ 
nient to introduce a new dependent variable z = Mr. 

To accomplish these various aims, we must first express d 2 rldt 2 in terms of 
d 2 z/d6 2 , by calculating 
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and 


dr 

d ( l ) 

1 dz 

1 dz 

dO 

dt 

dt \Z 

z 2 dt 


dt 


1 dz h _ dz 
z 2 dO r 2 1 dO 


d 2 r 

It 2 


, d 
-h — 
dt 


d 2 z h 
dO 2 r 2 




-h 2 z 2 


dh 

dO 2 ' 


d0_ 

dt 


When the latter expression is inserted in (15), and r is replaced by 1 /z, we get 

-h 2 z 2 ~ h 2 z 3 = ~kz 2 


or 


d 2 z _ k 

de 2 z h 2 ' 


(16) 


To solve this equation, we observe that, except for the constant term on the 
right, it is the differential equation of simple harmonic motion discussed in Sec¬ 
tion 9.6. To eliminate the constant term, we put 

k 


so that d 2 w/d0 2 = d 2 z/d0 2 and (16) becomes 


d 2 w 
~d0 2 


+ w = 0. 


As we know, the general solution of this familiar equation is 

w = A sin 0 + B cos 0, 


so 


z 


= A sin 0 + B cos 0 + — 
h z 


(17) 


For the sake of simplicity, we now shift the direction of the polar axis in such a 
way that r is minimal (that is, m is closest to the origin) when 6=0. This means 
that z is to be maximal in this direction, so 



and 



0 


when 6 = 0. By calculating dz/d6 and d 2 z/d0 2 from (17), we easily see that these 
conditions imply that A = 0 and B > 0. If we now replace z by \/r, then (17) can 
be written as 

= 1 __ h 2 /k _ 

k/h 2 + B cos 0 1 + ( BhVk ) cos 0’ 


and if we put e = Bh 2 /k , then our equation for the orbit becomes 
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h 2 /k 

1 + e cos 0’ 


(18) 


where e is a positive constant. 

We recall from Section 16.3 that (18) is the polar equation of a conic section 
with focus at the origin and vertical directrix to the right; and furthermore, that 
this conic section is an ellipse, a parabola, or a hyperbola according as e < 1, 
e — 1, or e > 1. Since the planets remain in the solar system and do not move 
infinitely far away from the sun, the ellipse is the only possibility. This yields 
Kepler’s first law: The orbit of each planet is an ellipse with the sun at one 
focus.* 


KEPLER’S THIRD LAW 


We now restrict ourselves to the case in which m has an elliptic orbit (Fig. 17.53) 
whose polar and rectangular equations are (18) and 


*L , £_ 

a 2 b 2 


We know that e — da and c 2 = a 2 - b 2 , so e 2 = {a 2 — b 2 )/a 2 and 

b 2 = a 2 { 1 -e 2 ). 


(19) 


In astronomy the semimajor axis a of the elliptical orbit is called the mean dis¬ 
tance , because it is one-half the sum of the least and greatest values of r. These 
are the values of r corresponding to 6 = 0 and 6 = tt in (18), so by (18) and (19) 
we have 


* = 2 


I / h 2 lk . h 2 /k 


+ ■ 


h 2 


hra 2 

1 + e 1 — el k( 1 — e 2 ) kb 2 ’ 


which yields 


b 2 = 


h 2 a 


( 20 ) 


If T is the period of m (that is, the time required for one complete revolution in 
its orbit), then, since the area of the ellipse is 7 Tab, it follows from (12) that tt ab = 
\KT, so T = 2vab/li. By using (20), we now obtain 


^0 4v 2 a 2 b 2 ( 4 tt 2 \ , 

p ——*(—J ” 3 


( 21 ) 


Since the constant k — GM depends on the central attracting mass M but not on 
m, (21) holds for all the planets in our solar system and we have Kepler’s third 
law: The squares of the periods of revolution of the planets are proportional to 
the cubes of their mean distances. 


*In the discussion of equation (17) we have ignored the possibility that r might have a constant value 
and therefore not be minimal in any direction, so that z has a constant value and is not maximal in 
any direction. This happens when both A = 0 and B = 0, so that z = k/h 2 and r = h 2 lk. Under these 
circumstances we have a circular orbit with radius h 2 /k, and this can be included under equation (18) 
by allowing the possibility that e = 0. However, we saw in Section 17.6 that a circular orbit of given 
radius requires a certain precise orbital speed, which is infinitely unlikely for an actual planet and 
can be disregarded as a genuine possibility. 


y 



Figure 17.53 
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As we explained in Section 15.3, the standard unit of distance among as¬ 
tronomers who work with the solar system is the astronomical unit. This is the 
mean distance from the earth to the sun, which is approximately 93,000,000 mi, 
or 150,000,000 km. Equation (21) takes the more convenient form 


( 22 ) 


when time is measured in years and distance in astronomical units. The reason 
for this, of course, is that 1 year is by definition the period of revolution of the 
earth in its orbit, so that with these units of measurement T = 1 when <2=1. 

We would like to point out that the mathematical theory discussed in this sec¬ 
tion is just the beginning of what Newton accomplished, and constitutes only a 
first approximation to the full story of planetary motion. For instance, we have 
assumed that only the sun and one planet are present. But actually, of course, all 
the other planets are present as well, and each exerts its own independent grav¬ 
itational force on the planet under consideration. These additional influences in¬ 
troduce what are called perturbations into the idealized elliptical orbit derived 
here, and the main purpose of the science of celestial mechanics is to take all 
these complexities into account. One of the great events of nineteenth century 
astronomy arose in just this way, namely, the discovery of the planet Neptune by 
Adams and Leverrier, through their attempts to explain the relatively large devi¬ 
ations of Uranus from its Keplerian orbit.* 

Also, we have assumed that the sun and planet under discussion are particles, 
that is, points at which mass is concentrated. In fact, of course, they are extended 
bodies with substantial dimensions. One of Newton’s most remarkable achieve¬ 
ments was to prove that the sun and planets behave like particles under the in¬ 
verse square law of attraction. We will prove this statement ourselves in Chap¬ 
ter 20, where we study three-dimensional integrals. 

Newton’s enormous success revived and greatly intensified the almost-forgot¬ 
ten Greek belief that it is possible to understand the universe in a rational way. 
This new confidence in its own intellectual powers permanently altered human¬ 
ity’s perception of itself, and over the past 300 years almost every department of 
human life has felt its consequences. 


Tor the details of this dramatic story, see pp. 820-839 of The World of Mathematics, James R. New¬ 
man, ed. (Simon and Schuster, 1956). 


PROBLEMS 


1 Newton himself did not know the value of the constant 
of gravitation G. This was determined by means of a clas¬ 
sic experiment in 1789 by the English scientist Henry 
Cavendish. Once G is known, explain how equation (21), 
written in the form 


*3 Kepler’s first two laws, in the form of equations (11) and 
(18), imply that m is attracted toward the origin with a 
force whose magnitude is inversely proportional to the 
square of r. This was Newton’s fundamental discovery, 
for it caused him to propound his law of gravitation and 
investigate its consequences. Prove this by assuming (11) 
and (18) and verifying the following statements: 


(a) twice that of the earth? 

(b) three times that of the earth? 

(c) 25 times that of the earth? 



might be used to calculate the mass of the sun. 

What is the period of revolution T (in years) of a planet 
whose mean distance from the sun is 


(a) F e = 0; 


2 
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dr ke . 

(b) ^=T sin 


(C) 


d 2 r 

dt 2 


ke cos 6 


<d >f,--4=-c 


Mm 

T2"’ 


*4 Use formula (17) to show that the speed of a planet at 
any point of its orbit is given by 



5 Suppose that the earth explodes into fragments which fly 
off at the same speed in different directions into orbits 
of their own. Use Kepler’s third law and the result of 
Problem 4 to show that all fragments that do not fall into 
the sun or escape from the solar system will reunite later 
at the same point. 


CHAPTER 17 REVIEW: CONCEPTS, FORMULAS 

Think through the following. 


1 

Parametric curve. 

7 

Velocity v = dRJdt. 

2 

Cycloid: area and length. 

8 

Acceleration a = d\/dt = d 2 RJdt 2 . 

3 

Brachistochrone problem. 

9 

Newton’s second law of motion: F = 

4 

Tautochrone property. 

10 

Curvature: definition and formula. 

5 

Scalars and vectors. 

11 

Kepler’s laws of planetary motion. 

6 

Parallelogram rule. 

12 

Newton’s law o f gravitation. 


ADDITIONAL PROBLEMS FOR CHAPTER 17 


SECTION 17.1 

1 Find the area of the loop of the folium of Descartes shown 
in Fig. 17.11. Hint: Use the polar equation of the folium 
and evaluate the area integral with the aid of the substi¬ 
tution u = tan 6. 

2 Find parametric equations for the right loop of the lem- 
niscate r 2 = la 2 cos 26 by using the slope of the radial 
line t = y/x as parameter. How can the left loop be rep¬ 
resented? 

SECTION 17.2 

3 Consider the cycloid discussed in Section 17.2. 

(a) Find the volume of the solid generated by revolving 
the region under one arch about the x-axis. 

(b) Find the area of the surface generated by revolving 
one arch about the x-axis. 

*4 Let a be fixed and consider the hypocycloid of n cusps, 
so that a = nb. Find the total length L n of this curve, and 
also the limit approached by L n as /i —» 

*5 Find the length of one arch of the epicycloid generated 
by a circle of radius b rolling on the outside of a fixed 
circle of radius af 


Newton discovered this length, and obtained Wren’s Theorem (Ex¬ 
ample 2 in Section 17.2) by letting a —> oo. See Book I of the Prin- 
cipia, Prop. 48 and Cor. 2 to Prop. 52. It is interesting to try to bridge 
the gap between Newton’s language and our own. 


*6 Let a be fixed and consider the epicycloid of n cusps, so 
that a = nb. Find the total length L n of this curve, and 
also the limit approached by L n as n —» «>. 

*7 Consider an ideal pendulum consisting of a particle of 
mass m at the end of a weightless string of length L (Fig. 
17 . 54 ). If it is pulled aside through an angle a and re¬ 
leased, show that its period of oscillation T can be ex¬ 
pressed in the form 

7=4 fif d * 

' 8 V(1 - « 2 )( 1 - JtV) 



Figure 17.54 An ideal pendulum. 
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where k = sin \a and u = (1 Ik) sin j6. Hint: x = 
L sin 6 and y = L cos 6 , and \mv 2 = mg{L cos 6 - 
L cos a). This integral is called a complete elliptic inte¬ 
gral of the first kind, and it cannot be evaluated by means 
of elementary functions. When a is small, so that k 2 is 
very small, we have the approximation 



as in Example 3 in Section 9.6. 

*8 Consider a wire bent into the shape of a cycloid 
with parametric equations x = a(6 — sin 6) and y = 
o(l — cos 6 ), and invert it as in Fig. 17.15. If a bead is 
released on the wire and slides without friction under the 
influence of gravity alone, show that its velocity v satis¬ 
fies the equation 

4 av 2 = g(so 2 - s 2 ), 


where so and s are the arc lengths from the lowest point 
to the bead’s initial position and its position at any later 
time, respectively. By differentiation obtain the equation 


dt 2 4a 


which shows that the bead moves in simple harmonic 
motion. Use the ideas of Section 9.6 to find s as a func¬ 
tion of t, determine the period of the motion, and observe 
that this establishes in another way the tautochrone prop¬ 
erty of the cycloid proved in Section 17.2. 


SECTION 17.3 

9 Use vector methods to show that the line joining the mid¬ 
points of two sides of a triangle is parallel to the third 
side and half its length. 

10 Generalize Problem 9 by using vector methods to show 
that the line joining the midpoints of the nonparallel sides 
of a trapezoid is parallel to the parallel sides and half the 
sum of their lengths. 


SECTION 17.5 

11 Locate the points on the curve y = \x 4 where the radius 
of curvature is smallest. What is this smallest radius? 

12 Show that the radius of curvature at any point (x, y) on 
the hypocycloid of four cusps x 2/3 + y m = a 2/3 is three 
times the distance from the origin to the line which is 
tangent to the curve at (x, y). 


of the earth. Calculate M e (approximately) in grams by 
using the values g = 980 cm/s 2 and/? = 6.37 x 10 8 cm. 
14 With the notation of Section 17.7, the inverse square law 
can be written as 



where k = GM, and since ma = F, we have 
d\ k 

— = — ju r . 
dt r 2 

Verify the following steps to obtain another derivation of 
Kepler’s first law from the inverse square law: 

(a) Use (6) and (11) to write 

dx _ k_ ( 1 du e \ _ k du tj 

dt r 2 \ dO/dt dt ) h dt 

(b) Integrate the equation in (a) to obtain 

v = i Ue+ (" o ~i) j ’ 

by assuming initial conditions in the following form 
(Fig. 17.55): At t = 0, m has its closest approach to 
the origin and crosses the polar axis at the point R = 
r 0 i with velocity v = v$y 



(c) Equate u r components of the equation in (b) to ob¬ 
tain 


r 


de_ 

dt 




cos 6 , 


SECTION 17.7 

13 The Cavendish value for G is 6.7 X 10 -8 cm 3 • g“ 1 • s -2 
when mass is measured in grams, distance in centime¬ 
ters, and time in seconds. In Example 2 in Section 17.6 
we used the fact that GM e — gR 2 , where M e is the mass 


and use (11) to write this in the form 


h 

r 


h 


+ 



cos 6. 


(d) Solve the equation in (c) for r to obtain 
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h 2 /k 


1 + ( Vq h/k - 1) cos 6 ' 

(e) Use (7) and (11) to show that h = r Q vo, and write the 
equation in (d) in the form 

( roVo) 2 /k ( r 0 i.> 0 ) 2 /k 


1 + (roVo 2 lk — 1) cos 6 
where e — r^v^lk — 1. 


+ e cos 6 ’ 


(f) Observe that the equation in (e) represents 
A circle if r Q v 0 2 = GM\ 

An ellipse if GM < r$v o 2 < 2 GM\ 

A parabola if r Q v 0 2 = 2GM\ 

A hyperbola if r 0 v q 2 > 2 GM. 


As explained in Section 17.2, we begin with a point P% and a lower point P i, and we seek 
the shape of the curved wire joining these points down which a bead will slide without 
friction in the shortest possible time. 

We start by considering an apparently unrelated problem in optics. Figure 17.56 illus¬ 
trates a situation in which a ray of light travels from A to P with constant velocity v\, and 
then, entering a denser medium, travels from P to B with a smaller velocity v 2 . In terms 
of the notation in the figure, the total time T required for the journey is given by 

Vtf 2 + x 2 \/b 2 + (c — x) 2 

T - -h - 1 - — 

V\ v 2 

If we assume that this ray of light is able to select its path from A to B in such a way as 
to minimize T, then dT/dx = 0, and with a little work we see that the minimizing path is 
characterized by the equation 

sin a\ _ sin a 2 
V\ v 2 

This is Snell's law of refraction.* The assumption that light travels from one point to an¬ 
other along the path requiring the shortest time is called Fermat's principle of least time. 
This principle not only provides a rational basis for Snell’s law—which is an experimental 
fact—but also can be applied to find the path of a ray of light through a medium of vari¬ 
able density, where in general light will travel along curves instead of straight lines. In 
Fig. 17.57(a) we have a stratified optical medium. In the individual layers the velocity of 
light is constant, but the velocity decreases from each layer to the one below it. As the 
descending ray of light passes from layer to layer, it is refracted more and more toward 


APPENDIX: 
BERNOULLI’S 
SOLUTION OF THE 
BRACHISTOCHRONE 
PROBLEM 



Figure 17.56 The refraction of light. 


*See Example 4 in Section 4.4. 




Figure 17.57 Refraction in other 
optical media. 
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the vertical, and when Snell’s law is applied to the boundaries between the layers, we ob¬ 
tain 


sin a i _ sin a 2 _ sin a 3 _ sin a 4 
V\ v 2 v 3 V4 

If we next allow these layers to grow thinner and more numerous, then in the limit the 
velocity of light decreases continuously as the ray descends, and we conclude that 

sin a 

-= a constant. 

v 



Figure 17.58 


This situation is indicated in Fig. 17.57(b); it is approximately what happens to a ray of 
sunlight falling on the earth as it slows in descending through atmosphere of increasing 
density. 

Returning now to the brachistochrone problem, we introduce a coordinate system as in 
Fig. 17.58 and assume that the bead (like the ray of light) is capable of selecting the path 
down which it will slide from Po to P\ in the shortest possible time. The argument given 
above yields 

sin a 

-= a constant. (1) 

v 

If the bead has mass m, so that mg is the downward force that gravity exerts on it, then 
the fact that the work done by gravity in pulling the bead down the wire equals the in¬ 
crease in the kinetic energy of the bead tells us that mgy = { mv 2 . This gives 

v=Vliy. (2) 

From the geometry of the situation we also have 


sin a = cos (3 = -- = = = — (3) 

sec P Vl + tan 2 j3 Vl + (y') 2 

On combining equations (1), (2), and (3)—obtained from optics, mechanics, and calcu¬ 
lus—we get 

y[\ +(/) 2 ] =c (4) 


as the differential equation of the brachistochrone. 

We now complete our discussion, and discover what curve the brachistochrone actu¬ 
ally is, by solving equation (4). When y' is replaced by dy/dx and the variables are sepa¬ 
rated, (4) becomes 

dx = J—*— dy, 

V c - y 


SO 


■ dy. 

c - y 

We evaluate this integral by starting with the algebraic substitution u 2 = y/(c - y), so that 
cu 2 , , 2cm 


y : 


+ 


and 


dy = 


(1 + M 2 ) 2 


du. 


Then 


f 2 cm 2 

J (1 + M 2 ) 2 


du , 


and the trigonometric substitution m = tan c/>, du = sec 2 <f> dcfr enables us to write this as 
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The constant of integration here is zero because y = 0 when cf> = 0, and since P 0 is at the 
origin, we also want to have ;c = 0 when 0 = 0. The formula for y gives 



We now simplify our equations by writing a = \c and 0 = 20, which yields 
x = a{6 - sin 6 ), y = a{\ - cos 0). 

These are the standard parametric equations of the cycloid with a cusp at the origin. We 
note that there is a single value of a that makes the first inverted arch of this cycloid pass 
through the point P\ in Fig. 17.58; for if a is allowed to increase from 0 to then the 
arch inflates, sweeps over the first quadrant of the plane, and clearly passes through P\ 
for a single suitably chosen value of a. 
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18.1 

COORDINATES AND 
VECTORS IN THREE- 
DIMENSIONAL SPACE 


Figure 18.1 Coordinate axes. 


VECTORS IN 
THREE- 
DIMENSIONAL 
SPACE. SURFACES 


In the preceding seventeen chapters we have discussed many aspects of the cal¬ 
culus of functions of a single variable. The geometry of these functions is two- 
dimensional because the graph of a function of a single variable is a curve in the 
plane. Most of the remainder of this book is concerned with the calculus of func¬ 
tions of several (two or more) independent variables. The geometry of functions 
of two variables is three-dimensional, because in general the graph of such a 
function is a curved surface in space. 

In this chapter we discuss the analytic geometry of three-dimensional space. 
Our treatment will emphasize vector algebra, partly because this approach pro¬ 
vides a more direct and intuitive understanding of the equations of lines and 
planes, and partly because the concepts of dot and cross products as developed 
in the next two sections are indispensable in many other parts of mathematics 
and physics. 

Rectangular coordinates in the plane can be generalized in a natural way to 
rectangular coordinates in space. The position of a point in space is described by 
giving its location relative to three mutually perpendicular coordinate axes pass¬ 
ing through the origin O. We always draw the jc-, y-, and z-axes as shown in Fig. 
18 . 1 , with equal units of length on all three axes and with arrows indicating the 
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Figure 18.2 Locating a point by its 
rectangular coordinates. 


positive directions. Each pair of axes determines a coordinate plane : the x-axis 
and y-axis determine the xy-plane, etc. The configuration of axes in this figure 
is called right-handed , because if the thumb of the right hand points in the di¬ 
rection of the positive z-axis, then the curl of the fingers gives the positive di¬ 
rection of rotation in the xy-plane, from the positive x-axis to the positive y-axis. 

Since many people have trouble visualizing space figures from plane draw¬ 
ings, we point out that Fig. 18.1 can be thought of as part of a rectangular room 
drawn in perspective, with the origin O at the far left corner of the floor. The 
xy-plane is the floor, and has the normal appearance of the xy-plane if we look 
down on it from a point on the positive z-axis; the yz-plane is the back wall of 
the room, in the plane of the paper; and the xz-plane is the wall on the left side 
of the room. 

A point P in space (see Fig. 18.2) is said to have rectangular (or Cartesian) 
coordinates x , y, z if: 

x is its signed distance from the yz-plane; 

y is its signed distance from the xz-plane; 

z is its signed distance from the yy-plane. 

Just as in plane analytic geometry, we write P = (x,y, z) and identify the point 
P with the ordered triple of its coordinates. On the right in the figure we attempt 
to strengthen the illusion of three dimensions by completing the box that has O 
and P as opposite vertices.* 

The three coordinate planes divide all of space into eight cells called octants. 
The cell emphasized in Fig. 18.2, where x, y, and z are all positive numbers, is 
called the first octant. (No one bothers to number the other seven octants.) 

Even before plunging into a general study of the equations of lines and planes 
in Section 18.4, we can notice a few obvious facts. The xy-plane is the set of all 
points (x, y, 0); it consists precisely of those points in space whose z-coordinate 
is 0, so its equation is 

z — 0. 

Similarly, the equation of the yz-plane is x = 0, and the equation of the xz-plane 
is y = 0. 


*The technical term for the object shown on the right is “rectangular parallelepiped.” We prefer the 
simpler word “box.” 
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Figure 18.3 Horizontal plane. 



Figure 18.4 The position vector of P. 



The z-axis is the set of all points (0, 0, z). It is therefore represented by the 
pair of equations 

x = 0, y = 0. (1) 

These are the equations of the yz-plane and the xz-plane, respectively, so equa¬ 
tions (1) taken together characterize the z-axis as the intersection of these two 
coordinate planes. Similarly, the equations of the jc-axis are y = 0, z = 0; and the 
equations of the y-axis are jc = 0, z = 0. 

There is nothing special about the number 0 in these remarks. For instance, 
the equation of the horizontal plane 5 units above the jcy-plane is z = 5; and the 
equations of the vertical line that passes through the point (1, 2, 0) in the xy-plane 
are* = 1, y = 2. See Fig. 18.3. 

Almost all of the ideas about vectors that were presented in Section 17.3 are 
valid in three-dimensional space and require no further discussion. This remark 
applies to the concept of a vector, to the definition of equality for vectors, and 
to the definitions of addition and scalar multiplication. In all this material there 
is no need at all to suppose that the vectors lie in a plane. 

The only real difference is that a vector in space has three components rather 
than two. In computing with vectors in the plane, we used the unit vectors i and 
j in the positive x- and y-directions. In order to compute with vectors in three- 
dimensional space, we introduce a third unit vector k in the positive z-direction, 
as shown in Fig. 18.4. If P = (x , y, z) is any point in space, the position vector 
R = OP can be written in the form 


R = x\ + yj + zk, 

and the numbers jc, y, and z are called its i-, j-, and k -components. 
The length of the vector R is given by the formula 


|R| = Vjc 2 + y 2 + z 2 . ( 2 ) 

This can be proved by a double application of the theorem of Pythagoras, as il¬ 
lustrated in Fig. 18.5: 

|R | 2 = OP 2 = OB 2 + BP 2 

= OA 2 + AB 2 + BP 2 
= x 2 + y 2 + z 2 . 


If Pi = (jci,yi, Zi) and P 2 = (x 2 , y 2 , Z 2 ) are any two points in space (Fig. 18.6), 

-» 

the distance between them is the length of the vector P1P2 from Pi to P2. Since 
P1P2 = R 2 - Ri = (x 2 l + y2j + z 2 k) - (jqi + y 1 j + zik) 

= (x 2 - *i)i + (y2 ~ y i)j + (■ Z2 ~ zi)k, 

we can use (2) to obtain 


IP 1 P 2 I = V(jc 2 - *i) 2 + (y 2 - yi) 2 + (z 2 - zi) 2 (3) 


This is the important distance formula’, it has many uses. 

Since a sphere is the set of all points P at a given distance r from a given fixed 
point Pq (Fig. 18.7), the equation of a sphere can be written as 


|PoP| = r. 


(4) 
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Figure 18.6 


If P = (x, y, z) and Po — Uo.yo* Zo X then (3) enables us to write (4) in the equiv¬ 
alent form 

(x - jto) 1 2 3 4 5 6 + (y — y () ) 2 + (z - zo) 2 = r 2 . (5) 

This is the standard equation of the sphere with center Pq = (;to, yo, Zo) and ra¬ 
dius r. 


Example If we complete the squares in the equation 

jc 2 + y 2 + z 2 + 4x - 2 y - 6z + 8 = 0, 


it becomes 


( 6 ) 


(* + 2) 2 + (y - l) 2 + (z - 3) 2 = 6. 

By comparing this with (5), we see at once that (6) is the equation of the sphere 
with center (—2, 1, 3) and radius V6. 



PROBLEMS 


1 Sketch the box with the vertices (1, —1,0), (1,4,0), 

(-2,4,0), (-2,-1,0), (1,-1,5), (1,4,5), 

( — 2,4,5), (—2, —1,5). Write down the equations of 
the faces and edges of the box that pass through the ver¬ 
tex (1,4, 5). 

2 Sketch the box bounded by the planes x = l,x = 3, y = 
0, y = 4, z = 1, z = 5. Write down the vertices. 

3 Sketch the tetrahedron whose base vertices are (3, 4, 0), 
(3, —4, 0), and (—5, 4, 0) and whose fourth vertex is 
(0, 0,6). Use the fact that the volume of a tetrahedron 
is one-third the area of the base times the height to find 
the volume of this tetrahedron. 

4 Sketch the straight lines whose equations are given: 

(a) * = 2, z = 3; (b) y = 1, z = 4; 

(c) x = —3, y = 1. 

5 Describe the graph of the equation 

(a) *y = 0; (b) xyz = 0. 

6 Describe and sketch the locus of all points P = (x, y , z) 
that satisfy the given pairs of simultaneous equations: 
(a) x 2 + y 2 = 4, z = 3; 


(b) ;c = 4, z = 4y 2 ; 

(c) y = x, x = 5; 

(d) z = -x 2 , y = 0. 

7 Find the point on the y-axis which is equidistant from 
(2, 5, -3) and (-3,6, 1). 

8 Find and simplify the equation of the locus of all points 
that are equidistant from (7, 0, —4) and (—3, 2, 2). De¬ 
scribe this locus in geometric language. 

9 Write the equation of the sphere with radius 7 and cen¬ 
ter on the positive z-axis, if the sphere is tangent to the 
plane z = 0. 

10 Find the equation of the sphere with center (3, —2, 5) 
which is 

(a) tangent to the ry-plane; 

(b) tangent to the yz-plane; 

(c) tangent to the ^z-plane. 

11 Identify the graph of each of the following equations, 
and if it is a sphere, give its center and radius: 

(a) x 2 + y 2 + z 2 + 2x — 6y — 10z + 26 = 0; 

(b) x 2 + y 2 + z 2 — lOx + 2y - 6z + 35 = 0; 
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(c) 2x 2 + 2 y 2 + 2 z 2 ~ 16x + 8 y + 4z + 49 = 0; 

(d) x 2 4- y 2 + z 2 + 2x — \4y — 6z + 59 = 0; 

(e) 4x 2 + 4y 2 + 4z 2 ~ I6x + 24y + 52 = 0. 

12 A point P moves in such a way that it is always twice 
as far from (3, 2,0) as from (3, 2, 6). Show that the lo¬ 
cus of P is a sphere and find its center and radius. 

13 If Pi = (*i, yi, zi) and P 2 = (* 2 , yi, Z 2 ), use vectors to 
show that the coordinates of the midpoint of the seg¬ 
ment P 1 P 2 are 

x = ~(x l +x 2 ), y = {{y\+yi), z = {(zi+Z 2 ). 

14 Find the equation of the sphere that has the two given 
points as ends of a diameter: 

(a) (6,2,-1), (-2,4, 3); (b) (0, 1,-7), (-6,7,3). 

15 Show that the triangle with vertices (4, 3,6), (- 2, 0, 8), 
and (1, 5, 0) is a right triangle. Find its area. 

16 For each of the following pairs of points, find the vec¬ 


tor from the first point to the second, and also the dis¬ 
tance between the points: 

(a) (2, 0, -3), (5, 1,2); (b) (-2, 1,7), (1, -4, 2); 

(c) (8, 3, 6), (2, -2, 0); (d) (1, 5, -3), (9, 7, 1). 

17 Find the vector from the origin O to the intersection of 
the medians of the triangle whose vertices are A = 
(3, 2, 2), B = (-1, 0, 4), and C = (5, 3, -2). 

18 If A, B, C are any three distinct vectors, their endpoints 
form a triangle. Find the position vector of the inter¬ 
section of the medians of this triangle. 

*19 If A, B, C, D are any four distinct vectors, their end¬ 
points form a tetrahedron. Show that the four lines join¬ 
ing each vertex to the intersection of the medians of the 
opposite face are concurrent, and find the position vec¬ 
tor of their common point. 


18.2 

THE DOT PRODUCT 
OF TWO VECTORS 


Up to this point in our work we have not defined the product of two vectors A 
and B. There are two different ways of doing this, both of which have important 
uses in geometry and physics. Since there is no reason to choose one of these 
definitions in preference to the other, we keep both, using a dot for one defini¬ 
tion and a cross for the other. The dot product (or scalar product ) of A and B is 
denoted by A • B and is a number. The cross product (or vector product) is de¬ 
noted by A X B and is a vector. These two kinds of multiplication are totally dif¬ 
ferent. We discuss the first in this section and the second in Section 18.3. 

The dot product A • B of two vectors A and B is defined to be the product of 
their lengths and the cosine of the angle between them. This definition can be 
written as 


A • B = |A||B| cos 6 , 


(1) 



Figure 18.8 Scalar projection. 


where 6 (0 < 6 < tt) i s the angle between A and B when they are placed so that 
their tails coincide (Fig. 18.8). It is clear from the definition that A • B is a scalar 
(or number), not a vector. 

As Fig. 18.8 shows, the number |B| cos 6 is the scalar projection of B on A, 
denoted by proj A B. Definition (1) can therefore be interpreted geometrically as 
follows: 


A • B = |A|(|B| cos 0) = |A| proj A B 

= (length of A) X (scalar projection of B on A). 

By interchanging the roles of A and B, we also have 

A • B = |B|(|A| cos 6) = |B| proj B A 

= (length of B) X (scalar projection of A on B). 


The vector projection of B on A is also indicated in the figure. Both types of pro¬ 
jections are useful in applications. 

It is easy to see from the definition (1) that the dot product has the properties 


A • B = B • A, 


( 2 ) 


the commutative law, 
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and 


(cA) • B = c(A • B) = A • (cB). 


(3) 


It also has the property 


A • (B + C) = A • B + A • C, the distributive law, (4) 

but this is not quite as evident as (2) and (3). To establish (4), we observe from 
Fig 18.9 that 

A • (B + C) = |A|[proj A (B + C)] 

= |A|(proj A B + proj A C) 

= |A|proj A B + |A|proj A C 
= A B + A • C. 



If we combine (4) with the commutative law (2), we also have 

(A + B) • C = A • C + B • C (5) 

Properties (4) and (5) permit us to multiply out sums of vectors by the ordinary 
procedures of elementary algebra, as in 

(A + B) (C+D) = A C + A D + B C+BD 


Another simple consequence of the definition (1) is the fact that 

A • A = |A | 2 


for any vector A. 


(6) 


Example 1 In the notation of Fig. 18.10, the cosine law of trigonometry states 
that 

c 2 = a 1 + b 2 — lab cos 6. 

This can be proved very easily by using property (6) to write 
c 2 = |C| 2 = |A - B| 2 = (A - B) • (A - B) 

= AA + BB-2AB 
= |A | 2 + |B| 2 — 2A • B 
= a 2 + b 2 — lab cos 6. 



If we apply the definition (1) to the mutually perpendicular unit vectors i, j, 
and k introduced in Section 18.1, we obtain 


i i = j ■ j =k k = 1, 
i j=i k = j k = 0. 


(7) 


These facts enable us to find a convenient formula for computing the dot prod¬ 
uct of any two vectors given in i, j, k form, 

A = a\\ + a 2 j + 33 k and B = b\\ + 6 2 j 4- /73k. 


If we expand A • B by using (7) together with the general properties previously 
discussed, we get 


A • B = a\b\ + <a 2 fr 2 + a^b^ 


( 8 ) 
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since six of the nine terms in the expansion vanish. Thus, to compute A • B, we 
simply multiply their respective i-, j-, and k-components, and add. 

If A and B are nonzero vectors, the definition (1) can be written in the form 


A • B 


(9) 


This formula displays the main significance of the dot product in geometry: It 
provides a simple way to find the angle between two vectors and, in particular, 
to decide when two vectors are perpendicular. Indeed, if we agree that the zero 
vector is perpendicular to every vector, then by (9) we see at once that 

A _L B if and only if A • B = 0. 


Formula (8) makes it possible for us to use the dot product in these ways as a 
convenient computational tool. 


Example 2 Find the cosine of the angle 6 between the vectors A = i + 2j + 
2k and B = — 3i + 4j. 


Solution It is clear that 

|A| = Vl + 4 + 4= 3, |B| = V9 + 16 = 5, A-B = -3 + 8 + 0 = 5. 

Therefore by (9) we have 


A • B _J_ 

|A||B| 3-5 


3' 


If we want the angle 6 itself, we can use a calculator to find that 6 = 70.5 ’. 


Example 3 Compute the cosine of the angle 6 between A and B if A = i — 2j 
+ 2k and B = —i + ck, and find a value of c for which A _L B. 


Solution We have 

|A| = Vl+4 + 4 = 3, |B| = Vl + c 2 , A • B = -1 + 2c, 


so 


„ A • B 


2c - 1 
3Vl + c 2 ' 


When c this quantity has the value 0, and hence the vectors are perpendic¬ 
ular. 



The simplest physical illustration of the use of the dot product is furnished by 
the concept of work. We recall that the work W done by a constant force F ex¬ 
erted along the line of motion in moving a particle through a distance d is given 
by W = Fd. But what if the force is a constant vector F pointing in some direc¬ 
tion other than the line of motion from P to Q, as shown in Fig. 18.11 ? Only the 
vector component of F in the direction of the line of motion does work, so in 
this case we have 


Figure 18.11 


W = (|F| cos e)\PQ\ = \F\\PQ\ cos 0 = F ■ PQ, 
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that is, 


W = F • PQ. 


( 10 ) 


In more advanced treatments of the physical uses of vectors, it is often neces¬ 
sary to calculate the work done by variable forces whose points of application 
move along curved paths, and formula (10) is the starting point for all such ap¬ 
plications. 


PROBLEMS 


1 Show that the vectors 

A = i + 3j + 4k, 

B = 4i + 4j - 4k, 

are perpendicular. 

2 Show that 

A = i - 2 j + k, 

B = i - k, 

C = i + j + k, 

are mutually perpendicular. 

3 Find the angle between each of the given pairs of vec¬ 
tors: 

(a) A = i + 2j + k, B=— i+j + 2k; 

(b) A = i, B = i + j; 

(c) A = 3i + 4j, B = 4i - 3j + 9k. 

4 Use dot products to show that the given three points are 
the vertices of a right triangle. Which is the vertex of the 
right angle? 

(a) P = (1, 7, 3), Q = (0, 7, - 1), R = (-1, 6, 2). 

(b) P = (2, -5, -2), Q = (-1, -2, 2), R = (4, 1, -5). 

(c) P = (2, 7, -2), Q = (0,4, - 1), R = (1, 4, 1). 

5 Show that the vectors 

A = i - 3j - 5k, 

B = 2i - j + k, 

C = 3i — 4j — 4k, 

form the sides of a right triangle if placed in the proper 
positions. 

6 Find the angle 6 between a diagonal of a cube and 

(a) an adjacent edge; 

(b) an adjacent diagonal of a face. 

7 Let A be a nonzero vector, and suppose that B and C are 
two vectors such that A • B = A • C. Is it legitimate to 
cancel A from both sides of this equation and conclude 
that B = C? Explain. 

8 Find a value of c for which the given vectors will be per¬ 
pendicular: 

(a) 3i - 2j + 5k, 2i + 4 j + ck; 

(b) i + j + k, i + j + ck. 

9 If a = |A| and b = |B|, show that the vector bA -I- aB bi¬ 
sects the angle between A and B. 

10 With the notation of Problem 9, show that bA + aB and 16 
bA — aB are perpendicular. 


11 Use the dot product to prove that an angle inscribed in 
a semicircle is a right angle. Hint: With the notation in 
Fig. 18.12, calculate (A + B) • (A — B). 



12 If Q = (1, — 1, 7), find the points P = (0, c, c) on the line 
z = y in the yz-plane such that the vector OP is perpen¬ 
dicular to the vector PQ. 

13 If A = i + 2j - 3k and B = 4i - 2k, find the vector 
component of A along B. Solve this problem by finding 
a general formula for the vector component of A along 
B if A and B are any two vectors. 

14 Use vector methods to show thatthe distance from a point 
Uo, yo) to a line ax + by + c = 0 (both in the .ry-plane) 
is 

\ax 0 + by 0 + c\ 

V a 2 + b 2 

15 Use property (6) to prove the parallelogram law of ele¬ 
mentary geometry: The sum of the squares of the diag¬ 
onals of a parallelogram equals the sum of the squares 
of the four sides. Hint: With the notation of Fig. 18.13, 
use (6) to expand |A + B| 2 + |A - B| 2 . 



Figure 18.13 The parallelogram law. 

For the triangle OAB in Fig. 18.14, the law of cosines 
states that 
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|A - B| 2 = |A| 2 + |B| 2 - 2|A||B| cos 9. 

Give another proof of formula (8) by solving this equa¬ 
tion for |A||B| cos 6 and simplifying the result. 



17 If a vector V = a\ + bj + ck makes angles a, /3, and y 
with the positive x-, y-, and z-axes (Fig. 18.15), then these 
angles are called the direction angles and cos a, cos ( 3 , 
and cos y are called the direction cosines of V. Show that 

(a) (a\ + b] + ck)/Va 2 + b 2 + c 2 is a unit vector hav¬ 
ing the same direction as V; 

(b) cos a = — ; Q — _ , 

V< 2 2 + b 2 + c 2 


- .-, 

Vtf 2 + b 2 + c 2 

c 

cos y = — - ; 

Va 2 + b 2 + c 2 


(c) cos 2 a + cos 2 /3 + cos 2 y = 1. 

18 How many lines through the origin make angles of 45° 
with both the positive x-axis and the positive y-axis? 

19 How many lines through the origin make angles of 60° 



with both the positive x-axis and the positive y-axis? 
What angles do they make with the positive z-axis? 

20 Find the work done by the force F when its point of ap¬ 
plication moves from P to Q : 

(a) F = 2i - 5j + 3k, P = (1,2, -2), Q = (3, -1, 1); 

(b) F = 3i + 2j - 3k, P = (-1, 2, 3), Q = (1,2, -1). 

21 Find the work done by a force F = —ck when its point 
of application moves from P = (jcj, y u z\) to Q = 
( X2 , }> 2 , zi). 

22 Find the work done by a constant force F if its point of 
application moves around a closed polygonal path. 


18.3 

THE CROSS PRODUCT 
OF TWO VECTORS 


A X B 



Many problems in geometry require us to find a vector that is perpendicular to 
each of two given vectors A and B. A routine way of doing this is provided by 
the cross product (or vector product) of A and B, denoted by A X B. This cross 
product is very different from the dot product A • B — for one thing, A X B is 
a vector, while A • B is a scalar. First we define this new product, then we de¬ 
scribe its algebraic properties so that we can compute it with reasonable ease, 
and finally we illustrate some of its uses. 

Consider two nonzero vectors A and B. Suppose that one of these vectors is 
translated, if necessary, so that their tails coincide, and let 0 be the angle from 
A to B (not from B to A), with 0 < 0 < tt. If A and B are not parallel, so that 
0 < 6 < 77, then these two vectors determine a plane, as shown in Fig. 18.16. We 
now choose the unit vector n which is normal (perpendicular) to this plane and 
whose direction is determined by the right-hand thumb rule. This means that if 
the right hand is placed so that the thumb is perpendicular to the plane of A and 
B and the fingers curl from A to B in the direction of the angle 6, then n points 
in the same direction as the thumb of this hand. This gives the direction of the 
vector A X B that we are defining. Not only do the vectors A and B determine 


Figure 18.16 
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the plane under consideration, but they also determine a parallelogram in this 
plane, of area |A||B| sin 6 (see Fig. 18.17). We take the area of this parallelogram 
as the magnitude of the vector A X B. With these preliminaries, we can now state 
the definition of the cross product of A and B, in this order, as follows: 

A X B = |A||B| sin 0n. (1) 

Observe that if A or B is 0, or if A and B are parallel, then they do not deter¬ 
mine a plane, and hence the unit normal vector n is not defined. But in these 
cases |A| = 0 or |B| = 0, or sin 6 = 0, so by (1) we have A X B = 0 and the de¬ 
termination of n is not necessary. If we agree that the zero vector is to be con¬ 
sidered as parallel to every vector, then it is easy to see that 

A is parallel to B if and only if A X B = 0. 

In particular, we have 

A X A = 0 


for every A. If instead of A X B we consider B X A, then the direction of the 
angle 6 is reversed, and we must flip the right hand over so that the thumb points 
in the opposite direction. This means that n is replaced by — n, and therefore 

B X A = —A X B. (2) 

This shows that the cross product is not commutative, and we must pay close at¬ 
tention to the order of the factors. 

If we keep (2) in mind and apply the definition ( 1 ) to the unit vectors i, j, and 
k (Fig. 18.18), then we easily see that 

i x j = —j x i = k, 

j x k = -k x j = i, (3) 

k X i - -i X k = j, 

and also that 


ixi=jxj = kxk = 0. 

For example, the right-hand thumb rule says that the direction of i X j is the same 
as the direction of k. But the area of the parallelogram determined by i and j is 
1, and since k itself has length 1, we have 

ixj = k. 

The products (3) are easy to remember by visualizing the figure. Another way to 
remember them is to arrange i, j, and k in cyclic order. 






A 

Figure 18.17 


Figure 18.18 
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j/->j -»k, 

and to observe that 

(each unit vector) X (the next one) = (the third one). 

Our next objective is to develop a convenient formula for calculating A X B 
in terms of the components of A and B, where 

A = a\i + a 2 j + 03k and B = b\\ 4- Z? 2 j + b^k. ( 4 ) 

In order to multiply out the product 

A X B = (#ii + (22 j + 03k) X (b\i + j "h 63k) = ? 

we need to know that the cross product possesses the following algebraic prop¬ 
erties: 


(cA) X B = c(A X B) = A X (cB), (5) 

A X (B + C) = A X B + A X C, ( 6 ) 

(A + B) X C = A X C + B X C. (7) 

Property (5) is easily established directly from the definition (1). Property (7) 
follows from ( 6 ) by using ( 2 ), 

(A + B)XC = -[C X (A + B)] 

= -(C XA + CXB) 

= —C X A — C X B 
= AXC + BXC. 

The real difficulty here is with the distributive law ( 6 ). There is no simple proof 
of this fact; and rather than hold up our progress by pausing to insert a compli¬ 
cated proof here, we simply take ( 6 ) for granted and continue on to our imme¬ 
diate objective. A proof of ( 6 ) is given in Remark 2 for the use of any students 
who may wish to examine it. 

We continue with our task of multiplying out the cross product of the vectors 
(4). Remembering to pay close attention to the order of the factors, we have 

A X B = (a\i + <22 j + a^k) X ( b\i + 62 j + 63 k) 

= a\i X ( b\i + b2 j + 63k) -I- a 2 j x (b\i + &2j + 63k) 

-I- a^k X ( b\i + 62j + 63k) 


so 

A X B = a\b\i X i -I- afai X j -I- a\b# X k 

+ ^ 2^1 j X i + tf 2 ^ 2 j X j + ^2^3j X k 

-I- a^b\k X i -I- a 3 £ 2 k X j -I- a^b^k X k. 

By using (3), we now obtain the rather awkward formula 

A X B = i(< 32^3 - a^b 2 ) ~ jiaibi - a 3 b\) + k{a\b 2 ~ a 2 b\). (8) 

(The slightly strange way of writing the signs here has a purpose that will be¬ 
come clear below.) 
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It is not necessary to memorize formula ( 8 ), because there is an equivalent ver¬ 
sion involving determinants that is easy to remember. We recall that a determi¬ 
nant of order 2 is defined by 


For example, 


a\ 

b\ 


a2 
b 2 


— a\b 2 ~ ci2b\. 


3 

4 


-2 

5 


= 3 • 5 - (-2) • 4 = 23. 


A determinant of order 3 can be defined in terms of determinants of order 2: 


ax 

a 2 

a 3 

b\ 

b 2 

b 3 

c 1 

c 2 

c 3 


b 2 b 3 
ci c 3 



b 3 

C3 


+ a 3 


bi 
c i 


b 2 

C2 


(9) 


Here we see that each number in the first row on the left is multiplied by the de¬ 
terminant of order 2 that remains when that number’s row and column are deleted. 
We particularly notice the minus sign attached to the middle term on the right 
side of formula (9). 

Even though a determinant of order 3 can be expanded along any row or col¬ 
umn, we use only expansions along the first row, as in (9). For example, 



3 3 


4 3 


4 3 

= 3 

7 1 

- 2 

-2 1 

+ (-D 

-2 7 


= 3(3 - 1 — 3 - 7) — 2[4 - 1 — 3 - (-2)] + (—1)[4 • 7 -3 • (-2)] 

= - 54 - 20 - 34 = -108. 

Formula (8) for the vector product of A = a \i + a 2 j + a 3 \L and B = b\\ + 
b 2 ] + b 3 k is clearly equivalent to 


A X B = 


«2 

b 2 



a i 
bx 


*3 

b 3 


+ k 


a 2 

b 2 


( 10 ) 


Motivated by (9), we now write (10) in the form 


A X B 


i j k 

a\ a 2 a 3 

b\ b 2 b 3 


(ID 


This is the concise and easily remembered formula for A X B that we have been 
seeking. The “symbolic determinant” here is to be evaluated by expanding along 
its first row, just as in equation (9). We emphasize that the components of the 
first vector A in A X B form the second row of the determinant in (11), and that 
the components of the second vector B form the third row of this determinant.* 


*Some authors define A X B by formula (11). This approach has several disadvantages, one of which 
is that considerable effort is needed before the geometric nature of A X B (that is, its length and di¬ 
rection) can be understood. We prefer to define A X B directly, in terms of its length and direction, 
and to consider formula (11) as simply a convenient tool for making calculations. Definitions of vec¬ 
tor operations that avoid dependence on explicit representations of vectors in terms of any particu¬ 
lar coordinate system are called invariant or coordinate-free. 
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Example 1 Calculate the cross product of A = 2i - j + 4k and B = i + 5 j - 

3k. 


Solution By formula (11) we have 



= -17i + lOj + Ilk. 


As a routine check to help guard ourselves against computational errors, we ob¬ 
serve that our answer is perpendicular to A because (A X B) • A = —34 — 10 + 
44 = 0, and is perpendicular to B because (A X B) • B = -17 + 50 — 33 = 0. 


Example 2 Find all unit vectors perpendicular to both of the vectors A = 2i — 
j + 3k and B = -4i + 3j - 5 k. 


Solution 

compute 

A x 


Since A X B is automatically perpendicular to both A and B, we 



= - 4i - 2j + 2k. 


We next convert this in to a unit vec tor in the same direction by dividing by its 
own length, which is Vl6 + 4 + 4 = V24 = 2V6: 


—4i - 2 j + 2 k _ — 2 i — j + k 

2V6 V6 


And finally, we introduce a plus-or-minus sign, 

± — 2 i — j + k 
V6 ’ 

because there are two possible directions. 


Example 3 Find the area of the triangle whose vertices are P = (2, — 1, 3), Q = 
(1,2, 4), and R = (3, 1, 1 ). 


Solution Two sides of the triangle are represented by the vectors 
A = PQ = (1 — 2)i + (2 + l)j + (4 - 3)k = -i + 3j + k, 
B = ~PR = (3 - 2)i + (1 + l)j + (1 - 3)k = i + 2j — 2k. 


The vector 


A X B = 


-1 

1 


j 

3 

2 


k 

1 

-2 


= - 8 i - j - 5 k 


has magnitude V64 + 1 + 25 = V90 = 3VT0, and this is equal to the area of 
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the parallelogram with A = PQ and B = PR as adjacent sides. The area of the 
given triangle is clearly half the area of this parallelogram, and is therefore 4v4o. 


Remark 1 The cross product arises quite naturally in many situations in physics. 
For example, if a force F is applied to a body at a point P (Fig. 18.19), and if R 
is the vector from a fixed origin O to P, then this force tends to rotate the body 
about an axis through O and perpendicular to the plane of R and F. The torque 
vector T defined by 

T = R X F 

specifies the direction and magnitude of this rotational effect, since |R||F| sin 0 
is the moment of the force about the axis, namely, the product of the length of 
the lever arm and the scalar component of F perpendicular to R. 

As another example, we mention the force F exerted on a moving charged par¬ 
ticle by a magnetic field B. It turns out that 



Figure 18.19 Torque vector. 


F^VXB, 


where V is the velocity of the charged particle and q is the magnitude of its 
charge. This is the primary fact that causes the aurora borealis, or northern lights, 
which are produced by blasts of charged particles from the sun streaming through 
the magnetic field of the earth. This basic principle of electromagnetism also un¬ 
derlies the design and operation of cyclotrons and TV sets. 


Remark 2 We now return to the problem of establishing the distributive law (6). 
We prove (6) only for unit vectors A, because once this has been done, an ap¬ 
plication of (5) allows us to obtain (6) immediately for vectors A of arbitrary 
length. 

With a unit vector A and an arbitrary vector V, A X V can be constructed by 
performing the following two operations, shown on the left side of Fig. 18.20: 
First, project V on the plane perpendicular to A to obtain a vector V' of length 
|V| sin 0; then rotate V' in this plane through an angle of 90° in the positive di¬ 
rection to obtain V", which is A X V since A is a unit vector. Each of these op¬ 
erations transforms a triangle into a triangle; so if we start with the three vectors 
B, C, and B + C shown on the right, the final three vectors B", C", and (B + 
C)" still form a triangle, and therefore (B + C)" = B" + C" But this means that 

AX(B + C) = AXB+AXC, 



and the argument is complete. 



Figure 18.20 The distributive law. 
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PROBLEMS 

1 Calculate A X B and check the result by showing that it 9 

is perpendicular to both A and B: 

(a) A = 3i - 2j + 4k, B = 2i + j — 2k; 10 

(b) A = 2i + 2j — k, B = i + j + k; 

(c) A = 5i - 4j + 3k, B = -3i - 2j + k; 11 

(d) A = i, B = i + j. 

2 Find a vector N perpendicular to the plane of the three 
points P = (I, — 1, 4), Q = (2, 0, 1), R = (0, 2, 3). 

3 Find the area of the triangle PQR in Problem 2. 

4 Find the distance from the origin to the plane in Prob¬ 
lem 2 by finding the scalar projection of OP along the 
vector N. 

5 If A • (B X C) = 0, what can be concluded about the 
configuration of A, B, and C? 

6 Show that |A X B| 2 = |A| 2 |B| 2 - (A • B) 2 . 

7 Show that the cross product is not associative by show¬ 
ing that 12 

A x (B x C) * (A x B) x C 

for the three vectors A = i + j, B = j, C = k. 

8 Show that the cross product of each pair of the follow¬ 
ing vectors is parallel to the third: i - 2j + k, i + j + 
k, i — k. What does this tell us about the configuration 
of the vectors? 

Since all the machinery of vector algebra is now in place, it might be expected 
that we would next turn to the calculus of vector functions in three-dimensional 
space. However, a full study of this subject belongs to a later course in advanced 
calculus or vector analysis, and is not part of our purpose in this book. 

Nevertheless, on a few occasions we will need to consider the position vector 
R (t) = x(t)\ + y(t )j + z(f)k of a point P that moves along a space curve, as shown 
in Fig. 18.21. The derivative of this function is defined in the obvious way, 

dR R(r + \t) - R (t) 

dt Ar—>0 A t 

and has all the properties we expect on the basis of our experience in Chapter 
17. In particular, dR/dt is tangent to the path at the point P, and is the velocity 
of P if the parameter t is time, and the unit tangent vector if t is arc length. 

With these brief remarks we put aside the calculus of vector functions, and 
turn to the main subject of the rest of this chapter, namely, the analytic geome¬ 
try of lines, planes, and curved surfaces in three-dimensional space. We shall find 
that the vector algebra discussed in the preceding sections is a very valuable tool 
for this work. 

As we know, in plane analytic geometry a single first-degree equation, 

ax + by + c = 0, 

is the equation of a straight line (assuming that a and b are not both zero). How¬ 
ever, we shall see that in the geometry of three dimensions such an equation rep- 
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Figure 18.21 A space curve. 


Show that if A is a nonzero vector and A X B = A X C, 
then B = C is not necessarily true. 

If A, B, and C are mutually perpendicular, show that A X 
(B X C) = 0. 

Let P i and Qi he two points on a line L\, and let P 2 and 
Q 2 be two points on a line L 2 . If L\ and L 2 are not par¬ 
allel, then the perpendicular distance d between them is 
the absolute value of the scalar projection of P\P 2 on a 
unit vector that is perpendicular to both lines. Why? 

(a) Show that 

—» P \Q\ x P2Q2 

P\P 2 * 


d = 


\P\Qt x PiQi\ 


(b) Find d if L\ is the line determined by P\ = ( — 1, 1, 1) 
and Q\ — (1,0,0), and L 2 is the line determined by 
P 2 = (3, 1,0) and Q 2 = (4, 5, -1). 

If A = 2i — 3j + k is normal to one plane and B = 
—i -F 4j — 2k is normal to another plane, do the planes 
necessarily intersect? Give a reason for your answer. If 
they do intersect, find a vector parallel to their line of 
intersection. 
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resents a plane, and therefore it is not possible to represent a line in space by any 
single first-degree equation. 

We begin with the study of lines. A line in space can be given geometrically 
in three ways: as the line through two points, as the intersection of two planes, 
or as the line through a point in a specified direction. The third way is the most 
important for us. 

Suppose L is the line in space that passes through a given point Pq = U 0 , y o> Zo) 
and is parallel to a given nonzero vector 

V = a\ + b) 4- ck, 

as shown in Fig. 18.22. Then another point P = (x, y, z) lies on the line L if and 
only if the vector P 0 P is parallel to the vector V. That is, P lies on L if and only 
if PoP is a scalar multiple of V, so that 

P^P = t\ (I) 

for some real number t . If Ro = OPq and R = OP are the position vectors of Pq 
and P, then P 0 P = R — Rq and ( 1 ) gives 



R = R 0 + fV, 


( 2 ) 


which is the vector equation of L. As t varies from - oo to °°, the point P tra¬ 
verses the entire infinite line L, moving in the direction of V. 

If we write (2) in the form 

x\ + yj + zk — x 0 [ + y 0 j + Zok + t(ai + bj + ck) 

and equate the coefficients of i, j, and k, we get the three scalar equations 

x = x§ + at, 

y = yo + bu (3) 

Z = Zo + ct. 


These are the parametric equations of the line L through the point Pq = (jcq, yo> £o) 
and parallel to the vector V = a\ + b\ + ck. Observe that the parametric equa¬ 
tions of a straight line are not unique. The numbers xo, yo, and zo can be replaced 
by the coordinates of any other point on L, and a , b , and c can be replaced by 
the components of any other nonzero vector parallel toL, and the resulting para¬ 
metric equations will be completely equivalent to equations (3) in the sense that 
they describe the same line. 

In order to obtain the Cartesian equations of the line, we eliminate the para¬ 
meter from equations (3) by equating the three expressions obtained by solving 
for t. This gives 


x ~ *o = y ~ yo = z ~ zo ^ 

a b c 

These are called the symmetric equations of the line L. If any one of the con¬ 
stants a, b, c is zero in a denominator of (4), then the corresponding numerator 
must also be zero. This is easy to see from the parametric form (3), which shows, 
for example, that if 


x = xo + at and 


a = 0, 
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then jc = xq. Thus, when one of the denominators in (4) vanishes, we interpret 
this as meaning that the corresponding numerator must also vanish. With this in¬ 
terpretation, equations (4) can always be used, even though division by zero is 
normally forbidden. 

Example 1 A line L goes through the points P 0 = (3, —2, 1) and P\ = (5, 1, 0). 
Find the parametric equations and the symmetric equations of L. Also find the 
points at which this line pierces the three coordinate planes. 


Solution The line L is parallel to the vector PqP\ = 2i + 3j — k, so by using 
Po as the known point on the line, equations (3) give the parametric equations 


x = 3 + 2t, 
y = -2 + 3 1, 
z = \ ~ t. 

By eliminating r, we obtain the symmetric equations 

x - 3 _ y + 2 _ z ~ 1 
2 3 -1 ’ 

To find the point at which L pierces the xy-plane, we set z — 0 in the third para¬ 
metric equation and see that t = 1. With this value of t, x = 5 and y = 1, so the 
point is (5, 1,0). Similarly, x = 0 implies that t = — |, so the point in the yz-plane 
is (0, — y, f); and y = 0 implies r = f, so the point in the jcz-plane is (y, 0, y). 


Now we turn to the study of planes. A plane can also be characterized in sev¬ 
eral ways: as the plane through three noncollinear points, as the plane through a 
line and a point not on the line, or as the plane through a point and perpendicular 
to a specified direction. Again, the third approach is the most convenient for us. 

Consider the plane that passes through a given point P 0 = (*o> yo> Zo) and is 
perpendicular to a given nonzero vector 

N = a\ + b) + ck, (5) 


z 



as shown in Fig. 18.23. Another point P = (jc, y, z) lies on this plane if and only 
if the vector PqP is perpendicular to the vector N, which means that 

N • PoP = 0. ( 6 ) 

If Ro = OPo and R — OP are the position vectors of Pq and P, so that PqP = 
R — Ro, then ( 6 ) becomes 

N • (R - R,) = 0. (7) 

This is the vector equation of the plane under discussion. 

Since R — Rq = (x — ^o)i + (y — yo)j + (z ~ Zo)k> (7) can be written out in 
the scalar form 


a(x - x 0 ) + b(y - y 0 ) + c(z - z 0 ) = 0 . ( 8 ) 

This is the Cartesian equation of the plane through the point Pq = (*o> Jo* Zo) 
with normal vector N = ai + b] + ck. For example, the equation of the plane 
through Pq = (5, —3, 1) with normal vector N = 4i + 3j — 2k is 


Figure 18.23 A plane in space. 
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or 


4(;c - 5) + 3(y + 3) - 2(z - 1) =0 


4x + 3y — 2z = 9. 


Observe that the coefficients of jc, y, and z in the last equation are the compo¬ 
nents of the normal vector. This is always the case, for equation (8) can be writ¬ 
ten in the form 


ax + by + cz = d, 


(9) 


where d = ax o + byo + czo\ and the coefficients of jc, y, and z in this equation 
are clearly the components of the normal vector (5). Conversely, every linear 
equation in jc, y, and z of the form (9) represents a plane with normal vector 
N = ai + b] + ck if the coefficients a , b, and c are not all zero. To see this, we 
notice that if (for instance) a =£ 0, then this permits us to choose yo and Zq arbi¬ 
trarily and solve the equation a :cq + byo + czo = d for jcq. With these values, (9) 
can be written as 


ax + by + cz = ax o + byo + czo 


or 


a(x - x 0 ) + b(y - y 0 ) + c(z ~ Zo) = 0, 

and this is immediately recognizable as the equation of the plane through 
(jc 0 , yo, Zo) with normal vector N = ai + b] + ck. 


Example 2 Find an equation for the plane through the three points Pq = (3, 2, 
-1), P\ = (1, — 1, 3), and P 2 = (3,-2,4). 


Solution To use equation (8), we must find a vector N that is normal to the 
plane. This is easy to do by using the cross product. We compute 


N — PoP\ x PqPi 



k 

4 

5 


— i + lOj + 8k. 


Since PqP\ and PqPi li e i n the plane, their cross product N is normal to the plane. 
Using equation (8) with Pq as the given point, our plane has the equation 


Qc — 3) + 10(y — 2) + 8(z + 1) = 0 


or 


after simplification. 


x + lOy + 8z = 15, 


Example 3 Find the point at which the line 

;c - 2 _ y -I- 3 _ z ~ 4 
1 “ 2 ” 2 

pierces the plane jc + 2y + 2z = 22. 

Solution jo find parametric equations for the line, we introduce t as the com¬ 
mon ratio in the given symmetric equations, 
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x - 2 _ y + 3 _ z - 4 

1 “ 2 ” 2 


= t y 


which gives 

x = 2 + t, y = -3 + 2t, z = 4 + 2r. 

We want the value of t for which the variable point (jc, y, z) on the line lies on 
the given plane. By substituting these equations into the equation of the plane, 
we obtain 


(2 + t) + 2(—3 + It) + 2(4 + 2 1) = 22, 

so t = 2 at the point where the line pierces the plane. By substituting t = 2 back 
in the parametric equations of the line, we find that the desired point is (4, 1, 8). 


Example 4 Find the cosine of the angle between the two planes x + 4y — 4z = 
9 and x + 2y + 2z = “3. Also, find parametric equations for the line of inter¬ 
section of these planes. 


Solution Clearly the angle 6 between two planes is the angle between their nor¬ 
mals (Fig. 18.24). By inspecting the equations of the given planes, we see at once 
that their normals are 

N! = i + 4j -4k, N 2 = i + 2j + 2k. 

We therefore use the dot product to obtain 

. _ Nj • N 2 _ 1 

|Ni||n 2 | 3 V 33 ' 


From this we can find the angle 6 if we wish, by tables or otherwise. 

To find parametric equations for the line of intersection, we need a vector V 
parallel to this line and a point on the line. We find V by computing the cross 
product of Ni and N 2 , 


V = N, x N 2 = 


i 

1 

1 


j k 

4 -4 
2 2 


= 16i - 6j - 2k. 



Figure 18.24 
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Since any vector parallel to the line will do, we divide by 2 and use the slightly 
simpler vector 8i — 3j — k. To find a point on the line, we can set z = 0 and solve 
the resulting system in the unknowns x and y, 

x + Ay = 9, 
jc + 2 y= -3. 

This yields x = - 15, y = 6. The desired point is therefore (—15, 6, 0), and the 
parametric equations of the line are 

x = —15 + 8f, 

y = 6 — 3t, 

z = -t. 

We repeat that there is nothing unique about these equations, for we could have 
found a point on the line in many other ways and there are many different vec¬ 
tors parallel to the line. 


As we remarked at the beginning of this section, any two intersecting planes 
determine a straight line in space. The equations of the two planes are satisfied 
simultaneously only by points on the line of intersection. From this point of view, 
a pair of linear equations considered as a simultaneous system can be interpreted 
as representing a line, namely, the line of intersection of the two planes repre¬ 
sented by the individual equations. (Of course, the planes must actually intersect, 
and not be parallel or identical.) Thus, in Example 4 the pair of simultaneous 
equations 

x + 4y - 4z = 9, 
x + 2y + 2z = -3, 

represents the line discussed in that example. We also point out that the sym¬ 
metric equations (4) are equivalent to the three simultaneous equations 

b(x - xo) ~ a(y - y 0 ) = 0, 

c(x - x 0 ) ~ a(z ~ Zo) = 0, 

c(y - yo) ~ b(z - zo) = 0. 

These are the equations of three planes that intersect in the line L represented by 
(4). The first has normal vector b \ — a\, which is parallel to the xy-plane, so the 
first plane is perpendicular to the xy-plane. Similarly, the second plane is per¬ 
pendicular to the jcz-plane and the third is perpendicular to the yz-plane. Any pair 
of these equations represents the line L, which is the intersection of the corre¬ 
sponding pair of planes. 


PROBLEMS 


1 Label each of the following statements as true or false: 

(a) Two planes perpendicular to a line are parallel. 

(b) Two lines perpendicular to a third line are parallel. 

(c) Two planes parallel to a third plane are parallel. 

(d) Two lines perpendicular to a plane are parallel. 


(e) Two planes parallel to a line are parallel. 

(f) Two lines parallel to a third line are parallel. 

(g) Two planes perpendicular to a third plane are parallel. 

(h) Two lines parallel to a plane are parallel. 
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2 What conclusion can be drawn about the lines 

x ~ xp _ y ~ yp = z - zo 
a b c 


12 (a) As a function of t, find the distance D from the point 

P 0 = (1, 2, 3) to a variable point on the line 

X = 3 + t, y = 2 + t, z=\+t. 


x~ x 0 = y - y 0 = z~ zo 
ABC 

if aA + bB + cC = 0? 

3 What conclusion can be drawn about the lines 


x-xp = y ~ yp = z~ zo 
a b c 


x-xi = y - y i = z-z i 
ABC 

if at A = b/B = c/Cl 

4 Write symmetric equations for the line through the point 
(3, 0, -2) and parallel to 

(a) the vector 4i - 3j + 5k; 

(b) the line (x + l)/7 = (y ~ 2)12 = z/(—3); 

(c) the x-axis. 

5 Write parametric equations for the line through the point 
(2, —1, -3) and parallel to 

(a) the vector i + 4j -2k; 

(b) the line xl3 = (y + 7)/( -1) = (z - 3)/6; 

(c) the line x = 2t - 3, y = 3 - 2t, z = 5t - 4. 

6 Write symmetric equations for the line through the points 

(a) (2, -1,3) and (5, 2, -2); 

(b) (7,3, -1) and (3, -1, 3). 

7 Write parametric equations for the line through the points 

(a) (2, 0, 3) and (-1, 3, 5); 

(b) (4,2, -1) and (0,2, -1). 

8 If L is the line through the points (—6,6, —4) and 
(12, —6, 2), find the points where L pierces the coordi¬ 
nate planes. 

9 Show that the lines 

x = \ + t, y = 2t, z = 1 + 3t 
and 

x = 3s, y = 2s, z = 2 + s 

intersect, and find their point of intersection. 

10 Find the distance between the lines 


and 

jc + 1 y -2 = z + 1 

1 _ 0 2 ; 

(b) * = 2t - 4, y = 4 — t, z = ~2t — 1 

and x = 4 t - 5 , y = — 31 + 5, z = —5 1 + 5. 
11 Find the distance from the origin to the line 


x — 4 
3 


y-2 


4-z 

5 


(b) By differentiating, find the value of t that minimizes 
D, find the actual minimum distance, and find the 
corresponding point P\ on the line. 

(c) Verify that the vector PoP\ is perpendicular to the 
line. 

13 Find the equation of the plane that contains the point 
(1,3, 1) and the line x = t,y = t, z = t + 2. 

14 Find symmetric equations for the line of intersection of 
each of the following pairs of planes: 

(a) 2x + y + z = 0, 3x + 4y - z = 10; 

(b) 2x + 3y + 5z = 21, 3x — 2y + z = 12. 

15 Use vector methods to show that the distance D from the 
point (*o, ;yo> zo) to the plane ax + by + cz + d = 0 is 
given by the formula 

\axp + by 0 + czp + d\ 

Va 2 + t? + c 2 

16 Show that the planes ax + by + cz + d\ = 0 and ax + 
by + cz + d 2 = 0 are parallel, and that the distance be¬ 
tween them is 

\di ~ d 2 \ 

Vtf 2 + b 2 + c 2 

17 Find the distance between the planes x - 2y + 4z = 1 
and 2x - 4y + 8 z = ~ 14. 

18 Find an equation for the plane that is parallel to the plane 
2x - 5 y + 3z = 7 and passes through the point (5, 2, 3). 

19 Consider the sphere of radius 3 with its center at the ori¬ 
gin. The plane tangent to this sphere at (1, 2, 2) inter¬ 
sects the x-axis at a point P. Find the coordinates of P. 

20 Find the value of the parameter t for which the planes 

3x - 4y + 2z + 9 = 0, 

3x + 4y - tz + 7 = 0, 

are perpendicular. 

21 Verify that the planes 

2x + 3y + 4z ~ 1 =0, 
x-2y + 3z-4 = 0, 

intersect in a line L. Find symmetric equations for L in 
two ways: 

(a) by finding two points on L and going on from there; 

(b) by first eliminating x from the given equations, and 
then y, to find two planes through L that are perpen¬ 
dicular to the yz-plane and the xz-plane, respectively, 
and then solving each of these equations for z and 
equating all the z' s. 


4 
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22 Show that the two sets of equations 

x - 4 _ y — 6 _ z + 9 
3 ” 4 ” -12 

and 

jc — 1 = y ~ 2 = z ~ 3 
-6 ” -8 24 

represent the same straight line. 

23 Let p\ and p 2 be two planes that intersect in a line L and 
have equations 

a\x + b\y + c\z + d\ = 0, 

a 2 x + b 2 y + c 2 z + d 2 = 0. 

If A: is a constant, show that 

(a \x + b\y + c\z + d\) + k(a 2 x + byy + c 2 Z + d 2 ) = 0 

is the equation of a plane containing L. For various val¬ 
ues of k, this equation represents every member of the 
family of all planes containing L, with one exception. 
What is this exception? 

24 Find the equation of the plane that contains the inter¬ 
section of the planes 2x + 3y — z = I and 3x — y + 
5z = 2 and passes through (1,4, 1). 

25 Find the equation of the plane that contains the inter¬ 
section of the planes x — 2y — 5z = 3 and 5 x + y — z = 
1 and is parallel to 4x + 3y + 4z + 7 = 0. 


26 Find the coordinates of the point P at which the line 

x - 1 = y + 3 = z-3 
3 4 2 

pierces the plane 3* + 4y + 5z = 76. 

27 Show that the line 

x + 8 = y- 10 = z-9 
9 _ 4 _ 6 

lies in the plane 2x — 3y + 5z = — 1. 

28 Show that the line of intersection of the planes 

x + y — z = 0 and x — y — 5z + 7 = 0 
is parallel to the line 

X + 3 = y - 1 = Z - 5 
3 -2 1 ' 

29 Find the cosine of the angle between the given planes: 

(a) 2x - y + 2z = 3, 3x + 2y — 6z = 7; 

(b) 5x - 3y + 2z = 3, x + 3y + 2z = -11. 

30 Show that the single equation 

(2x + y - z - 3) 2 + (x + 2y - 3z + 5) 2 = 0 

represents a straight line in space. (But this equation is 
of the second degree.) 


We know that the graph of an equation /(jc, y) = 0 is usually a curve in the ;ry- 
plane. In just the same way, the graph of an equation 

FU,y,z) = 0 (1) 

is usually a surface in jryz-space. The simplest surfaces are planes, and we saw 
in Section 18.4 that the equation of a plane is a linear equation that can be writ¬ 
ten in the form 
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ax + by + cz + d = 0; 


that is, it contains only first-degree terms in the variables jc, y, and z. In this sec¬ 
tion and the next we examine a few other simple surfaces containing terms of 
higher degree that often appear in multivariable calculus. 

Cylinders are the next surfaces after planes in order of complexity. To under¬ 
stand what these surfaces are, we consider a plane curve C and a line L not par¬ 
allel to the plane of C. B y a cylinder we mean the geometric figure in space that 
is generated (or swept out) by a straight line moving parallel to L and passing 
through C (Fig. 18.25)/ The moving line is called the generator of the cylinder. 
The cylinder can be thought of as consisting of infinitely many parallel lines, 


*This concept includes the familiar right circular cylinders of elementary geometry, for which the 
curve C is a circle and the line L is perpendicular to the plane of the circle. In geometry the adjec¬ 
tives are often omitted, because no other kinds of cylinders are considered. However, it should be 
noticed that when C is itself a straight line, the cylinder is a plane, so cylinders also include planes 
as special cases. 



Figure 18.25 A general cylinder. 


















VECTORS IN THREE-DIMENSIONAL SPACE. SURFACES 


654 



called rulings, corresponding to various positions of the generator. This is sug¬ 
gested in the figure. 

For example, suppose that the given curve C is the curve 

Ax,y) = 0 (2) 

in the jcy-plane, and let the generator be parallel to the z-axis, as shown in Fig. 
18.26. Then exactly the same equation (2) is the equation of the cylinder in three- 
dimensional space. The reason for this is that the point P = (jc, y, z) lies on the 
cylinder if and only if the point Pq = (x, y, 0) lies on the curve C, and this hap¬ 
pens if and only if/(;c, y) = 0. The essential feature of (2) as the equation of the 
cylinder is that it is an equation of the form (1) from which the variable z is miss¬ 
ing. To express this in another way, the fact that we are dealing with a cylinder 
whose rulings are parallel to the z-axis means that for a point P = (jc, y, z), the 
value of z has no bearing on whether P lies on the cylinder or not; and since only 
the variables jc and y are relevant to this issue, only the variables jc and y can be 
present in the equation of the cylinder—that is, z must be missing f rom this equa¬ 
tion. 



Example 1 Sketch the cylinder 



Solution This appears to be the equation of an ellipse in the jcy-plane. However, 
it is stated that this is a cylinder, and since the variable z is missing from the 
equation, the rulings of this cylinder are parallel to the z-axis. In Fig. 18.27, the 
ellipse in the jcy-plane is drawn first, then two vertical rulings, then a horizontal 
elliptical cross section above the jcy-plane. In spite of the limitations of our fig¬ 
ure (which we hope students will try to overcome by an active use of imagina¬ 
tion), it should be remembered that all rulings on a cylinder extend to infinity in 
both directions. This surface is called an elliptic cylinder 


It is clear that this discussion can be carried through for a cylinder with rul¬ 
ings parallel to any coordinate axis. We therefore have the conclusion that any 
equation in rectangular coordinates x, y, z with one variable missing represents 
a cylinder whose rulings are parallel to the axis corresponding to the missing 
variable. 


z 



Figure 18.28 Parabolic cylinder. 


Example 2 Sketch the cylinder z = x 2 . 

Solution In the Jtz-plane, this is the equation of a parabola with vertex at the 
origin that opens in the positive z-direction. However, we know that we are deal¬ 
ing with a cylinder, and since the variable y is missing from the equation, the 
rulings of this cylinder are parallel to the y-axis. In Fig. 18.28 the parabola in the 
jcz-plane is drawn first, then several rulings, and then a second parabolic cross 
section located to the right of the Jtz-plane. This surface can be described as a 
parabolic cylinder. 


Another way to generate a surface by using a plane curve C is to revolve the 
curve (in space) about a line L in its plane. The resulting surface is called a sur- 
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Figure 18.29 


face of revolution with axis L. In Chapter 7 we became acquainted with surfaces 
of revolution by calculating their areas as an application of definite integrals. We 
now consider the equations of these surfaces. 

Suppose, for example, that the curve C lies in the yz-plane and has equation 

/(y,z) = 0. (3) 

As this curve is revolved about the z-axis, a typical point P = (jc, y, z) on the re¬ 
sulting surface comes from a point Q on C, as shown in Fig. 18.29. Since Q lies 
on C, its coordinates (yo, Zo) satisfy (3), 

/(yo, zo) = 0. (4) 

But the relation of P to Q tells us that zo = z and yo = Vjc 2 + y 2 , so (4) yields 

/(VV + y2, z) = 0 (5) 

as the equation of the surface of revolution. Briefly, as Q swings out to the point 
P on the surface, the distances QR and PR to the z-axis are equal, and we get 
equation (5) by replacing y in (3) by Vjc 2 + y 2 . Equation (5) assumes that y > 
0 on C. If y is positive on some parts of C and negative on others, we must re¬ 
place y in (3) by ± Vjc 2 + y 2 to get 

/(±Vjc 2 + y 2 , z) = 0 

as the equation of the complete surface. The awkward radical with its plus-or- 
minus sign can usually be eliminated by squaring. 

Example 3 If the line z = 3y in the yz-plane is revolved about the z-axis, the re¬ 
sulting surface of the revolution is clearly a right circular cone of two nappes 
with vertex at the origin and axis the z-axis (Fig. 18.30). To get the equation of 
this cone, we replace y in the equation z = 3y by ± Vjc 2 + y 2 and then ratio¬ 
nalize by squaring: 

z - ±3^/x 2 + y 2 , z 2 = 9(jc 2 -1- y 2 ). 

If we had merely replaced y by Vjc 2 + y 2 to obtain 

z = 3 Vjc 2 + y 2 . 



Figure 18.30 Cone. 
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we would have had the equation of only the upper nappe of the cone, the part 
where z > 0. 


In essentially the same way, we can obtain equations for surfaces of revolu¬ 
tion with the x-axis or the y-axis as the axis of symmetry. 


PROBLEMS 


Sketch the cylinders whose equations are given in Problems 
1-8. If a cylinder has an obvious name, state it. 

\ y = x 2 . 2 y 2 + 4 z 2 = 16. 

3 x = sin y. 4 xz = 4. 

5 x + 3z = 6. 6 x 2 +z 2 = 9. 

7 x = tan y, - 7 t/2 < y < 7t/2. 

8 y = e*. 

9 The rulings of a cylinder are parallel to the y-axis. Its in¬ 
tersection with the *z-plane is a circle with center (0, 0, a) 
and radius a. Sketch the cylinder and find its equation. 

10 The rulings of a cylinder are parallel to the ;c-axis. Its in¬ 
tersection with the yz-plane is a parabola with vertex at 
(0, 0, 0) and focus at (0, 0, — p). Sketch the cylinder and 
find its equation. 

11 Find the equation of the surface of revolution generated 
by revolving the curve z = e~ yl about 

(a) the z-axis; (b) the y-axis. 

Sketch both surfaces. 

12 Find the equation of the surface of revolution generated 

by revolving the circle (y — b) 2 + z 2 = a 2 (a < b) about 
(a) the z-axis; (b) the y-axis. 

Sketch both surfaces. 

13 In each of the following, write the equation for the sur¬ 
face of revolution generated by revolving the given curve 
about the indicated axis, and sketch the surface: 

(a) y = z 2 , the y-axis; 

(b) 9x 2 + 4y 2 = 36, the y-axis; 

(c) z = 4 - x 2 , the z-axis; 

(d) x = y 2 , the x-axis. 


14 Any direction in space not parallel to the xy-plane can 
be specified by a vector of the form V = ai + fcj + k 
(why?). If a curve C in the ^y-plane has the equation 
f(x,y) = 0, show that the equation of the cylinder gen¬ 
erated by a moving line that is parallel to V and passes 
through C (Fig. 18.31) is 

fix ~az,y ~ bz) = 0. 

Hint: Write the symmetric equations of the line through 
a point (*o, yo, 0) on C and parallel to V. 



15 Find the equation of the cylinder generated by a line 
through the circle x 2 + y 2 = 6x in the .xy-plane that 
moves parallel to the vector V = 2i + 3j + k. 

16 Find the equation of the cylinder generated by a line 
through the parabola y = x 2 in the yy-plane that moves 
parallel to the vector V = — 2i — 3j + 5k. 


18.6 

QUADRIC SURFACES 


In Section 15.6 we learned that the graph of a second-degree equation in the vari¬ 
ables x and y is always a conic section—a parabola, an ellipse, a hyperbola, or 
perhaps some degenerate form of one of these curves, such as a point, the empty 
set, or a pair of straight lines. 

In three-dimensional space the most general equation of the second degree is 
Ax 2 + By 2 + Cz 2 + Dxy + Exz + Fyz + Gx + Hy + lz + J = 0. (1) 


We assume that not all of the coefficients A, R, .. . , F are zero, so that the de¬ 
gree of the equation is really 2 instead of 1 or 0. The graph of such an equation 
is called a quadric surface. We have already encountered several quadric sur- 
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faces, such as spheres and parabolic, elliptic, and hyperbolic cylinders, but there 
are a number of others as well. Indeed, if we set aside the familiar case of cylin¬ 
ders, then by suitable rotations and translations of the coordinate axes—which 
we do not discuss—it is possible to simplify any equation of the form (1) and 
thereby show that there are exactly six distinct kinds of nondegenerate quadric 
surfaces: 


1 The ellipsoid. 

2 The hyperboloid of one sheet. 

3 The hyperboloid of two sheets. 

4 The elliptic cone. 

5 The elliptic paraboloid. 

6 The hyperbolic paraboloid. 

In the following we give an example of each type of surface in which the equa¬ 
tion appears in as simple a form as possible. 

Students should become familiar with these surfaces and their equations, and 
in particular should try to understand how the shape of each surface is related to 
the special features of its equation. For the purpose of visualizing and sketching 
a surface, it is often useful to examine its sections , which are the curves of in¬ 
tersection of the surface with planes 

x = k, y = k, z = k 

parallel to the coordinate planes. We point out explicitly that every second- 
degree section of every quadric surface is a conic section. Sections that are closed 
curves are usually the easiest to sketch, and therefore we look for elliptic sec¬ 
tions and sketch these first. Symmetry considerations should also be kept in mind. 

In the following examples, the numbers a , b , and c are all assumed to be pos¬ 
itive. We comment informally rather than exhaustively on the surface considered 
in each example. 


Example 1 The ellipsoid 



( 2 ) 


is shown in Fig. 18.32. Since only even powers of x, y, and z occur in the equa¬ 
tion, this surface is symmetric about each coordinate plane. The sections in the 
xz- and yz-planes are the ellipses 




1 


with a common vertical axis. The section in a horizontal plane z — k is the 
ellipse 




and this decreases in size as k varies from 0 to c or — c. The numbers a , b , and 
c are the intercepts on the coordinate axes, and are called the semiaxes. If two 
of the semiaxes are equal, the ellipsoid is called a spheroid —an oblate spheroid 
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if it is flattened like a “flying saucer,” and a prolate spheroid if it is elongated 
like a football. Of course, if a = b = c, then the ellipsoid is a sphere. 



Figure 18.33 Hyperboloid of one 
sheet. 


Example 2 The graph of the equation 




b 2 


(3) 


is a hyperboloid of one sheet (Fig. 18.33). If we write the equation in the form 

JC 2 V 2 z 2 

—+ 7J = \+ 1, (4) 

a 2 b 2 c l 


then we see that all its horizontal sections in planes z — k are ellipses, and that 
these ellipses grow larger as their planes move up or down from the ry-plane, 
the smallest ellipse being the one in the xy-plane. The section of the surface in 
the yz-plane is the hyperbola 



It is this hyperbola that binds together the horizontal elliptical sections into a 
smooth surface. The phrase “of one sheet” is used because this surface consists 
of one piece, in contrast to the hyperboloid discussed in the next example, which 
consists of two pieces. Observe that equation (3) is obtained from (2) by chang¬ 
ing the sign of the third term on the left; we get the same kind of surface no mat¬ 
ter which of these terms has its sign changed. 



Figure 18.34 Hyperboloid of two 
sheets. 


Example 3 The hyperboloid of two sheets 



= 1 


(5) 


is shown in Fig. 18.34. This equation is obtained from (2) by changing the signs 
of the first two terms on the left. (The reason for this choice is explained below.) 
If we write the equation in the form 



( 6 ) 


then we see that all its horizontal sections in planes z = k with k > c or k < —c 
are ellipses or single points, while sections in planes z — k with \k\ < c are empty. 
The section in the yz-plane is the hyperbola 


b 2 


1 , 


and it is this hyperbola that unifies the horizontal sections into a smooth surface 
—of “two sheets.” Observe that (6) is identical with (4) except for the presence 
of the minus sign on the right, and it is this sign that makes all the difference be¬ 
tween the surfaces in these two examples; for the right side of (4) is positive for 
all z’s, whereas the right side of (6) is negative for |z| < c. 
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Example 4 The graph of the equation 



is an elliptic cone (Fig. 18.35). This surface intersects the xz-plane and the yz- 
plane in the pairs of intersecting straight lines 

c c 

z = ±—x and z = -~r y> 
a b 

respectively. It intersects the xy-plane at the origin alone. All horizontal sections 
in planes z — k with k # 0 are ellipses. (In Chapter 15 it was convenient to dis¬ 
tinguish circles from ellipses; here we include circles among the ellipses.) It is 
clear from the form of (7) that if (x, y, z) is a point on the surface, then (tx, ty, tz) 
is also on the surface for any number t. This tells us that the entire surface can 
be thought of as generated by a moving line through the origin O and a variable 
point P on any horizontal elliptical section. When a = b, the cone is the famil¬ 
iar right circular cone. 



Example 5 The elliptic paraboloid 

z = ax 2 + by 2 . (8) 

is shown in Fig. 18.36. The vertical sections of this surface in the xz-plane and 
yz-plane are the parabolas 

z = ax 2 and z = by 2 , 

respectively. The horizontal section in the plane z = k is an ellipse if k > 0, the 
origin alone if k = 0, and empty if k < 0. 


Example 6 In Fig. 18.37 we sketch the hyperbolic paraboloid 

z = by 2 — ax 2 . (9) 



Figure 18.36 Elliptic paraboloid. 


The section in the yz-plane is the parabola z = by 2 opening upward, and that in 
the xz-plane is the parabola z = —ax 2 opening downward. In all planes y = k 
parallel to the xz-plane, the sections are downward-opening parabolas that are 
identical with one another and can be thought of as hanging from their vertices 
at various points along the parabola z = by 2 \ this is emphasized in the way we 
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have drawn the figure. Near the origin the surface rises in the y-direction and 
falls in the x-direction, and thus has the general shape of a saddle or a mountain 
pass. For this reason, the surface is often called a saddle surface, with the origin 
as the saddle point. It is clear from (9) that in the horizontal plane z = k, the sec¬ 
tion is a hyperbola with principal axis in the y-direction if k > 0, and a hyper¬ 
bola with principal axis in the x-direction if k < 0; if k = 0, the section is a pair 
of intersecting straight lines through the origin. 


PROBLEMS 


Sketch and identify the surfaces in Problems 1-14. 


1 

2x 2 + y 2 + 4 z 2 = 16. 

2 

z 2 = 4(x 2 + y 2 ). 

3 

Z = 4(x 2 + y 2 ). 

4 

x 2 +z 2 ~ 4y 2 = 4. 

5 

y 2 - 4x 2 - 9z 2 = 36. 

6 

z = 4 - 2x 2 - 3y 2 . 

7 

z = ^-2j 2 . 

8 

x 2 = y 2 + 4z 2 . 

9 

x 2 - 4y 2 -z 2 = 4. 

10 

x 2 +9/- 4z 2 = 36. 

11 

36x 2 + 4y 2 + 9 z 2 = 36. 

12 

y = 4 — x 2 — 2 y 2 . 

13 

Z + 4;t 2 = y 2 . 



14 

x 2 + y 2 — z 2 ~ 2x — 4y 

+ 1 

= 0. 

15 

Find the points at which 

the line 


x — 6 

y + 2 z — 2 


3 

- 

6 4 


pierces the ellipsoid 



16 Show that the plane 2 x —2 z — y - 10 intersects the pa¬ 
raboloid 



at a single point, and find the point. 
17 (a) Consider the ellipsoid 


and find the area A(k) of the elliptical section in the 
horizontal plane z = k. Hint: Recall that ttAB is the 
area of an ellipse with semiaxes A and B. 

(b) Use the formula found in (a) to calculate the volume 
of the ellipsoid by integration. 

18 Consider the elliptic paraboloid z = ax 2 -I- by 2 , and use 
integration to show that the volume of the segment cut 
of f by the plane z = k(k> 0) is half the area of its base 
times its height. 

19 Show that the projection on the ;ry-plane of the curve of 
intersection of the surfaces z = 1 - x 2 and z = x 2 + y 2 
is an ellipse. Hint: What does it mean geometrically to 
eliminate z from these equations? 


20 Show that the projection on the yz-plane of the curve of 
intersection of the plane x = 2y and the paraboloid x = 
y 2 + z 2 is a circle. 

21 Show that the projection on the xy-plane of the intersec¬ 
tion of the paraboloids z = 3x 2 + 5y 2 and z = 8 — 5x 2 — 
3 y 2 is a circle. 

22 The two equations 

* 2 + 3y 2 - z 2 + 3x = 0, 

2;c 2 + 6y 2 - 2z 2 - 4y = 3, 

when taken together as a simultaneous system, define the 
space curve in which the corresponding surfaces inter¬ 
sect. Show that this curve lies in a plane. Hint: Project 
onto a coordinate plane. 

23 Use the methods of Section 15.6 to discover the nature 
of the graph of z = xy. Sketch the surface. 

A ruled surface is a surface S with the property that for each 
point P on S there is a straight line through P that lies entirely 
on S. All cones and cylinders are ruled surfaces, while ellip¬ 
soids, hyperboloids of two sheets, and elliptic paraboloids ob¬ 
viously are not. It is very surprising that all hyperboloids of 
one sheet and all hyperbolic paraboloids are ruled surfaces. 

24 Show that the hyperboloid of one sheet x 2 + y 2 - z 2 = 
1 is a ruled surface, as follows: 

(a) The section of the surface in the *y-plane is the cir¬ 
cle C whose equation is x 2 + y 2 = 1. Let Po = 
Uo, yo, 0) be a point on C, and show that the line L 
whose equations are 

* = *o + yo*, y = yo~ x 0 t, z = t, 

passes through Po and lies entirely on the surface. 

(b) If P = ( x, y, z) is an arbitrary point on the surface, 
show that the line L in part (a) passes through P for 
a suitable point Po = C*o, yo, 0). Thus, as Po moves 
around C, the lines L cover the surface.* 

"The family of lines jc = jc 0 + yot, y = yo - x 0 t, z = —t also covers 
the surface, and for this reason the hyperboloid of one sheet is often 
called a doubly ruled surface. 
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25 Show that the hyperbolic paraboloid z = y 2 — x 2 is a 
ruled surface by showing that if Pq = (* 0 , >’o, >’o 2 — *o 2 ) 
is any point on the surface, then the line 

* = xo + t, y = >’o + t, 
z = (To 2 “ *o 2 ) + 2(y 0 “ x 0 )t 
passes through Pq and lies entirely on the surface.* 


The two families of straight lines constituting the doubly ruled 
surfaces discussed in Problems 24 and 25 are shown in Fig. 


18.38. 




Figure 18.38 Doubly ruled surfaces. 


*The family of lines x ~ x 0 + t, y = y o — L z - (yo 2 - *# 2 ) — 
2(y 0 + x 0 )t also covers the surface, so the hyperbolic paraboloid is 
also doubly ruled. 


In plane analytic geometry we used a rectangular coordinate system for some 
types of problems and a polar coordinate system for others. We saw that there 
are many situations in which one system is more convenient than the other. The 
same is true for the study of geometry and calculus in space. We now describe 
two other three-dimensional coordinate systems, in addition to the now-familiar 
rectangular coordinate system, that are often useful for dealing with special kinds 
of problems. 

Consider a point P in space whose rectangular coordinates are (x, y, z). The 
cylindrical coordinates of this point are obtained by replacing x and y with the 
corresponding polar coordinates r and 6 , and allowing z to remain unchanged. 
That is, we place a z-axis on top of a polar coordinate system and describe the 
location of a point in space by the three coordinates (r, 6 , z). We will always as¬ 
sume that this cylindrical coordinate system is superimposed on a rectangular co¬ 
ordinate system in the manner shown in Fig. 18.39, so that the transformation 
equations connecting the two sets of coordinates of a given point are 

x = r cos 9, y = r sin 6, z = z, 
and 

y 

r 2 = x 2 + y 2 , tan 0 = — , z = z. 

x 

It is easy to see that the graph of the equation r — a constant is a right circular 
cylinder whose axis is the z-axis; this is the reason for the term “cylindrical co¬ 
ordinates.” Similarly, the graph of 6 = a constant is a plane containing the z-axis, 
and the graph of z = a constant is a horizontal plane. 


18.7 

CYLINDRICAL AND 

SPHERICAL 

COORDINATES 



. ...... .. _ Figure 18.39 Cylindrical coordinates. 

Example 1 Find cylindrical coordinates for the points P\ and P 2 whose rectan¬ 
gular coordinates are (3, 3, 7) and (2V3, 2, 5), respectively. 


Solution For P x we have r = \ 9 + 9 = 3V2, tan 6 = 1, z = 7, so a s et of 
cylindrical coordinates is (3V2, 7t/4, 7). For P 2 we have r = Vl2 + 4 = 4, 
tan 6 = 1/V3 = |V3, z = 5, so a set of cylindrical coordinates is (4, 7t/6, 5). 
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Example 2 Describe the surfaces 

(a) r + z = 3, and 

(b) r( 2 cos 6 + 5 sin 0) + 3z = 0. 

Solution (a) The intersection of the surface r + z = 3 with the yz-plane is the 
straight line y + z = 3, because r = y in the yz-plane. But 6 is missing from the 
given equation, so the desired surface is symmetric about the z-axis, and is there¬ 
fore the cone generated by revolving the line y + z = 3 about the z-axis. More 
generally, it follows from our discussion of surfaces of revolution in Section 18.5 
that if a curve/(y, z) = 0 is revolved about the z-axis, then the cylindrical equa¬ 
tion of the resulting surface is f(r, z) = 0. 

(b) Since r cos 6 = x and r sin 6 = y, the given equation transforms into 2 jc + 
5y + 3z = 0, which is the plane through the origin with normal vector N = 
2i + 5j + 3k. 


Example 3 Find a cylindrical equation for (a) the spheroid x 2 + y 2 + 2z 2 = 4, 
and (b) the hyperbolic paraboloid z = x 2 — y 2 . 

Solution The equation in (a) transforms at once into r 2 + 2z 2 = 4. For (b), we 
have 

7 2 

z = - y z 

= r 2 cos 2 6 — r 2 sin 2 6 = r 2 (cos 2 6 — sin 2 6) 

= r 2 cos 26, 

so z = r 2 cos 26 is the desired equation. 



Figure 18.40 Spherical coordinates. 


In physics, cylindrical coordinates are particularly convenient for studying sit¬ 
uations in which there is axial symmetry, that is, symmetry about a line in space. 
As examples we mention two important classes of problems: those dealing with 
the flow of heat in solid cylindrical rods, and those concerned with the move¬ 
ments of a vibrating circular membrane—for instance, a drumhead. 

Again consider a point P in space whose rectangular coordinates are (x, y, z). 
The spherical coordinates of P are the numbers (p, 0, 6) shown in Fig. 18.40. 
Here p (the Greek letter rho) is the distance from the origin O to P, so p > 0. 
The angle </> is the angle down from the positive z-axis to the radial line OP, and 
it is understood that (/> is restricted to the interval 0 < (/> < tt. Finally, the angle 
6 has exactly the same meaning in spherical coordinates as it has in cylindrical 
coordinates; that is, 6 is the angle from the positive x-axis to the line OP', where 
P' is the projection of P on the jcy-plane. It is clear from the figure that OP' = 
p sin c/>, and since Jt = OP' cos 6 and y = OP' sin 6 , we have the transformation 
equations 


and 


x = p sin 4> cos 6, y = p sin </> sin 6, z = p cos </>, 


Vjc 2 + y 2 _ y 

tan <t> =-, tan 6 — —. 


p 2 = x 2 + y 2 + z 2 , 
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The term “spherical coordinates” is used because the graph of the equation p = 
a (constant) is a sphere with center at the origin and radius a. The graph of </> = 
a (constant) is the upper nappe of a cone with vertex at the origin and vertex an¬ 
gle a, if 0 < a < 7t/2. The graph of 6 = Go (constant) is a plane containing the 
z-axis, just as in cylindrical coordinates. 


Example 4 Find an equation in spherical coordinates for the sphere x 2 + y 2 + 
z 2 — 2 az — 0, where a > 0. 

Solution Since p 2 = x 2 + y 2 + z 2 and z = p cos 0, the given equation can be 
written as 

p 2 — 2 ap cos (j) — 0 or p(p — 2a cos </>) = 0. 

The graph of this equation is the graph of p = 0 together with the graph of p - 
2 a cos (/> = 0. But the graph of p = 0 (namely, the origin) is part of the graph of 
p = 2a cos 0, so the desired equation is 

p = 2a cos (f). 

This is the sphere of radius a that is tangent to the .ry-plane at the origin, as shown 
in Fig. 18.41. 



Figure 18.41 


Example 5 What is the graph of the spherical equation p = 2a sin </>? 

Solution The variable 8 is missing from this equation, so we have a surface of 
revolution about the z-axis. In the yz-plane the equation p = 2a sin </> represents 
a circle of radius a , as shown in Fig. 18.42. Since the graph we are seeking is 
obtained by revolving this circle about the z-axis, this graph is a torus (dough¬ 
nut) in which the hole has radius zero. 


There are many physical uses of spherical coordinates, ranging from problems 
about heat conduction to problems in the theory of gravitation. We shall discuss 
some of these applications in Chapter 20. 



PROBLEMS 


1 Find a set of cylindrical coordinates for the point whose 
rectangular coordinates are 

(a) (2, 2,-1); (b) (1, - VJ, 7); 

(c) (3, V3, 2); (d) (3, 6, 5). 

2 Find the rectangular coordinates of the point with cylin¬ 
drical coordinates 

(a) (V2,17-/4, -2); (b) (V3, 5 t7/6, 11); 

(C) (1,1,1); (d) (2, 77/3, 77). 

3 Find a set of spherical coordinates for the point whose 
rectangular coordinates are 

(a) (1, 1,V6); (b) (1,-1,-V6); 

(c) (1, 1,V2); (d) (0, -1, V3). 

4 Find the rectangular coordinates of the point with spher¬ 
ical coordinates 


(a) (3, tt/2, tt/ 2); (b) (4, tt/2, 77); 

(c) (4, 77/3, 77/3); (d) (4, 2t7/3, tt/3). 

In Problems 5-11, find a cylindrical equation for the surface 
whose rectangular equation is given. Sketch the surface. 

5 x 2 + y 2 + z 2 = 16. 6 x 2 + y 2 = 6z. 

7 x 2 y 2 = z 2 ‘ 8 x 2 — y 2 — 3. 

9 jc 2 + y 2 — 2y = 0. 10 x 2 + y 2 - 4x = 0. 

11 * 2 + y 2 = 9. 

12 Find a cylindrical equation for the surface whose rec¬ 
tangular equation is z 2 (x 2 — y 2 ) = 4 xy. 

In Problems 13-18, find a spherical equation for the surface 
whose rectangular equation is given. Sketch the surface. 

13 jc 2 + y 2 + z 2 = 16. 
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14 *2 + +z 2 +4z=0. 

15 x 2 +> ?2 + ^ 2— 6z — 0. 

16 *2+ j2 = 9. 


17 z = 4 — x 2 — y 2 . 

18 (^2 + -y2 _|_ ^2)3 = (^2 + -y2^2 


CHAPTER 18 REVIEW: DEFINITIONS, EQUATIONS 


Think through the following. 

1 Rectangular (or Cartesian) coordinates. 

2 Equation of a sphere. 

3 Dot product: definition and formula. 

4 Cross product: definition and formula. 

5 Equations of a line: parametric and symmetric. 


6 Equation of a plane. 

7 Cylinder. 

8 Surface of revolution. 

9 The six quadric surfaces: graphs and equations. 
16 Cylindrical coordinates. 

11 Spherical coordinates. 
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PARTIAL 

DERIVATIVES 


Many of the functions that arise in mathematics and its applications involve two 
or more independent variables. We have already met functions of this kind in our 
study of solid analytic geometry. Thus, the equation z = x 2 — y 2 is the equation 
of a certain saddle surface, but it also defines z as a function of the two variables 
jt and y, and the surface can be thought of as the graph of this function. 

We usually denote an arbitrary function of the two variables x and y by writ¬ 
ing z =/(*, y), and we can visualize such a function by sketching—or imagin¬ 
ing—its graph in jryz-space, as suggested in Fig. 19.1. In this figure, P = (x , y) 
is a “suitable” point in the jcy-plane—that is, a point in the domain D of the func¬ 
tion—and z is the directed distance up or down to the corresponding point on 
the surface. This surface is thought of as lying “over” the domain D, even though 
part of it may actually be below the jcy-plane. 

By an obvious extension of the notation used here, w =/(jc, y, z, r, w, v) is a 
function of the six variables displayed in parentheses. For example, if the tem¬ 
perature T at a point P inside a solid iron sphere depends on the three rectangu¬ 
lar coordinates jc, y, and z of P, then we write T = /(;t, y, z); and if we also al¬ 
low for the possibility that the temperature at a given point may vary with the 
time r, then T is a function of all four variables, T = /(;c, y, z, t). 

In this chapter we shall see that the main themes of single-variable differen¬ 
tial calculus—derivatives, rates of change, chain rule computations, maximum- 
minimum problems, and differential equations—can all be extended to functions 
of several variables. However, students should be prepared for the fact that there 
are striking differences between single-variable calculus and multivariable cal¬ 
culus. Since most of these differences already show up in functions of only two 
independent variables, we usually emphasize this case, and refer more briefly to 
functions of three or more variables. In the next chapter we turn to the integral 
calculus of functions of several variables. 

DOMAIN 

Just as in our previous work, the domain (or domain of definition) of a function 
z = /(jc, y) is the set of all points P = (jc, y) in the jcy-plane for which there ex¬ 
ists a corresponding z, and similarly for functions defined in jcyz-space, jcyz^- 
space, etc. Most of the functions we deal with are defined by formulas, and in 
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= (x, y ) 

Figure 19.1 A surface in space. 
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these cases the domain is understood to be the largest set of points for which the 
formula makes sense. For example, the domain of 


z = /(x,y) 


1 

*-y 


is understood to be the set of all points (x, y ) with x y, that is, all points in the 
xy -plane that do not lie on the line y = x. The domain of 

z = g(x, y) = V9 - x 2 - y 2 

is the set of all points (x, y) for which 9 - x 2 - y 2 > 0, that is, the circular disk 
x 2 + y 2 < 9 of radius 3 with center at the origin. And the domain of 


w = ft(x,y, z ) 


2x + 3y + 4 z 
x 2 + y 2 + z 2 


is the set of all points (x, y, z) for which x 2 + y 2 + z 2 =£ 0, that is, all points of 
xyz-space except the origin. 

In discussing a general function z = /(x, y), we shall often require that this 
function be defined at a certain point P o and throughout some neighborhood of 
this point. This means that the domain of /(x, y) must include not only Pq itself, 
but also every point “sufficiently close” to Po , that is, every point in some small 
circular disk centered on Po- Similar remarks apply to functions defined in xyz- 
space, etc. 


CONTINUITY 


There are several places in this chapter where it will be necessary to mention 
continuity in order to state things correctly. This concept extends in a natural way 
from the one-variable case to functions /(x, y), as follows. 

A function/(x, y) is said to be continuous at a point (xo, Jo) in its domain if 
its value/(x, y) can be made as close as we please to /(xo, yo) by taking the point 
(x, y) close enough to (xo, JoX that is, if |/(x, y) — fix o, yo)| can be made as small 
as we please by making both \x - xo\ and |y - yo| small enough. For example, 
f(x , y) = xy is continuous at any point (xo, yo), because 

\xy ~ -Wol = \xy - xvo + xy 0 - x 0 y 0 | 

= |x(y - y 0 ) + yo(x - x 0 )| 

^ Wl y — Tol + boll* “*o|. 

and it is easy to see that the quantity last written can be made as small as we 
please by making both |x — xo| and |y — yo| small enough. 

On the other hand, the function defined by 


/(x, y) = 


xy 

X 2 + y 2 
0 


if (x, y) * (0, 0), 
if (x, y) = (0, 0), 


0 ) 


is not continuous at the origin (0, 0). For, if we let (x, y) approach (0, 0) along a 
line y = mx with m =£ 0, then 


fix, y) = 


mx 2 

x 2 + m 2 x 2 


m 

1 + m 2 ’ 


( 2 ) 
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which is a nonzero constant, and these values cannot be made as close as we 
please to/(0, 0) = 0 by making (jc, y) close enough to (0, 0). To express this in 
another way, (2) shows that the values of the function approach different limit¬ 
ing values as the point (jc, y) approaches the origin from different directions, and 
this is impossible if the function is continuous at the origin. 

We shall not pursue the details of this topic any further, beyond making the 
rather loose statement that any finite combination of elementary functions is con¬ 
tinuous at each point of its domain. Also, continuity is defined in essentially the 
same way for functions of three or more variables. 

LEVEL CURVES 

Many simple functions z — /(jc, y) have graphs that are much too difficult to 
sketch. Fortunately there is another way to understand and express the geomet¬ 
ric nature of such a function. 

The basic idea comes from the art of the mapmaker. In mapping terrain with 
valleys, hills, and mountains, it is common practice to draw curves joining points 
of constant elevation. When these curves are included on a map and properly la¬ 
beled, the resulting topographical map enables an experienced user to obtain a 
clear mental picture of the contours of the land in three-dimensional space from 
this two-dimensional representation. 

We can do the same thing to portray a function z =/(j c, y) of two variables. 
For any value c that /(jc, y) assumes, we can sketch the curve 

f(x, y) = c 

in the Ary-plane, as shown in Fig. 19.2. Such a curve is called a level curve ; it lies 
in the domain of the function, and on it z = /(jc, y) has the constant value c. 

A collection of level curves is called a contour map\ it can give a good idea 
of the shape of the graph, and is the next best thing to a three-dimensional sketch. 
For instance, the graph of z = xy is difficult—though not impossible—to draw. 
However, a reasonably clear idea of the shape of this graph is given by the con¬ 
tour map shown in Fig. 19.3, which is easy to draw. Each level curve jcy = c is 
a hyperbola in the first and third quadrants if c > 0, a hyperbola in the second 
and fourth quadrants if c < 0, and the two axes taken together if c = 0. We as¬ 
cend the surface as we leave the origin going into the first and third quadrants, 




Level curves. 
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Figure 19.3 


and descend it as we leave going into the second and fourth quadrants, and in 
this way we see that the origin is the saddle point of a saddle surface. Students 
should try to use this figure to visualize the shape of the surface as it appears in 
three-dimensional space, looking down on it from above. 


LEVEL SURFACES 

Drawing graphs for functions of two variables is often difficult, but drawing 
graphs for functions of three variables is always impossible. We would need a 
visible space of four dimensions to contain such a graph, and no such space is 
available. 

However, the concept of level curves suggests a way to visualize the behavior 
of a function w =/(; c, y, z) of three variables: examine its level surfaces. These 
are the surfaces 


/(*, y,z) = c 


(3) 


z 



Figure 19.4 Level surfaces. 


for various values of the constant c. Of course, level surfaces can be hard to draw, 
but a knowledge of what they are can help us form a useful intuitive idea of the 
nature of the function. In Fig. 19.4 we present a schematic view of three adja¬ 
cent level surfaces of the form (3) for three values of the constant c, where 
c\ < C 2 < C 3 . As a point P = (jc, y, z) moves along the lowest surface, the value 
of w = /(;c, y, z) is constantly equal to c\ \ but as this point hops to the next sur¬ 
face above it, the value of the function increases to C 2 ; and so on. 

We consider two simple examples. In the case of the function w = x + 2y + 
3z, the level surfaces are easily seen to be the planes 

x + 2y + 3z = c 

with normal vector N = i + 2j + 3k; and for w = Vjc 2 + y 2 + z 2 , the level sur¬ 
faces are the concentric spheres 

x 2 + y 2 + z 2 = c 2 . 


In applications, if the function w =f(x,y, z) represents the temperature at the 
point P = (x, y, z), then the level surfaces are called isothermal surfaces ; if it rep¬ 
resents potential, they are called equipotential surfaces. 


HIGHER DIMENSIONS 

Level surfaces have a certain limited value, but in a sense they avoid the real 
question: How do we go about trying to obtain an intuitive understanding of the 
behavior of functions of three or more variables? 

Briefly, what we do is work by analogy with the one- and two-variable cases. 
For example, there is nothing to prevent us from considering the set of all quadru¬ 
ples of numbers such as (2, —3, 1, 4) as forming a perfectly legitimate four¬ 
dimensional space, with an origin ( 0 , 0 , 0 , 0 ), four coordinate axes, and a satis¬ 
factory concept of the distance from an arbitrary point (jc, y, z, w) to the origin, 

d = Vjc 2 + y 2 + z 2 + w 2 . 

We can now consider the graph of a function 

w =/(*, y, z) 
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as forming a three-dimensional “surface” in this four-dimensional space, with 
the domain D of the function lying in the three-dimensional “coordinate plane” 
consisting of all points of the form ix, y, z, 0 ). 

In a similar way, if n is any positive integer we can think of the graph of a 
function of n variables as forming an rc-dimensional “surface” in (n + 1 ^di¬ 
mensional space. It is true that for n > 3 we can no longer draw pictures, but we 
can still bolster our intuition by using geometric language, and we can still think 
geometrically, but in a looser way. However, as we move further away from the 
kind of mathematics that we can study and understand by drawing pictures, it is 
necessary to give more attention to the algebraic and analytic aspects of what we 
are doing, in order to avoid being misled by words and analogies. Nevertheless, 
the words, analogies, and geometric intuition remain indispensable, for they sug¬ 
gest worthwhile things to think about and prevent us from feeling totally lost 
among abstractions. 


PROBLEMS 


In Problems 1-12, find the domain of the given function. 


1 fix, y) 


= 

y ~ 2x 


2 /0t,y) = - + - 

x y 

3 fix,y) = Vxy. 

4 f(x,y) = • 


(e x + e >') 2 ' 

5 fix, y) = In (y - 3*). 

6 f(x,y, z) = Vx 2 + y 2 + z 2 . 

7 fix, y, z) = —====. 

Vx 2 + y 2 + z 2 

8 fix, y, z) = 4x i Z _ y2 ■ 

9 fix, y, z) = Vl 6 — x 2 — y 2 - z 2 . 

10 fix, y, z)=— ■ 

11 fix, y, z) = xy In z + 3 tan \z. 

12 fix, y, z) = In ix 2 + y 2 + z 2 ~ 1 ). 


13 Show that the function defined by 


fix , y) 


xy 

Vjc 2 -I- y 2 
0 


if ix, y) * ( 0 , 0 ), 


iff^,y) = ( 0 , 0 ), 


is continuous at the origin. Hint: Use* = r cos 8 and y — 
r sin 8 to transform to polar coordinates. 


In Problems 14-24, represent the given function by drawing 
a few level curves, and try to visualize the surface from the 
resulting contour map. 

15 
17 
19 
21 
23 


14 

16 

18 

20 

22 

24 


y 2 - 


Z = X 2 + 

Z = X + y. 
z=2x~y. 
z= X 3 - y. 
z = y/x 2 . 

z = V * 2 - 


z = x 2 + 2 y 2 . 
z = x - y. 

r 2 - „ 


Z = X * 
z - y/x. 


y 


In each of the following problems, sketch a few level surfaces 
for the given function and use these to estimate the general 
direction in which the values of the function increase. 


25 

X 2 ^ z 2 

26 

1 

4 + 9 + l 6 ' 

W _|_ -y2 _|_ ^2 ' 

27 

w = 2x — 5y 4- 3z. 

28 

w = x 2 + y 2 — z 2 . 


Suppose that y = fix) is a function of only one variable. We know that its de¬ 
rivative, defined by 

dy ,. fix + Ax)-fix) 

— = lim ---, 

dx Ax->o A* 
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can be interpreted as the rate of change of y with respect to x. In the case of a 
function z = fix, y) of two variables, we shall need similar mathematical ma- 
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chinery for working with the rate at which z changes as both x and y vary. The 
key idea is to allow only one variable to change at a time, while holding the other 
fixed. For functions of more than two variables, we vary one of them while hold¬ 
ing all the others fixed. Specifically, we differentiate with respect to only one 
variable at a time, regarding all the others as constants, and this gives us one de¬ 
rivative corresponding to each of the independent variables. These individual de¬ 
rivatives are the constituents from which we build the more complicated ma¬ 
chinery that will be needed later. 

To return to our function z = /(;c, y) of two variables, we first hold y fixed and 
let x vary. The rate of change of z with respect to x is denoted by dz/dx and de¬ 
fined by 


Jz = )im /(■* + As, y) ~ fix, y) 
dx a*-»o Ax 


This limit (if it exists) is called the partial derivative of z with respect to jc, and 
is read “partial z, partial x” The most commonly used notations for this partial 
derivative are 

dz df 

dx’ Zx ' dx’ fx ' fx(x ’ y)y 


and we shall use all of these from time to time in order to help students become 
accustomed to them. The symbol d in the notation dzJdx is called the “round- 
back d” or “curly d”; it is used to emphasize that there are other independent 
variables present during the process of differentiating with respect to x. 

Similarly, if x is held fixed and y is allowed to vary, then the partial deriva¬ 
tive of z with respect to y is defined by 

dz y fix, y + Ay) - fix, y) 

~=r~ = lirn ---, 

cry Ay—»o fry 

and the standard notations in this case are 

— z f y) 

3 / 3 y ' J y y 

The actual calculation of partial derivatives for most functions is very easy: 
Treat every independent variable except the one we are interested in as if it were 
a constant, and apply the familiar rules. 


Example 1 Calculate the partial derivatives df/dx and df/dy of the function 
fix , y) = x 3 — 3x 2 y 3 + y 2 . 


Solution To find the partial of/with respect to jc, we think of y as a constant 
and differentiate in the usual way, 

£■ 

When we regard x as a constant and differentiate with respect to y, we obtain 

§ = -9*V + 2y. 
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The notations f x (x , y) and f y {x , y) are useful for indicating the values of partial 
derivatives at specific points. 

Example 2 (a) If/Qc, y) = xy 1 + Jt 3 , then 


/*(*, >0 = y 2 + 3x 2 , fy(x, y) = 2xy, 

M 2,D =13. //2, 1)=4. 


In the other notation, the numerical values given here by the simple and conve¬ 
nient symbols f x (2 , 1) and f y (2 , 1) would have to be written more clumsily as 




and 


(b) If gO, y) = xe^ 2 , then 

gx(*> y) — xy 2 e xy2 + e g-yOc, y) = 2 x 2 ye^ 2 . 

(c) If h(x, y) = sin x 2 cos 3^, then 

h x (x, >^) = 2x cos x 2 cos 3 y, h y (x, y) = —3 sin x 2 sin 3 y. 

These examples illustrate the fact that the partial derivatives of a function of 
x and y are themselves functions of x and >\ 

These ideas and notations apply just as easily to functions of any number of 
variables. 

Example 3 If w = f(x , y, z, w, v) = .xy 2 + 2x 3 + xyz + zu + tan uv , then 



In the one-variable case, we know that the derivative dy/dx can legitimately be 
thought of as a fraction, the quotient of the differentials dy and dx. The notation 
dzldx for the partial derivative f x (x,y) suggests that something similar might be 
done with dz and dx. However, it is not possible to treat partial derivatives as frac¬ 
tions. We give an example to emphasize this point. 

Example 4 The ideal gas law states that for a given quantity of gas, the pres¬ 
sure p, volume V, and absolute temperature T are connected by the equation p V = 
nRT , where n is the number of moles of gas in the sample and R is a constant. 
Show that 


dp dV 3T 
dV dT dp 


Solution Since 


nRT 


P 



dp_ = _nRT_ dV_ _ nR_ dT = _V_ 
dV V 2 ’ dT p' dp nR' 


we have 
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It follows that 


dp dV dT / nRT\ nR V nRT _ 

dV dT dp ~ [ V 2 j p nR pV 

The fact that this result is - 1 instead of + 1 shows that we cannot treat the par¬ 
tial derivatives on the left as fractions. 



When we are working with a function z = f(x,y) of only two variables, the 
partial derivatives have the following simple geometric interpretation. The graph 
of this function is a surface, as shown in Fig. 19.5. Let (jcq, To) be a given point 
in the xy-plane, with (*o, To> zo) the corresponding point on the surface. To hold 
y fixed at the value yo means to intersect the surface with the plane y — yo, and 
the intersection is the curve 


z =f{x,y 0 ) 


in that plane. The number 




= fx ( A 0, Vo) 


V / W 

is the slope of the tangent line to this curve at x = Xq. Thus, in the figure we have 


tan 




dx )(x 0 ,X 0 ) 


= fx(x 0 , >’o)- 


Similarly, the intersection of the surface with the plane x = xo is the curve 

Z = f{x 0 , y). 


and the other partial derivative is the slope of the tangent to this curve at y = yo> 

tan/3=(| i ) =/v(-v 0 ,Vo). 

V oy /Cr 0 ,v 0 ) 


No such interpretation is available when there are more than two independent 
variables. 

We remarked that for a function z =f(x,y) of two variables, the partial de¬ 
rivatives f x and f y are also functions of two variables, and may themselves have 
partial derivatives. As we might expect, these second-order partial derivatives 
are denoted by several symbols. If we start with the first derivatives 


dx 


-fx 


and 


3L mf 

By h ' 


then the derivatives with respect to x are 


±m.£L.± f ., 
3x\di) a<’ a* 


and 


dx\dy) dy dx J} 

and the derivatives with respect to y are 

3y\3xj tjv dt V 
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and 


±(2C\ = £L,± /mf 

iy[iyj }y ! if’ 


This notation may seem a bit confusing at first, but it is actually quite reason¬ 
able. Observe that in f yx we differentiate first with respect to the “inside” vari¬ 
able y, then with respect to the “outside” variable x. This is the natural order, 
since f yx ought to mean (f y ) x . Thus, in the symbols/,* and f xy , the subscript let¬ 
ters accumulate from left to right, because this is the order in which the differ¬ 
entiations are performed. For the same reason, in the symbols 


a 2 f 

dx dy 


and 


dy dx ’ 


it is natural for the letters indicating the variable of differentiation to accumulate 
from right to left: first y, then x in the first of these; and first jc, then y in the sec¬ 
ond. 

The pure second partial derivatives, 

, _ d 2 f , . d 2 f 

f xx and dy2 , 

don’t represent anything really new. Each is found by holding one variable con¬ 
stant and differentiating twice with respect to the other variable, and each gives 
the rate of change of the rate of change of / in the direction of one of the axes. 


Example 5 If f(x , y) = x 3 e 5y + y sin 2x, then 

fx = 3x 2 e 5y + 2y cos 2x, f y = 5x 3 e 5y + sin 2x, 
fxx — 6xe 5y — 4y sin 2x, f yy = 25x 3 e 5y . 


On the other hand, the mixed second partial derivatives, 

- _ d 2 f , _ d 2 f 

fxy dydx and fyx dxdy’ 

represent new ideas. The mixed partial derivative/^ gives the rate of change in 
the y-direction of the rate of change of/in the jc-direction, and f yx gives the rate 
of change in the jc-direction of the rate of change of/in the y-direction. It is not 
at all clear how these two mixed partials are related to each other, if indeed they 
are related at all. 


Example 5 (continued) For the function being considered, /(;t, y) = x 3 e 5y + 
y sin 2 jc, we easily see that 

f x = 3x 2 e 5y + 2 y cos 2x , f y = $x 3 e 5y + sin 2 x, 

fxy = \5x 2 e 5y + 2 cos 2x , f yx = \Sx 2 e 5y + 2 cos 2x. 

For the particular function considered in this example, we obviously have 

fxy = fyx* 0 ) 


a 2 / = a 2 / 

dy dx dx dy 9 


or equivalently, 
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so the order of differentiation seems to be unimportant—at least in this case. But 
this is not an accident, and ( 1 ) is true for almost all functions that normally arise 
in applications. More precisely, ifbothfxy andf yx exist for all points near (*o, yo) 
and are continuous at (jcq, yo), then 

fxy(x 0 , yo) = fyxix 0 , ?())• 

A proof of this statement is given in Appendix A. 17. 

Partial derivatives of order greater than two, as well as higher-order deriva¬ 
tives of functions of more than two variables, are defined in the obvious way. 
For example, if w =/(;c, y, z), then 

dxdydz dx \ dy dz) (Jzy)x hyx ' 

dz dy dx 2 dz\dydx 2 ) (Jxxyh fxxyz ’ 

etc. In general, with suitable continuity, it is immaterial in what order a sequence 
of partial differentiations is carried out, for by ( 1 ) we can reverse the order of 
any two successive differentiations. For example, f^y Z = fxyxz = fxyzx = f yX zx = 

fyzxx- 


PROBLEMS 


In Problems 1-14, find dz/dx and dz/dy. 


1 z = 2x + 3y. 

3 z—&- 
* 3x+\ 

5 z = x 2 sin y. 


2 z = 5 x 2 y. 

4 z = y cos x. 

6 z = tan 3* + cot 4y. 


7 z = x tan 2y + y tan 3*. 8 z = sin xy. 


9 z = cos (3* ~ y). 


10 z=xye xy . 


11 z = e x sin y. 
13 z = e y In x 2 . 


x 

12 z = tan 1 —. 

y 

14 z = In (3* + y 2 ). 


In Problems 15-18, find the partial derivatives with respect to 

x , y, and z. 

15 w = x 2 y 5 z 7 . 16 w = sin -1 — . 

xy 

17 w = x \n 18 w = e x7+ y )+zt . 

z 

19 Consider the surface z = 2x 2 + y 2 . 

(a) The plane y = 3 intersects the surface in a curve. Find 
the equations of the tangent line to this curve at x = 
2 . 

(b) The plane* = 2 intersects the surface in a curve. Find 
the equations of the tangent line to this curve at y = 
3. 

20 Consider the surface z = x 2 /( y 2 - 3). 

(a) The plane y = 2 intersects the surface in a curve. Find 
the equations of the tangent line to this curve at * = 
3. 


21 


22 

23 


(b) The plane * = 3 intersects the surface in a curve. Find 
the equations of the tangent line to this curve at y = 2 . 
Show that all of the following functions z = /(*, y) sat¬ 
isfy the equation + y~^ = 0: 


(a) z = £ ; 

y 


(b) z = 


x 

x + y* 


(.1.-4 «>*~r£-r 

x l x 5 + y 3 

If z = ye x,y , show that xz x + yz y = z. 

If z = * 5 - 2 * 4 y + 5* 2 y 3 , show that 


dz , dz c 
x d^ + y d^ = 5z - 


In Problems 24-28, verify that d 2 z/dx dy = d 2 z/dy dx. 

_^ jc 

24 z = tan 1 —. 

y 

25 z = In (* + 5y). 

26 z = e xy cos (y — 2*). 

27 z =/(*)g( y). 

28 z = x 3 tan 2x esc 3y 4 sin -1 V * 2 + 1. 

29 Show that each of the following functions satisfies 
Laplace's equation d 2 f/dx 2 + d 2 fldy 2 = 0: 

(a) /(*, y) = In (* 2 + y 2 ); (b) /(*, y) = e x sin y; 

(c) /(*, y) = e~ 3x cos 3y; (d) /(*, y) = tan -1 
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30 Show that each of the following functions satisfies the 
wave equation a 2 cPfldx 2 = d 2 f/dt 2 : 

(a) f(x, t) = (x + at) 3 ', (b) f(x, t) = (x- at) 5 -, 

(c) f(x, t) = sin (x + at); (d) f(x, t) = e x ~ at . 

31 Find a function f(x, y) such that 


3 £ = 

dx 


■?- = 3y 2 - 2x cos y and = 6 xy + x 2 sin y + 2. 


dy 


It is sometimes convenient to define functions by integrals of 
the form 


F(x) = [ /(*, y) dy. 

Ja 


It is usually possible to calculate the derivative F'(x) of such 
a function by “differentiating under the integral sign”: 




y) 


dy* 


32 Verify the formula just stated in the following cases: 

(a) a = 0 , b = l,f(x,y) = x + y; 

(b) a = 0 , b = 1 ,/(*, y) = x 3 }' 2 + x 2 y 3 ; 

(c) a = 0 , b = tr, f(x, y) = sin xy. 


*A proof is given in Appendix A. 18. 


The concept of a tangent plane to a surface corresponds to the concept of a tan¬ 
gent line to a curve. Geometrically, the tangent plane to a surface at a point is 
the plane that “best approximates” the surface near the point. It will be neces¬ 
sary for us to think rather carefully about what this means, because—as we shall 
see in Sections 19.5 and 19.6—weighty practical consequences depend on it. 

Consider a surface z = f{x , y), as shown in Fig. 19.6. As we pointed out in 
Section 19.2, the plane y = yo intersects this surface in a curve C\ whose equa¬ 
tion is 
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z =/(x, yo), 

and the plane x= xq intersects it in a curve Ci whose equation is 

z =f(xo, t); 

and the slopes of the tangent lines to these curves at the point P o = (xo, yo, Zo ) 
are the partial derivatives 

Mx o,yo) and f/x 0 ,y 0 ). ( 1 ) 


Z 



X 


Figure 19.6 The tangent plane. 
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Figure 19.7 


These two tangent lines determine a plane, and, as Fig. 19.6 suggests, if the sur¬ 
face is sufficiently smooth near Po, then this plane will be tangent to the surface 
at Pq. 

It is important to be quite clear about what we mean by a tangent plane, so 
we give a definition. In this context, where Po is a point on a surface z = /(x, y), 
let Pbe a plane through Pq and let P be any other point on the surface. If, as P 
approaches Po along the surface, the angle between the segment PoP and the 
plane T approaches zero, then T is called the tangent plane to the surface at Po. 

It is easy to see that a surface need not have a tangent plane at a point Po- A 
very simple example is provided by the half-cone z = Vx 2 4- y 2 shown in Fig. 
19.7. It is clear that no plane is tangent to this surface at the origin. In this case 
the curves C\ and C 2 have no tangent lines at the origin, and the partial deriva¬ 
tives (1) do not exist there. However, even when the curves C\ and C 2 are smooth 
enough to have tangent lines at Po, the surface may still not have a tangent plane 
atPo, because of nonsmooth behavior near Pq in the regions between C\ and C 2 . 
In Section 19.4 we discuss a vital lemma to the effect that this cannot happen if 
the partial derivatives/*(x, y) and/y(x, y) exist at all points in some neighborhood 
of (xq, yo) and are continuous at (xo, yo) itself. 

Meanwhile, we assume that the tangent plane exists at Po, and we develop a 
method of finding its equation. Since the point Po = (xo,yo, Zo) lies on this tan¬ 
gent plane, we know that the equation has the form 

a(x - x 0 ) + b(y - y 0 ) + c(z - zo) = 0, (2) 

where N = a\ + b\ + ck is any normal vector. It remains to find N, and to do 
this we use the cross product of two vectors V 1 and V 2 that are tangent to the 
curves C\ and C 2 at Po (see Fig. 19.6). To find Vj, we use the fact that along the 
tangent line to C], an increase of 1 unit in x produces a change/^(xo, yo) in z, 
while y does not change at all. Thus, the vector 

V, = i + 0-j +/c(x 0 , y 0 )k 


is tangent to C 1 at Po. Similarly, the vector 

V 2 = 0 • i + j +/ v (xo, yo)k 


is tangent to C 2 at Po. Since V] and V 2 lie in the tangent plane, we are now able 
to obtain our normal vector N by calculating 


N = V 2 X V, 


i 

0 

1 


j 

1 

0 


k 


fyU 0, yo) 
fx(x 0 , y 0 ) 


= /X*o,yo)i +/v(x 0 ,y 0 )j - k. 


(3) 


(The order of factors in this cross product is chosen only for convenience, to pro¬ 
duce one minus sign in the result instead of two.) When the components of (3) 
are inserted in ( 2 ), we see that the desired equation is 

M* o, yo)U - x 0 ) + f y (x 0 , y 0 )( y - yo) - U - so) = 0 , 


or equivalently. 


z ~ zo = f x (x o, yo)U - x 0 ) + fy(x 0t y 0 )(y - yo)- 


(4) 
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Example 1 Find the tangent plane to the surface 

z - f(x, y) = 2 xy 3 - 5x 2 

at the point (3, 2, 3). 

Solution The first step should be to check that this point actually lies on the 
given surface, and we assume that this has been done. Here we have f x = 2y 3 — 
10t and f y = 6 xy 2 , so f x { 3, 2) = —14 and / v (3, 2) = 72. The equation of the 
tangent plane is therefore 

z- 3 = -14(x - 3) + 72(y — 2). 


Tangent planes to surfaces where z is not explicitly given as a function of jc 
and y will be discussed in Section 19.5. However, we can get a preliminary idea 
of what to expect by applying our present method to simple cases. 

Example 2 Find the tangent plane to the sphere 

Jt 2 + y 2 + z 2 = 14 (5) 

at the point (1,2, 3). 

Solution Even though this sphere is not a surface of the form z = f (x, y), it can 
be thought of as a combination of two such surfaces, the upper and lower hemi¬ 
spheres. By solving (5) for z, we see that the upper hemisphere is given by 

z = /(*, y) = Vl4 - Jt 2 -y 2 , 


so 


fx = —j= and f y = 

Vl4 — x 2 — y 2 Vl4 — x 2 — y 2 

These formulas give 

fx( 1,2) = —j and / > .(1,2) = -|, 

so the equation of the tangent plane is 

z-3 = -}(*- !)-!()--2), 


In this example we solved equation (5) explicitly for z, then proceeded as be¬ 
fore. An alternative method that is often easier is to assume that the given equa¬ 
tion defines z implicitly as a function of jc and y, and to find the partial deriva¬ 
tives by implicit differentiation. With this method we use equation (4) in the 
slightly different form 

z - zo = (l), 0 ( "- jro) + (|), 0 (y_>o) ’ (6) 

where the coefficients are written this way because dz/dx and dz/dy need not de¬ 
pend only on jc and y. 
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Example 3 To find the tangent plane of Example 2 by the method just suggested, 
we first hold y fixed and differentiate (5) implicitly with respect to x , which gives 

lx + 2z^- = 0 , 


so dzJdx = — x/z . Similarly, dz/dy = -y/z . At the point Po = (1, 2, 3), these par¬ 
tial derivatives have the numerical values 


If) =-{ and (4 = 

dx)p 0 3 \dy)p 0 

so by ( 6 ) the tangent plane is 

z ~ 3 = ~j(x - 1 ) - f(y - 2 ), 


2 _ 

3’ 


just as before. Of course, this method is of particular value when the equation of 
the surface is difficult or impossible to solve for z . 


PROBLEMS 


In Problems 1-10, find an equation for the tangent plane to 
the given surface at the indicated point. 

1 z = {x 2 + y 2 ) 2 , (1,2,25). 

2 z = 4xy, (4, j, 4). 

3 z = sin * + sin 2y + sin 3(x + y), (0, 0,0). 

4 z = * 2 + *y + y 2 - 10y + 5, (3, 2, 4). 

5 z = * 2 -2y 2 , (3,2, 1). 


6 


2jc + y 
a: - 2y * 


(3, 1, 7). 


7 z = e y cos a:, (0, 0, 1). 



9 jty 2 + yz 2 + zx 2 = 25, (1, 2, 3) 

10 z 3 + jtyz = 33, (1,2,3). 

11 Let Po = Uo, yo, zo) with zo > 0 be a point on the sphere 

x 2 + y 2 + z 2 = a 2 

Show that the tangent plane at this point is perpendicu¬ 
lar to the radius vector to the point, in agreement with 
the definition given in geometry. 

12 Use implicit differentiation to show that the equation of 
the tangent plane to the sphere 

* 2 + y 2 + z 2 = a 2 

at the point P 0 = (* 0 , yo, Zo) is xqx + yoy + Zoz = a 2 . 

13 Use implicit differentiation to find the equation of the 
tangent plane to the ellipsoid 



at the point P 0 = Uo, yo, zo)* 

14 Let a be a positive constant and consider the tangent 


plane to the surface jcyz = a at a point in the first octant. 
Show that the tetrahedron formed by this plane and the 
coordinate planes has constant volume, independent of 
the point of tangency. What is this volume? 

15 The angle between two surfaces at a common point is 

the smallest positive angle between the normals to these 
surfaces at this point. Find the angle between z = — 

1 and z = In V * 2 + y 2 at ( 0 , 1 , 0 ). 

16 If Po = U 0 , y 0 , zo) is a point on the curve of intersection 

of two surfaces z = /(*, y) and z = y), devise a 

method for finding a tangent vector to this curve at Po- 
Apply this method to find a vector tangent to the curve 
of intersection of the cone z 2 = 3 * 2 + 4y 2 and the plane 
3* - 2 y + z = 8 at the point Po = ( 2 , 1 ,4). 

17 If a surface has an equation of the form z = xf(x!y), show 
that all of its tangent planes have a common point. What 
is this point? 

18 I f Po = Uo, yo, zo) is a point on the cone z 2 = a(x 2 + y 2 ) 
other than the vertex, show that the tangent plane at Po 
has zoz = a(x<>x + y 0 y) as its equation. Conclude that 
every such plane passes through the vertex. Show that 
the normal line at Po has 

* = *o + ax 0 t, y = yo + ayot, z = zo - zot 
as parametric equations. 

19 On the cone in Problem 18, consider all points of fixed 
height h above the xy-plane and draw normal lines at 
these points. Show that the points where these lines in¬ 
tersect the ;ty-plane form a circle, and find the radius of 
this circle. 

20 Let normal lines be drawn at all points on the surface 
z = ax 2 + by 2 which are at a given fixed height h above 
the jcy-plane, and find the equation of the curve in which 
these lines intersect the ^y-plane. 
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Most of calculus can be understood by using geometric intuition mixed with a 
little common sense, without getting bogged down in the underlying theory of 
the subject. In a few places, however, this theory is inescapable, because with¬ 
out it there is no way to grasp what is going on in the main developments of the 
subject itself. This is true for infinite series and the theory of convergence. It is 
also true for the topics of the next two sections—directional derivatives and the 
chain rule—which cannot be understood without a certain degree of attention to 
the theoretical issues that we now briefly discuss. 

In order to see what these issues are, we begin by considering a function y = 
/(x) of one variable that has a derivative at a point x 0 . If Ax is an increment that 
carries xo to a nearby point xo + Ax (see Fig. 19.8), we are interested in the cor¬ 
responding increment in y, 

A y =/(x 0 + Ax) -f{x 0 ). 

The definition of the derivative f\x o) is 

/'(x 0 )= lim (1) 

&x ->0 Ax 

and this can be written in the equivalent form 

^: =/'(* o) + e, 

where e ^ 0 as Ax —> 0. Accordingly, with no further hypotheses than the as¬ 
sumed existence of the derivative (1), we can write the increment Ay in the form 

Ay = /'(xo) Ax + € Ax, where e —> 0 as Ax —» 0. (2) 

The situation is entirely different for a function of two (or more) variables, as we 
now explain. 

Consider a function z = /(x, y) and let (xo, yo) be a point at which the partial 
derivatives f x (xo, yo) and f y (x o, yo) both exist. The increment in z produced by 
moving from (xo, yo) to a nearby point (xo + Ax, yo + Ay) is 

Az =/(x 0 + Ax, y 0 + Ay) -/(x 0 , yo), 

as shown in Fig. 19.9. In order to develop the tools we shall need in Sections 
19.5 and 19.6, it will be necessary to express Az in a form analogous to (2), 


z 
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Figure 19.8 Differentials dx and dy. 


Figure 19.9 The differential dz. 
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Az = f x (x o, Vo) Ax + / v (x 0 , Vo) Av + €j Ax + e 2 Ay, (3) 

where e\ and 62 —» 0 as Ax and Av —> 0. Unfortunately, in sharp contrast to the 
one-variable case, the mere existence of the partial derivatives f x and f y at (xq, Vo) 
is not enough to guarantee the validity of (3). Sufficient conditions for this con¬ 
clusion are given in the 


Fundamental Lemma Suppose that a function z =/(x, y) and its partial derivatives 
f x andfy are defined at a point (xo, Vo). and also throughout some neighborhood of this 
point. Suppose further that f x and f y are continuous at (xo, Vo)- Then the increment A z 
can be expressed in the form (3), where e\ and 62 ^ 0 as Ax and Ay —» 0. 


This statement is called a lemma for the usual reason: its significance lies not 
in itself, but rather in the use that can be made of it elsewhere. A proof is given 
in Appendix A. 19. 

We do not wish to linger on these matters, but nevertheless a few brief remarks 
are in order. 

Remark 1 In the case of a function of one variable, (1) and (2) are equivalent, 
and if either condition holds it is customary to denote Ax by dx and to write dy = 
/'(x 0 ) dx , so that dy is the change in y along the tangent line. A function z = 
/(x, y) for which f x (x 0 , yo) and / v (x 0 , Vo) both exist is said to be differentiable at 
(xo, Vo) if the conclusion of the lemma is valid—so that more is required than 
merely the existence of the partial derivatives. In this case—and only in this case! 
—we denote Ax and Ay by dx and dy , and we define the differential dz by* 

dz = /.(xo, yo) dx + /v(x 0 , yo) dy. 

Under these circumstances it can be proved that the surface z = /(x, y) has a tan¬ 
gent plane at (xo, Vo, Zo) and that dz is the change in z along this plane, as sug¬ 
gested in Fig. 19.9. The differential dz is usually written in the equivalent forms 

, dz . . dz , df df 

dz = r^dx + — dy or df = -r ~dx + dy . 

dx dy dx dy 

Remark 2 A function z =/(x, y) which is differentiable at a point is automati¬ 
cally continuous there. This follows at once from (3), which shows that 
Az —» 0 if Ax and Ay —> 0. In the single-variable case, we know that if a func¬ 
tion has a derivative at a point, then it is necessarily continuous there. However, 
this is not true for functions of more than one variable: the mere existence of the 
partial derivatives f x and f y at a point does not imply the continuity of /(x, y) at 
that point. This is shown by the example of the bizarre function discussed in Sec¬ 
tion 19.1, for which/ x (0, 0) = /,(0, 0) = 0 and yet the function is discontinuous 
at (0, 0). 

The concepts of a differentiable function and its differential, and also the Fun¬ 
damental Lemma, can be extended in an obvious way to functions of any finite 
number of variables. This would involve much additional writing but no new 
ideas, and we shall not burden the reader with the details. 


"Sometimes dz is called the total differential. 
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Let/(x, y, z) be a function (of three variables!) defined throughout some region 
of three-dimensional space, and let P be a point in this region. At what rate does 
/ change as we move away from P in a specified direction? In the directions of 
the positive x-, y-, and z-axes, we know that the rates of change of/are given by 
the partial derivatives df/dx, df/dy , and df/dz. But how do we calculate the rate 
of change off if we move away from P in a direction that is not a coordinate di¬ 
rection? In analyzing this problem, we will encounter the very important concept 
of the gradient of a function. 

Suppose that the point P under consideration has coordinates x, y, and z, so 
that P = (x, y, z); let R = xi + yj + zk be the position vector of P, and let the 
specified direction be given by a unit vector u, as shown in Fig. 19.10. If we 
move away from P in this direction to a nearby point Q = (x + At, y + Ay, z + 
Az), then the function / will change by an amount A/’. If we now divide this 
change A/ by the distance A s = |AR| between P and Q , then the quotient A/7 As 
is the average rate of change of/(with respect to distance) as we move from P 
to Q. For instance, if the value of / at P is the temperature at this point, then 
A//As is the average rate of change of temperature along the segment PQ. The 
limiting value of A/7As as Q approaches P, namely, 


#-lim £ 

ds a.y—> o As 


is called the derivative of f at the point P in the direction u, or simply the di¬ 
rectional derivative off. In the case of the temperature function, dflds represents 
the instantaneous rate of change of temperature with respect to distance—roughly 
speaking, how fast it is getting hotter—at the point P as we move away from P 
in the direction specified by u. 

This is all very well, but how do we actually calculate dflds in a specific case? 
To discover how to do this, we assume that/(x, y, z) has continuous partial de¬ 
rivatives with respect to x , y, and z. Indeed, to avoid the tedious repetition of hy¬ 
potheses, we make this a blanket assumption for every function we discuss, un¬ 
less we explicitly state otherwise. With this, the Fundamental Lemma enables us 
to write A/in the form 
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Figure 19.10 


* /. df A df . ^ df 

A/ = Ax + A- by + ^ L 
J dx dy dz 


i CjLAA 


(1) 


where ei, e 2 , e 3 —> 0 as Ar, Ay, and Az -> 0, that is, as As —» 0. Dividing (1) by 
As now gives 


A/ = 8/ Ax 8/ Ay 8/ Az Ax Ay Az 

As dx As dy As 3z As As 62 As 63 As’ 


( 2 ) 


and by taking the limit as As —> 0, we see that the last three terms in (2) approach 
zero and we obtain the formula 


df = flfdx_ h dfdy + dfdz (3) 

ds dx ds dy ds dz ds’ 

This formula should be recognized as a special kind of chain rule, in the sense 
that as we move along the line through P and parallel to u,/is a function of x, 
y, and z, where x, y, and z are in turn functions of the distance s, and (3) shows 
how to differentiate/with respect to s. 
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We observe that the first factor in each product on the right of (3) depends 
only on the function / and the coordinates of the point P at which the partial de¬ 
rivatives of /are evaluated, while the second factor in each product is indepen¬ 
dent of / and depends only on the direction in which df/ds is being calculated. 
These facts suggest that the right side of (3) ought to be thought of—and writ¬ 
ten— as the dot product of two vectors, as follows: 


ds dy dz ) \ds ds J ds 


\ 3* dy dz ) ds 


(4) 


The first factor here is a vector called the gradient of/. It is denoted by the sym¬ 
bol grad j\ so that by definition 


grad/= f i + f j + f k - 


(5) 


With this notation, (4) can be written as 

df . , ,, dR 

i = (grad/) '^r 


( 6 ) 


But we know that dR/ds is a unit vector, and since it has the same direction as 
u, it equals u. Formula (6) is therefore equivalent to 

/ = (grad/) • u. (7) 



Figure 19.11 Directional derivative. 


This tells us how to calculate df/ds, because (5) is presumably simple to com¬ 
pute from the given function/', and then to evaluate at the given point P, and the 
dot product (7) of two known vectors is easy to find. 

For a given function / and a given point P, grad/is a fixed vector which can 
be placed so that its tail lies at P. We also place the tail of u at P, as shown in 
Fig. 19.11. To understand the significance of grad/ we use the definition of the 
dot product and the fact that u is a unit vector to write (7) in the form 

= |grad f\ cos 6 , (8) 

as 

where 6 is the angle between grad/and u. Since the direction of u can be cho¬ 
sen to suit our convenience, (8) immediately yields the first fundamental prop¬ 
erty of the gradient: 


Property 1 The directional derivative df/ds in any given direction is the scalar pro¬ 
jection of grad/in that direction (see Fig. 19.11). 


In this sense, the single vector grad/contains within itself the directional de¬ 
rivatives off at P in all possible directions. 

Next, if u is chosen to point in the same direction as grad/, so that 0=0 and 
cos 0=1, then (8) shows that df/ds has its maximum value—that is,/increases 
most rapidly—in this direction. Also, this maximum value equals |grad/1. These 
remarks give the next two fundamental properties of the gradient: 
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Property 2 The vector grad / points in the direction in which /increases most rapidly. 


Property 3 The length of the vector grad /is the maximum rate of increase off. 

As these remarks show, even though formulas (7) and (8) are equivalent, they 
play very different roles in our thinking, for we use (7) to calculate df/ds and (8) 
to understand the intuitive meaning of the vector grad /. 

Example 1 If/(jc, y, z) = x 2 — y + z 2 , find the directional derivative df/ds at the 
point (1,2, 1) in the direction of the vector 4i - 2j + 4k. 

Solution At the point (1,2, 1), we have grad/= 2;ti - j + 2zk = 2i - j + 2k. 
We obtain a unit vector u in the desired direction by dividing the given vector 
by its own length, 

4i - 2j + 4k 2 . 1 . 2, 

u = r - . — = - t - - j + -k. 

V16 + 4 + 16 3 3 3 

Formula (7) now gives 

f = (grad/)-u 

= (2i - j + 2k) • (fi - |J + fk) = 3. 

Thus, the function / is increasing at the rate of 3 units per unit distance as we 
leave (1,2, 1) in the given direction. 


Example 2 Let the temperature of the air at points in space be given by the func¬ 
tion/^, y, z) = x 2 - y + z 2 - A mosquito located at (1,2, 1) wishes to get cool 
as soon as possible. In what direction should it fly? 

Solution We saw in Example 1 that grad/= 2i — j + 2k at the point (1, 2, 1). 
Since the direction of grad / is that in which the temperature increases most 
rapidly, the mosquito should fly in the opposite direction, that of —grad/ = -2i + 
j-2k. 


The fourth fundamental property of the gradient is useful in geometry. In or¬ 
der to explain what it is, we denote the point under consideration by Pq = 
(xo, yo> zo) 1° emphasize that it is fixed in this discussion, and we let Co be the 
value of our function/at the point Pq. Then the set of all points in space at which 
/(;c, y, z) has the same value Cq constitutes, in general, a level surface through Pq 
whose equation is f(x, y, z) = Cq. We wish to show that the vector grad/is nor¬ 
mal (perpendicular) to this level surface at the point Po, as suggested on the left 
in Fig. 19.12. To this end, we consider a curve that lies on the surface and passes 
through Po- If we move to a nearby point Q on this curve and measure s along 
the curve, then A/ = 0 because / has the same value at all points on the surface, 
and therefore df/ds = 0 at Pq in the direction of the tangent to the curve. For¬ 
mula (6) remains valid and implies that 

( g rad /)-^- = o . 


(9) 
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Figure 19.12 

level surface. 


Gradient is normal to 



z) = 


c o 



where dR/ds is the unit tangent vector to the curve at Po. The vanishing of the 
dot product in (9) tells us that grad/is perpendicular to this tangent vector. But 
the same reasoning applies to every curve on the surface that passes through Po, 
so grad / is perpendicular to the tangent vectors to all these curves (Fig. 19.12, 
right). Since these tangent vectors determine the tangent plane at Po, and being 
normal to the surface means being normal to this tangent plane, we have: 


Property 4 The gradient of a function f[x, y, z) at a point Po is normal to the level 
surface of/that passes through P 0 . 


In the context of this discussion, we point out that the equation of any surface 
can be written in the form f(x, y, z) — co, and can therefore be regarded as a level 
surface of the function/(x, y, z). If Po = Uo. To. zo) is a point on this surface, then 
Property 4 tells us that the vector 



is normal to the tangent plane at Po, so if N ^ 0, the equation of this tangent 
plane is 

(I),. 1 ' ■'»» + (lk (y - »> + (!),.<* - ■ °- (io> 

We observe that this equation includes equation (4) in Section 19.3 as a special 
case; for if the surface is given in the form z = g(x, y), then this can be written 
as g(x, y) — z = 0, so the surface is a level surface of the function f(x, y , z) = 
g(x,y)~z , and this makes the coefficients in (10) equal to g. r Uo. To). 
g y (x o, To). ~ 1 ■ 

Example 3 Find the equation of the tangent plane to the surface xy 2 z 3 = 1 2 at 
the point (3, -2, 1). 

Solution This surface is a level surface of the function /(;c, y, z) = xy 2 z 3 . The 
vector grad/at the point (3, —2, 1) is normal to the surface at this point. This 
vector is 


grad / = y 2 z 3 i 4- 2xyz 3 j 4- 3xy 2 z 2 k 

= 4i — 12j 4- 36k = 4(i — 3j 4- 9k). 
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Therefore the equation of the tangent plane is 

(x- 3) - 3(y + 2) +9(z- 1) = 0 
or 

x - 3y + 9z = 18. 


Remark 1 The main uses of directional derivatives and gradients are found in 
the geometry and physics of three-dimensional space. However, these concepts 
can also be defined in two dimensions, and they have similar (but thinner) prop¬ 
erties. Thus, a curve f(x, y) = c 0 can be thought of as a level curve of the func¬ 
tion z = f(x, y); and if the gradient of this function is defined by 


g rad /=-| i + -| j . 


then the value of this gradient at a point Pq = (xo, yo) on the curve is a vector 
that is normal to the curve. 


Remark 2 The gradient of a function f(x, y, z) can be written in “operational 
form” as 




/. 


The del operator preceding the function / is usually denoted by the symbol V 
(an inverted delta, read “del”) so that 


V 


_o_. 

dx 1 




This del operator is similar to, but more complicated than, the familiar differen¬ 
tiation operator d/dx. When del is applied to a function j\ it produces a vector, 
namely, the vector grad /. In this notation, formulas (5), (6), and (7) become 

gradf= V/ f = V/-f, and f = V/-u. 


We will make very substantial use of the operator V in Chapter 21. 


PROBLEMS 

1 Find the gradient of/at P if 

(a) f(x, y, z) = xy + xz + yz, P = (~ 1,3, 5); 

(b) f(x, y, z) = e*y cos z, P = (0, 2, 0); 

(c) f(x, y , z) = In (x 2 + y 2 + z 2 ), P = (1,2, -2); 

(d ) f(x,y,z) = xy/z, P = (2, -1, 5). 

2 Find the directional derivative of / at P in the direction 
of the given vector: 

(a) f(x,y, z) = xy 2 + * 2 Z + yz, P = (1,1,2), i + 2j - k; 

(b) f(x, y , z) = In (x 2 + y 2 + z 2 ), P = (0, 0, 1), vector 

from P to (2, 2, 0); 

(c) f(x, y, z) = x sin y + y sin z + z sin x, P = ( 1,0, 0), 

2V3i + 2j; 


(d) f(x, y, z) = xye z + yze x , P = (1,0, 0), vector from P 
to (2, 2, 1). 

3 Find the maximum value of the directional derivative of 
/ at P, and the direction in which it occurs: 

(a) f(x, y, z) = sin xy + cos yz, P = (-3, 0, 7); 

(b) f(x, y, z) = e x cos y + e y cos z + e z cos 
P = (0,0,0); 

(c) /(*, y, z) = 2xyz + y 2 + z 2 , P = (2, 1,1); 

(d) /(*,y, z) = e x y z ,P = ( 2,1,1). 

4 In what direction should one travel, starting at the ori¬ 
gin, to obtain the most rapid rate of decrease of the func¬ 
tion 

/(*, y, z) = ( 2 - x - y) 3 + (3* + 2y - z + l) 2 ? 
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5 Find the unit vectors normal to the surface xyz = 4 at the 
point (2, —2, — 1). 

6 If /(x, y, z) = x 2 + 4 y 2 — 8 z, find dflds at (4, 1, 0) 
(a) along the line (x - 4)/2 = (y — 1)/1 = z/(— 2) in the 
direction of decreasing x; (b) along the normal to the 
plane 3(x - 4) - (y — 1) + 2z = 0 in the direction of 
increasing x; (c) in the direction in which/increases most 
rapidly. 

7 Suppose that the temperature T at a point P = (x, y, z) is 
given by T = 2x 2 — y 2 + 4 z 2 . Find the rate of change of 
T at the point (1, -2, 1) in the direction of the vector 
4i — j + 2k. In what direction does T increase most 
rapidly at this point? What is this maximum rate of in¬ 
crease? 


8 Find the tangent plane and normal line to the hyperboloid 
x 2 + y 2 — z 2 = 5 at the point (4, 5, 6), 

9 Show that the tangent plane to the quadric surface ax 2 + 
by 2 + cz 2 = d at the point (x 0 , Vo, zo) has ax^x + byoy + 
czoz — d as its equation. 

10 Show that the del operator has the following properties 
that demonstrate its close similarity to the differentiation 
operator d/dx: 

(a) V(/+ g) = V/ + Vg; 

(b) V(/g) =fVg + gVf\ 

(d) V/" = nf n ~'Vf. 


19.6 

THE CHAIN RULE FOR 
PARTIAL DERIVATIVES 


The single-variable chain rule for ordinary derivatives tells us how to differenti¬ 
ate composite functions. It says that if w is a function of x where x is in turn a 
function of a third variable r, say w = f(x) where x = g(t ), then 

dw _ dw_ dx_ 

dt~ dx dt • U) 


We know from ample experience that this is an indispensable tool of calculus; it 
is used more frequently than any other differentiation rule. 

The simplest multivariable chain rule involves a function w = /(jc, y) of two 
variables t and y, where x and y are each functions of another variable r, x = g(t) 
and y = h{t). Then w is a function of r, 

w =f[g(t), h(t)] = F(t), 

and we shall prove that the derivative of this composite function is given by the 
formula 


dw _ dw dx dw dy 
dt dx dt dy dt‘ 

This is the chain rule for this situation. 

The proof of (2) is easy. We begin by changing t to t + A t, where Ar =£ 0. This 
increment in t produces increments Ajc and Ay in t and y, which in turn produce 
an increment Aw in w. Since all the functions we discuss are assumed to have 
continuous partial derivatives, the Fundamental Lemma enables us to write Aw 
in the form 


Aw = -r— Ax + Ay + €\ Ax + e 2 Ay, (3) 

dx dy 

where e\ and €2 —> 0 as Ax and Ay —> 0. On dividing (3) by Ar, we obtain 

Aw dw Ax ,dw Ay , Ax Ay 

“7— = -T—7- + - 5 —r 1 + €1 — + 62 (4) 

Ar 3x Ar 3y Ar Ar Ar 

If we now form the limit as Ar —> 0, then Ax and Ay also —> 0, so e\ and €2 —> 0, 
and (4) immediately yields (2). 
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Example 1 If w = 3x 2 + 2xy — y 2 where x = cos t and y = sin t, find dxv/dt. 


Solution Formula (2) tells us that 
dw 

— = (6 jc + 2 y)(-sin t) + {2x - 2 y) cos t. 
at 

By substituting x = cos t and y = sin r, we can express this in terms of t alone, 
dw 

—— = (6 cos r + 2 sin r)( —sin t) + (2 cos t — 2 sin r)(cos t) 
dt 

= —6 sin t cos t — 2 sin 2 t + 2 cos 2 t — 2 sin t cos t 
= 2(cos 2 t — sin 2 t) — 8 sin t cos t = 2 cos 2t — 4 sin 2 1. 

We can check this result by first substituting and then differentiating, which gives 
w = 3 cos 2 t + 2 sin t cos t - sin 2 t 


and 


= 6 cos r(—sin t) + 2 sin r(—sin 0 + 2 cos 2 t ~ 2 sin t cos t 
dt 

= 2(cos 2 t - sin 2 0 ~ 8 sin t cos t = 2 cos 2t - 4 sin 2r. 


as before. 


In the situation of formula (2), it is convenient to call w the dependent vari¬ 
able, x and y the intermediate variables , and t the independent variable. We no¬ 
tice that the right side of ( 2 ) has two terms, one for each intermediate variable, 
and that each of these terms resembles the right side of the single-variable chain 
rule ( 1 ). 

Formula (2) extends in an obvious way to any number of intermediate vari¬ 
ables. For instance, if w = f(x , y, z) where x , y, and z are each functions of r, then 

dw _ dw dx dw dy dw dz 

dt dx dt 3y dt dz dt' 

The proof of this is essentially the same as the proof of (2), except that it uses 
the Fundamental Lemma for three variables instead of two. 

Further, x , y, and z here need not be functions of only one independent vari¬ 
able, but can be functions of two or more variables. Thus, if x , y, and z are each 
functions of the variables t and u , then w is also a function of t and w, and its 
partial derivatives are given by 

dw_ _ dw_dx_ dw_dy_ dw_dz_ 

dt ~dxdt dy dt dz dt (6) 

and 

dw _ dw_dx_ + dwdy_ + dw dz ^ 

du dx du 8 y du dz du 

We use roundback d 's everywhere here because every function depends on more 
than one variable. It is necessary to be very clear about the meanings of the let¬ 
ters in formulas like these. For example, on the left side of ( 6 ), w is considered 
a function of t and w, while on the right side it is considered a function of x, y, 
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and z. The proofs are the same as before, and all of these formulas—(2), (5), 
( 6 ), (7), and their extensions to any number of intermediate and independent vari¬ 
ables—are collectively called the chain rule. 

In Section 19.4 we defined the differential dw of a function w = f(x, y, z) by 
the formula 


dw 3w dw 

aw = -z— dx -I- dy -I- -r— dz- 
ox dv ' dz 


( 8 ) 


The chain rule (5) tells us that if jc, y, z are themselves functions of a single in¬ 
dependent variable r, then it is permissible to calculate dw/dt by formally divid¬ 
ing (8) by dt. Similarly, if jc, y, z are functions of the independent variables r, u 
and we want to calculate 3w/3r, then the chain rule ( 6 ) tells us that we can find 
dw/d1 by dividing ( 8 ) by dt and writing roundback c/’s in place of ordinary d' s to 
show that there is another independent variable present which is being held fixed. 

The individual terms on the right side of ( 8 ) are sometimes called the partial 
differentials of w with respect to jc, y, z. From this point of view, the quantity dw 
defined by ( 8 ) deserves the name total differential, as we remarked in Section 
19.4. 


Example 2 A function of several variables is said to be homogeneous of degree 
n if multiplying each variable by t (where t > 0 ) has the same effect as multi¬ 
plying the original function by t n . Thus,/(*, y) is homogeneous of degree n if 

f(tx,ty) = t n f(x,y). (9) 

For example, /(jc, y) = jc 2 + 3jcy is homogeneous of degree 2, because/(fjc, ty) = 
(tx) 2 + 3(/*)(fy) = t 2 (x 2 + 3jcy) = t 2 f(x, y). Similarly,/(x, y) = {x + y)/(x - y) is 
homogeneous of degree 0,/(jc, y) = (jcy — x 2 e xly )ly is homogeneous of degree 1 , 
and f(x, y, z) = Vjc 3 — 3jcy 2 + 2z 3 is homogeneous of degree f . Most functions, 
for instance f(x, y) = y 2 + x sin y, are not homogeneous at all. 

There is a theorem of Euler about homogeneous functions that has several im¬ 
portant applications: ///(jc, y) is homogeneous of degree n, then 

X fc + y 3y = n ^ X ' y ^ ( 10 ) 

To prove this, we hold jc and y fixed and differentiate both sides of (9) with re¬ 
spect to t. We can clarify this process by writing u = tx and v = ty , so that (9) 
becomes 


/ (u, v) = t n f (x, y). 

Then by using the chain rule to differentiate with respect to f, we obtain 

3 / 3 ^ 

dv dt 


3/ 3 u 
3 u dt 


H ~ ~ = nt n ~ ] f(x,y) 


or 

x + y —■ = nt n ~f{x, y), 
du dv 

and putting t = 1 yields (10). Similarly, if f(x\, JC 2 ,. .. , jc,„) is homogeneous of 
degree n, then the same argument shows that 

x \ + x 2 + * * * + x m = nf{x],x 2 , • • • , x m ). 
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Euler’s theorem has some interesting consequences for economics. As an ex¬ 
ample, suppose that/(x, y) is the production (measured in dollars) of x units of 
capital and y units of labor. If the amounts of capital and labor are doubled, then 
it is reasonable to expect that the resulting production will also double, that is, 
that/(2x, 2 v) = 2 /(jc, y). More generally, we expect that 

f(tx, ty) = tf{ x , y), 

so the production function is homogeneous of degree 1. [In economics, this prop¬ 
erty of f(x, v) is called constant returns to scale.] Euler’s theorem now says that 

f 0 c,y)=x fc +y fr (11) 

The partial derivatives df/dx and df/dy are called the marginal product of capi¬ 
tal and the marginal product of labor, respectively. In this language, (11) is a the¬ 
orem of quantitative economics whose verbal statement is, “The total value of 
production equals the cost of capital plus the cost of labor if each is paid for at 
the rate of its marginal product.” Under these circumstances there are no surplus 
earnings, and in the real world this is a very bad thing.* 


Example 3 Many applications of the chain rule involve calculating the effect on 
some equation or expression when new variables are introduced. As an illustra¬ 
tion of a method that will be useful for solving the wave equation in Section 19.9, 
we now solve the partial differential equation 



dw 

3y’ 


a F 0. 


( 12 ) 


That is, we find the most general function w = /(;c, y) that satisfies this equation. 
To do this, we introduce new independent variables u , v by writing 

u = x + ay, v— x — ay. (13) 


We think of w as a function of u and v, 

w = F(u, v), 


and we find the w^-equation equivalent to (12) by using the chain rule to write 

dw _ dw du dw dv _ dw dw 
dx du dx dv dx du dv ’ 


dw _ dw du + dw dv _ dw dw 

dy du dy dv dy du dv 

By substituting in these expressions, we see that (12) transforms into the partial 
differential equation 


^ dw 
2 a — = 0 
dv 


or 



Tor further information on these matters, see pp. 81-84 of J. M. Henderson and R. E Quandt, Mi¬ 
croeconomic Theory (McGraw-Hill, 1971); or Chapter 12, “Homogeneous Functions and Euler’s The¬ 
orem,” in D. E. James and C. D. Throsby, Quantitative Methods in Economics (Wiley, 1973). For an 
application of Euler’s theorem to advanced theoretical mechanics, see p. 531 of the present writer’s 
text. Differential Equations, 2nd ed. (McGraw-Hill, 1991). 
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This equation is very easy to solve, because it says that the function w = F(u, v) 
is constant when u is held fixed and v is allowed to vary, and therefore is a func¬ 
tion of u alone. This means that our desired solution of (12) is 

w = g(u) = g(x + ay), 

where g{u) is a completely arbitrary (continuously differentiable) function of u. 
We apologize to students for introducing “out of the blue” the apparently un¬ 
motivated transformation equations (13). However, some of the developments of 
Section 19.9 will make this procedure seem fairly natural. 


Example 4 Partial derivatives are the main mathematical tools used in thermo¬ 
dynamics. It is the universal practice in this science to avoid confusion by using 
subscripts on partial derivatives to specify the variable (or variables) held fixed 
in the differentiation. Thus, if w = F(x , y) then dw/dx would be denoted by 

dw \ 
dx)y 

This notation tells us that w is being thought of as a function of * and y, and that 
y is held fixed and x is the variable of differentiation. This usage may seem su¬ 
perfluous, but the following situation—which is quite common in thermody¬ 
namics—shows that it is not. 

If w =f(x, y) where y is a function g(x, t) of x and another variable t, so that 
w is a composite function of x and f, we find its partial derivative with respect 
to x. 

This is a typical chain rule situation with x and y the intermediate variables 
and x and t the independent variables: 


w =f(x, y) 


where 


x = x, 
y = g(x, t). 


The chain rule therefore gives 


3w _ dw dx + 3w dy 
dx dx dx dy dx ’ 


(14) 


so 


3w _ chv + dw dy 
dx dx dy dx' 


(15) 



Unfortunately this equation contains two partial derivatives of w with respect to 
x. By an effort of thought one can keep in mind that dxv/dx on the left of (15) is 
the derivative of the composite function, while dw/dx on the right is the deriva¬ 
tive of w =f(x, y). Nevertheless, this ambiguous notation invites confusion and 
is contrary to the overall spirit of mathematical symbols, which are intended to 
make it easy to be correct with a minimum of thought. However, if we use the 
subscript notation of thermodynamics, then (14) can be written as 


/3vv\ / dw\ (dx\ (dw\ fdy) 

[dxjr [dxj y [dxj, \dyjx\dxj; 
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Since idx/dx) t = 1, this becomes 


dw ) _ (dw) 
dx jy 


dx 


(16) 


which is somewhat clumsy but much less vulnerable to misunderstanding than 
(15).* 


"Students whose main interest is physics may wish to read discussions of these matters in some of 
the standard treatises. See, for example, p. 19 of Enrico Fermi, Thermodynamics (Dover, 1956); p. 28 
of Philip M. Morse, Thermal Physics (W. A. Benjamin, 1969); or pp. 30-33, 52-55 of F. W. Sears, 
Thermodynamics (Addison-Wesley, 1953). 


PROBLEMS 


In Problems 1-4, find dw/dt in two ways, (a) by using the 
chain rule and then expressing everything in terms of t, and 
(b) by first substituting and then differentiating. 

1 w = e x2+y2 , x = cos t, y = sin t. 

2 w = xy + yz + zx, x = 3 t 2 , y = e\ z = e~ f . 

3 w = — 5 —- r, x = r, y = 3 1. 

x — y 

4 w = In U 4 + 2* 2 y + 3y 2 ), x = t, y = It 2 . 


In Problems 5 and 6 , find dw/dt and dw/du by the chain rule 
and check your answers by using a different method. 

5 w = x 2 + y 2 , x = t 2 — u 2 , y = 2 tu. 

6 w = — 7 , x = t cos u, y = t sin u. 

x z + y L 

7 If/is any (continuously differentiable) function, show 
that w = f(x 2 — y 2 ) is a solution of the partial differ¬ 
ential equation 


y 


dw 

dx 


+ 



= 0 . 


Hint: Write w = f(u) where u = x 2 - y 2 and apply the 
chain rule with only one intermediate variable. 

8 If a and b are constants and w = f(ax + by), show that 


, dw dw 

b J7 = a !fy' 

9 If w = f(x 2 - y 2 , y 2 - X 2 ), show that 
dw dw 

y ^ + x l^ = 0 - 


10 If w = ft-2—L LZJL 


xy 


yz 


show that 


2 dw 2 dw 2 dw 

x -& + y l^ + z H = a 


11 The differential dw of a function w =f(x, y, z ) is de¬ 
fined by ( 8 ) only for the case in which x, y, z are inde¬ 
pendent variables. If x, y, z are not independent, but in¬ 


stead are functions of independent variables f, u, then 
dw must be defined by 


12 


13 


, dw , dw , 
dw = at + — du. 

dt du 

Show that the definition implies ( 8 ), so that ( 8 ) remains 
valid regardless of whether x, y, z are independent or 
not. 

If u and z;are both functions of ;c,y, z, show that 

(a) d{c) = 0 , c a constant; 

(b) d{cu) = c du; 

(c) d{u + v) = du + dv; 

(d) d{uv) = u dv + vdu; 

, . ,(u\ vdu — udv 


V L 

(f) if w =f(u ), then dw =f'(u) du. 

Verify Euler’s theorem (10) for each of the following 
functions: 

(a) fix , y) = xy 2 + x 2 y - y 3 ; 

(b) fix, y) = _ 

(c) fix, y) = Vx 2 + ~y 2 \ 


(d)/U, y) = 


Vx + 


14 If w = f(x, y) where x = r cos 6 and y = r sin 6 , show 
that 


(dw_\ 2 I dw\ 2 _ (dw_\ 2 1 ( dw\ 2 

W \dyj "W 

15 If a is a constant and w = fix , y) where x = u cos a — 
v sin a and y = u sin a + i^cos a, show that 

(dw) 2 (dw) 2 _( dw) 2 (dw) 2 

\dxj + \ dy ) \du) \ 3 i;/ ’ 

16 If w = fix,y) where x = e u cos v and y = e u sin v, 
show that 
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17 Obtain formula (16) by using differentials, as follows: 
(a) Write 



(b) If the internal energy E of the quantity of gas un¬ 
der discussion is a f unction of V and T, then, since 
T is a function of p and V, E is indirectly a func¬ 
tion of p and V. Show that 


dw 



dx + 








(b) Substitute dy from the third formula into the first, 
and compare the result with the second. 

* 18 Let the pressure, volume, and temperature of a given 
quantity of a certain gas be denoted (as usual) by p, V, 
T. These variables are not independent, but are con¬ 
nected by an equation of the general form 


It is sometimes necessary to work with functions of the form 

w = F(w, v, x) = f{x , y) dy , 

Ju 

where u = u(x) and v = i{x) are functions of x. The chain rule 
yields 

dw _ dw du dw dv 3w 
dx du dx dv dx dx' 


ftp, V,T) = 0, 


which is called the equation of state. This equation de¬ 
termines any one of the variables as a function of the 
other two. 

(a) By calculating dp and dV, and eliminating dV, show 
that 

M = _L_ 

\dV)T ( dVldp) T ' 





= o, 



dV\ fdT 
dT )p\ dp )v 


- 1 . 


and by applying the Fundamental Theorem of Calculus and 
differentiating under the integral sign (Problem 32 in Section 
19.2), we obtain 


d_ 

dx 


fix, y) dy 




Tx ffx ’>'> 


dy. 


This is known as Leibniz's formula. 


19 Verify Leibniz’s formula in the following cases: 

(a) u = x, v— x 2 , fix, y) = x + y; 

(b) u = x, v= x 2 , fix, y) = x 3 y 2 + x 2 y 3 \ 

(c) u = x, v = x 2 ,fix, y) = In y. 


19.7 

MAXIMUM AND 
MINIMUM PROBLEMS 


In the case of functions of a single variable, one of the main applications of de¬ 
rivatives is to the study of maxima and minima. In Chapter 4 we developed var¬ 
ious tests involving first and second derivatives, and we used these tests for graph¬ 
ing functions and attacking a wide variety of geometric and physical problems. 
Maximum and minimum problems for functions of two or more variables can be 
much more complicated. We confine ourselves here to an introduction to such 
problems, including a two-variable version of the second derivative test (Remark 
3, Section 4.2). 

Suppose that a function z = fix , y) has a maximum value at a point Pq = 
(xq, yo) in the interior of its domain. This means that fix,y) is defined, and also 
fix, y) ^fixo,yo), throughout some neighborhood of Pq, as shown on the left in 
Fig. 19.13.* If we hold y fixed at the value y 0 , then z =f(x,y 0 ) is a function of 
x alone, and since it has a maximum value at x = xq , its derivative must be zero 
there, as in Chapter 4. That is, dz/dx = 0 at this point. In just the same way, 
dzldy = 0 at this point. The equations 


dz 

dx 


= 0 


and 


dz 

dy 


= 0 


( 1 ) 


*In this discussion we are considering only a so-called relative (or local) maximum, which takes into 
account only points near to Pq, but for simplicity we drop the adjective. 
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Figure 19.13 


are therefore two equations in two unknowns that are satisfied at the maximum 
point (xo, yo). In many cases we can solve these equations simultaneously to find 
the point (jto, yo), and thus the actual maximum value of the function. 

Exactly the same considerations apply to the minimum value shown in the cen¬ 
ter of the figure. However, when we try to locate maximum or minimum values 
of a function by solving equations (1), it is necessary to keep in mind that these 
equations are also satisfied at a saddle point like that shown on the right, where 
the function has a maximum in one direction and a minimum in some other di¬ 
rection. Equations (1) mean only that the tangent plane is horizontal, and it is 
then up to us to decide what significance this fact has. 

By analogy with our earlier definition in Chapter 4 for functions of one vari¬ 
able, we call a point (jto, yo) where both partial derivatives are zero a critical point 
of f(x, y). 

Example 1 Find the dimensions of the rectangular box with open top and a fixed 
volume of 4 ft 3 which has the smallest possible surface area. 

Solution If jc and y are the edges of the base, and z is the height, then the area 
is 


A = xy + 2 xz + 2 yz- 


Since ryz = 4, we have z - 4/jcy, and the area to be minimized can be expressed 
as a function of the two variables x and y, 

A = xy + - + (2) 

y x 


We seek a critical point of this function, that is, a point where 


dA 

dx 


= y 





To solve these equations simultaneously, we first write them as 

x 2 y = 8, *y 2 = 8. 

Dividing gives x/y = 1, so y = jc and either equation becomes jc 3 = 8. Therefore 
x = y = 2, and it follows from this that z = 1, so the box with minimum surface 
area has a square base and a height one-half the edge of the base. 


In this example it is geometrically clear that the critical point (2, 2) is actually 
a minimum point, and not a maximum or saddle point. However, in a more com¬ 
plicated situation we might find a critical point and yet be completely unable to 
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state its nature, based on commonsense considerations alone. A useful tool for 
classifying critical points is provided by the second derivative test : 

If fix, y) has continuous second partial derivatives in a neighborhood of a critical 
point Uo,yo)> am ^ tf a number D (called the discriminant) is defined by 

D = f xx (x 0, yo)fyy(Xo, Jo) - [fxy(x 0, jo)] 2 . (3) 

then (x 0 , Jo) ,s 

(i) a maximum point if D > 0 andf^x o, yo) < 0; 

(ii) a minimum point if D > 0 andf^xo, yo) > 0; 

(iii) a saddle point if D < 0. 

Further, ifD = 0, then no conclusion can be drawn, and any of the behaviors 
described in (i) to (iii) can occur. 

A complete proof of this theorem requires machinery that is not available to us. 
We refer interested students to more advanced books.* 

Example 1 (continued) As an illustration of the use of the second derivative test, 
we apply it to verify that the critical point (2, 2) found in Example 1 is a mini¬ 
mum point of the function (2). Here we have 

A =J6 A = ii A =1 

*xx x i> * n yy yi » ^xy 1 » 

so the discriminant (3) has the value D = 2*2 — 1 2 = 3>0. Since A xx is also 
positive at the point ( 2 , 2 ), the test tells us that this critical point is indeed a min¬ 
imum point, as claimed. 


Example 2 Find the critical points of the function 

z = 3x 2 + 2xy + y 2 + \0x + 2y + 1, 
and use the second derivative test to classify them. 


Solution Here we have 

^ = 6x + 2y + 10 = 0 and 
so the system of equations we must solve is 


= 2x + 2j + 2 = 0, 
dy 


3 x + y = -5, 


x + y = -1. 


By simple manipulations we easily see that x = — 2 and y = 1, so there is a sin¬ 
gle critical point ( — 2, 1). At this point we have 

D = ZxxZyy ~ Z X y 2 = 6*2 — 2 2 = 8>0, 

and since z xx = 6 > 0, the critical point is a minimum point. 


"See, for example, pp. 157-159 of R. C. Buck, Advanced Calculus (McGraw-Hill, 1978). 
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Success in finding maximum and minimum points for a function z — fix, y) 
clearly depends on our ability to solve the two simultaneous equations/ v = 0 and 
f y = 0. In Examples 1 and 2 these equations were very easy to solve. However, 
as students can readily imagine, there are many complicated situations that arise 
in which routine methods of solving simultaneous equations are quite useless. 
The only general advice we can give is to try to solve one of the equations for 
one of the unknowns in terms of the other, substitute this in the second equation, 
and try to solve the result. Apart from this, make good guesses and be ingenious 
—advice that is easier to give than to follow! 


PROBLEMS 

In Problems 1-8, find the critical points and classify them by 
means of the second derivative test. 

1 z = 5x 2 — 3 xy + y 2 - \5x - y + 2. 

2 z = 2x 2 + jcy 4- 3y 2 + l(k — 9y + 11. 

3 z = x 5 + y 4 - 5x-32y - 3. 

4 z = x 2 + y 3 — 6 xy. 

5 z = x 2 y + 3xy - 3x 2 - 4x + 2 y. 

6 z = 3 xy 2 + y 2 — 3x — 6y + 7. 

7 z = x 3 + y 3 + 3*y + 5. 

8 z — xy(2x + 4y + 1). 

9 For each of the following functions z = fix, y), show 
that f x , f y , and D are all 0 at the origin. Also show that 
at the origin (a) has a minimum, (b) has a maximum, 
and (c) has a saddle point. 

(a) fix, y) = x 4 + /. 

(b) /(*, y ) = -* 4 - y 4 . 

(c) f(x, y) = xy. 

10 Show that a rectangular box with a top and fixed vol¬ 
ume has the smallest surface area if it is a cube. 

11 Show that a rectangular box with a top and fixed sur¬ 
face area has the largest volume if it is a cube. 

12 If the equations fix) = 0 and fix) = 0 have no com¬ 
mon roots, show that any critical points of z = yfix) + 
g(;t) must be saddle points. 

13 If the sum of three numbers x, y, and z i s 12, what must 
these numbers be for the product of ;t, y 2 , and z 3 to be 
as large as possible? 

*14 The function z = iy - x 2 )iy — 2x 2 ) has a saddle point 
at ( 0 , 0 ). 

(a) Verify that (0,0) is the only critical point. 

(b) Show that the second derivative test fails to estab¬ 
lish that this critical point is a saddle point. 

(c) Show that this critical point is a saddle point by di¬ 
rect examination of the sign of the function near 
(0, 0). 

(d) It seems reasonable to suppose that a critical point 
Po = Uo, yo) of z = fix, y) will necessarily be a 
minimum point for this surface if every vertical sec¬ 
tion of the surface through Po has Po as a minimum 
point. Show that this idea is false by examining the 


vertical section of z = (y — x 2 )iy — 2x 2 ) in the 
plane y = mx. 

15 A rectangular box has three faces in the coordinate 
planes and one vertex P = ix, y, z) in the first octant on 
the plane ax + by + cz = 1. Find the volume of the 
largest such box. 

16 Solve Problem 15 if P lies on the ellipsoid x 2 /a 2 + 
y 2 /b 2 + z 2 /c 2 = 1. Hint: Use implicit differentiation. 

17 Solve Problem 15 if P lies on the paraboloid z = 1 — 
x 2 - y 2 . 

18 If z = fis, t) is the square of the distance between a vari¬ 
able point on the line 

x = — 2 + 4s, y = 3 + s, z = -1 + 5s 
and a variable point on the line 

x = —\ - 21, y = 3t, z = 3 + t, 

show that this function has one critical point which is a 
minimum. In this way find the distance between the lines. 

19 Find the distance from the origin to the plane x + 2 y + 
3 z = 14. Hint: Minimize w = x 2 + y 2 + z 2 by treating 
y and z as the independent variables. 

20 The sides of an open rectangular box cost twice as much 
per square foot as the base. Find the relative dimensions 
of the largest box that can be made for a given cost. 

21 Find the equation of the plane through (2, 2, 1) that cuts 
off the smallest volume from the first octant. 

22 If (* 1 , y0, ix 2 , yf), and ix^, y 3 ) are the vertices of a tri¬ 
angle, find the point ix, y) such that the sum of the 
squares of its distances from the vertices is as small as 
possible. 

23 If a, /3, y are the angles of a triangle, find the maxi¬ 
mum value of sin a + sin /3 + sin y, 

*24 Among all triangles with given fixed perimeter, show 
that the equilateral triangle has the largest area. Hint: 
Let the perimeter be 2s, use Heron’s formula 
A = V sis — a)is — b)is — c), and maximize In A. (Can 
you solve this problem without calculation, by merely 
thinking about it?) 

*25 Among all triangles inscribed in a given circle, show 
that the equilateral triangle has the greatest area. Hint: 
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If a, p, y are the central angles subtending the sides of 
the triangle, so that a + /3 + y= 27 r, observe that the 
area of the triangle is a constant multiple of sin a + 
sin P — sin (a + /3). 

26 When an electric current of magnitude / flows through 
a wire of resistance R, the heat generated is proportional 
to 1 2 R. Two terminals are connected by three wires of 
resistances R\,R2, /? 3 - A given current flowing between 
the terminals will divide in such a way as to minimize 
the total heat produced. Show that the currents /j, Ij , h 
in the three wires will satisfy the equations I\R\ = 
I2 R2 — hRi- 

*27 A pentagon consists of an isosceles triangle on top of 

a rectangle. If the perimeter P is fixed, find the dimen¬ 
sions of the rectangle and the height of the triangle that 
yield the maximum area. 

*28 Show that the surface^ = ( 2x 2 + y 2 )e { ~ x2 ~y 2 looks like 
two mountain peaks joined by two ridges with a hol¬ 
low depression between them. 

29 A laboratory scientist performs an experiment n times 
and obtains n pairs of data, 

(x h y\), (x 2 ,y 2 \ ...» (x nf y n ). 

The theory underlying her experiment suggests that 
these points should lie on a straight line y = mx + b, 
but they do not because of experimental error. She then 
determines the line that gives the “best fit” for the data 
in the sense of the method of least squares (see Prob¬ 
lem 30 in Section 4.4): She chooses m and b to mini¬ 
mize the sum of the squares of the vertical deviations 
(Fig. 19.14), 

n 

S = S(m, b) = ^ (mxi + b — y,) 2 . 

/=i 



Show that m and b are determined as the simultaneous 
solution of the equations 

m £ Xi 2 + b £ Xj = £ xty h 
m^Xi + nb = ^ >’|. 

30 Use the method of least squares explained in Problem 
29 to find the line that best fits the data (1,1.7), (2, 1.8), 
(3, 2.3), (4,3.2). 

*31 Use the second derivative test to verify that S in Prob¬ 
lem 29 is actually minimized by the stated values of m 
and b. Hint: It is necessary to use the fact that (Sx ,) 2 < 
riZxj 2 unless the x {s are all equal; as a start toward es¬ 
tablishing this, show that the maximum value of 
/( jc , y, z) = x + y + z on the sphere x 2 + y 2 + z 2 = a 2 
is V3 a, and conclude that (x + y + z ) 2 ^ 3(* 2 + y 2 + 
z 2 ) for any three numbers x, y, z. 


19.8 

CONSTRAINED 
MAXIMA AND MINIMA. 

LAGRANGE 

MULTIPLIERS 


In this section we explain the method of Lagrange multipliers by means of intu¬ 
itive ideas that depend on the geometric meaning of gradients. This method is 
used for maximizing or minimizing functions of several variables subject to one 
or more constraints. It is an important tool in economics, differential geometry, 
and advanced theoretical mechanics. 

We begin with the simplest case, that of two variables and one constraint. 

In Section 19.7 we learned how to calculate maximum and minimum values 
of a function z = f(x, y) of two independent variables x and y. However, in many 
problems x and y are not independent, but instead are connected by a side con¬ 
dition or constraint in the form of an equation 


g(x,y) = 0 . 


(1) 


In Chapter 4 we became thoroughly familiar with situations of this kind. The fol¬ 
lowing is a simple illustration. 


Example 1 Find the dimensions of the rectangle of maximum area that can be 
inscribed in a semicircle of radius 
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Solution I* is clear from Fig. 19.15 that the problem is to maximize the func¬ 
tion 


A = 2 xy 


( 2 ) 


subject to the constraint 

x 2 + y 2 = a 2 . (3) 

In Example 3 of Section 4.3 we solved this problem by using the constraint (3) 
to express A as a function of only one variable, 

y = Va 2 - x 2 , so that A = 2jcV a 2 - x 2 . 



We then calculated dA/dx, set it equal to zero, solved the resulting equation, and 
so on. This example will be continued after some remarks and explanations. 


The procedure we have just described works well enough for this problem, but 
as a general method it has two defects. First, in this particular case equation (3) 
is easy to solve for y, but in another problem the constraint ( 1 ) might be so com¬ 
plicated that it would be difficult or impossible to solve. The other defect lies in 
the fact that even though the variables x and y play equal roles in the problem, 
they are handled differently in the solution: We singled out one variable, jc, to be 
the independent variable, and the other, y, to be the dependent variable. It is of¬ 
ten more convenient, and certainly more elegant, to treat such problems in a sym¬ 
metric form, in which no preference is given to any one of the variables over the 
others.* 

We now return to the general problem of maximizing a function /(jc, y) sub¬ 
ject to the constraint g(x, y) = 0. To understand what is going on, we sketch the 
graph of g(x, y) = 0 (Fig. 19.16) together with several level curves /(jc, y) = c of 
the function /(jc, y), noting the direction in which c increases. In the figure, for 
instance, we suppose that c\ < C 2 < C 3 < c 4 . To find the maximum value of / 
(jc, y) along the curve g(jc, y) = 0, we look for the largest c for which /(jc, y) = c 
intersects g(jc, y) = 0. At such an intersection point ( Pq in the figure) the two 
curves have the same tangent line, so they also have the same normal line. But 
the vectors 




and 


, dg . 

gradg= aU 




are normal to these curves, and are therefore parallel to each other at the point 
P 0 . Hence one vector is a scalar multiple of the other at P o, that is, 

grad/ = A grad g (4) 

for some number A. (This argument assumes that grad g =£ 0 at Po, so that the 
curve #(jc, y) = 0 actually has a tangent at this point.) 


*The great physicist Einstein once said—probably in a fit of impatience with mathematicians and 
their ways—that “Elegance is for tailors,” but he was wrong. For mathematicians and theoretical 
physicists alike, the aesthetic factor in their thinking is as indispensable as the senses of taste and 
smell are for a master chef. 
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The vector equation (4), together with g(jc, y) 
tions 

_ x dg_ _ dg 

dx ax' dy dy’ 


0, yields the three scalar equa- 


g(x, y) = 0. (5) 


Accordingly, we have three equations that we can try to solve simultaneously for 
the three unknowns jc, y, and A. The points ( jc , y) that we find are the only pos¬ 
sible locations for the maximum (or minimum) values of /(jc, y) with the con¬ 
straint g(jc, y) = 0. The corresponding values of A may emerge from the process 
of solving (5), but they are usually not of much interest to us. The final step is 
to calculate the value of /(jc, y) at each of the solution points ( x , y) in order to 
distinguish maximum values from minimum values. 

The method of Lagrange multipliers is simply the following handy device for 
obtaining equations (5): Define a function L(jc, y, A) of the three variables jc, y, 
and A by 

L(x, y, A) = f(x, y) - A g(x, y), (6) 


and observe that equations (5) are equivalent, in the same order, to the equations 





(7) 


The variable A is called the Lagrange multiplier Thus, to find the constrained 
maximum or minimum values of /(jc, y) with the constraint g(jc, y) = 0, we look 
for the unconstrained (or free) maximum or minimum values of the function L 
defined by (6). We emphasize that this method has two major features that can 
be of practical value, and are often important for theoretical work: It does not 
disturb the symmetry of the problem by making an arbitrary choice of the inde¬ 
pendent variable, and it removes the constraint at the small expense of introduc¬ 
ing A as another variable. 


Example 1 (continued) To solve the inscribed rectangle problem by this new 
method, we first express the constraint (3) in the form x 2 + y 2 — a 2 = 0 and then 
write down the function 


The equations (7) are 


L = 2xy — A(jc 2 + y 2 — a 2 ). 


— = 2 y — 2 Ax = 0, 


— = 2x — 2 Ay = 0, 
dy 


dL 

3A 


—(x 2 + y 2 — a 2 ) = 0. 


(8) 

(9) 

( 10 ) 


Equations (8) and (9) yield y = A* and jc = Ay, and substituting in (10) gives 

A 2 (jc 2 + y 2 ) = a 2 . 

But (10) tells us that jc 2 + y 2 = a 2 , so A 2 = 1 and A = ± 1. The value A = — 1 
would imply that y = — jc, which is impossible because both jc and y are positive 
numbers, so A = 1 and y = jc. This gives the shape of the largest inscribed rec¬ 
tangle, namely, twice as long as it is wide, because 
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length = 2x = 2 y = 2(width). 

If we want the actual dimensions of this largest rectangle, we substitute y = x 
into x 2 + y 2 = a 2 to find that x = y = jV2a , so the length = 2 x = V2 a and the 
width = y = yV2 a. 


One of the merits of the method of Lagrange multipliers is that it extends very 
easily to situations with more variables or more constraints. For instance, to max¬ 
imize /(x, y, z) subject to the constraint g(x , y, z) = 0, the gradient 




must be normal to the surface g(x , y, z) = 0 (Fig. 19.17), so grad/must be par¬ 
allel to grad g, and again we have 

grad / = A grad g. 


The four equations 


dx dx' 3y 3y’ dz dz’ 


g(x, y, z) = 


in the four unknowns x, y, z, A are again equivalent to the simpler 
dL 


dx 


dL dL dL 

* ft 37*°' aT = 0 - 


o, 

equations 


where L = /(*, y, z) - Ag(x, y, z). 

Similarly, suppose we want to maximize or minimize/(x,y, z) subject to two 
constraints g(x, y, z) = 0 and h(x, y, z) = 0. Each constraint defines a surface, and 
in general these two surfaces have a curve of intersection. As before, a point Po 
where/(x, y, z) has a maximum or minimum value on this curve is a point where 
a level surface of/is tangent to the curve, that is, a point where grad/is normal 
to the curve (Fig. 19.18). But the vectors grad g and grad h determine the nor¬ 
mal plane to the curve at Po, and since grad/lies in this plane, there must be 
scalars A and /jl (two Lagrange multipliers this time) with the property that 

grad / = A grad g + fi grad h. 


(This argument assumes that grad g ^ 0 and grad h ^ 0, and that these vectors 
are not parallel.) Just as before, this vector equation and the two constraint equa¬ 
tions are easily seen to be equivalent to the following five equations in five un¬ 
knowns: 






where L =f(x , y, z) — Ag(x, y, z) — fxh(x, y, z). 

We illustrate these methods with two examples. 




Example 2 Find the point on the plane x + 2y + 3z = 6 that is closest to the 
origin. 


Solution We want to minimize the distance Vx 2 + y 2 + z 2 subject to the con¬ 
straint x -h 2 y + 3z — 6 = 0. If the distance is a minimum, its square is a mini- 
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mum, so we simplify the calculations a bit by minimizing x 2 + y 2 + z 2 with the 
same constraint. Let 


L = x 2 + y 2 + z 2 ~ A(x + 2y + 3z — 6 ). 
Then the equations we must solve are 


3 L 

dx 

dL 

dL 

dz 

dL 
3A 


2x - A = 0, 

2y - 2A = 0, 

2z - 3A = 0, 

— (x + 2y + 3z - 6 ) = 0. 


Substituting the values for x, y, and z from the first three equations into the fourth 
gives 

7 A + 2A + § A = 6 or -y-A = 6 or A = f. 

It now follows that x = f, y = f, and z = f , so the desired point is (f, f, f). 


Example 3 Find the point on the line of intersection of the planes x + y + z = 
1 and 3x + 2y + z = 6 that is closest to the origin. 

This time we want to minimize x 2 + y 2 + z 2 subject to the two constraints x + 
y + z — 1 = 0 and 3x + 2y + z — 6 = 0. If we write 

L = x 2 + y 2 + z 2 - A(x + y + z - 1) - /jl(3x + 2 y + z - 6), 

then our equations are 

= 2x — A — 3/x = 0, 


y— = 2y — A — 2ijl = 0, 
dy 

y— = 2z — A — f± = 0, 
az 


= -(x + y + z - 1 ) = 0 , 

4^-= -(3a: + 2y + z - 6) = 0. 
aix 


The first three equations give 


x = j(A + 3 /ll), y = {(A + 2/x), z = {(A + ju,). 


When these expressions are substituted in the fourth and fifth equations and the 
results are simplified, we get 

3A + 6(jl = 2, 

3A + 7/x = 6 , 
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so /x = 4 and A = — 3 . These values give jc = j, y = y, z — —j, so the desired 

• 4. • /7 1 5x 

point is (j, y, —). 


Remark 1 In economics, Lagrange multipliers are used to analyze the problem 
of maximizing the total production of a manufacturing firm subject to the con¬ 
straint of fixed available resources. For example, let 

P = f(x, y ) = Ax a + /3 = 1, 

be the production (measured in dollars) resulting from x units of capital and y 
units of labor. Then this function—known to economists as the Cobb-Douglas 
production function —is homogeneous of degree 1 in the sense explained in Ex¬ 
ample 2 of Section 19.6. If the cost of each unit of capital is a dollars, and of 
each unit of labor is b dollars, and if a total of c dollars is available to cover the 
combined costs of capital and labor, then we want to maximize the production 
P = /(;c, y) subject to the constraint ax + by = c. In Problem 23 we ask students 
to show that production is maximized when x = ad a and y = fic/bf 

Remark 2 At the beginning of this section we said that Lagrange multipliers 
also have applications to differential geometry and advanced theoretical me¬ 
chanics. These applications are too complicated to describe here, but the details 
can be found on pp. 521-523 and 529 of the present writer’s text, Differential 
Equations , 2nd ed. (McGraw-Hill, 1991). 


*For further details, interested students can look up Cobb-Douglas production functions in the in¬ 
dexes of the books mentioned in the first footnote of Section 19.6. 


PROBLEMS 

Solve all of the following problems by Lagrange multipliers. 

1 A rectangle with sides parallel to the axes is inscribed 
in the region bounded by the axes and the line x + 2 y = 
2. Find the maximum area of this rectangle. 

2 Find the rectangle of maximum perimeter (with sides 
parallel to the axes) that can be inscribed in the ellipse 
X 2 + 4y 2 = 4. 

3 Find the rectangle of maximum area (with sides paral¬ 
lel to the axes) that can be inscribed in the ellipse x 2 + 
4y 2 = 4. 

*4 On each of the following curves, find the points that are 
closest to the origin and those that are farthest from the 
origin: 

(a) x 2 + xy + y 2 = 3; (b) x 4 + 3xy + y 4 = 2. 

5 If a cylinder has fixed volume Vo, find the relation be¬ 
tween the height h and the radius of the base r that min¬ 
imizes the surface area. 

5 Find the maximum and minimum values of f(x, y) = 
2x 2 + y + y 2 on the circle x 2 + y 2 = 1 . 

7 Find the maximum and minimum values of f{x, y) = 
x 2 — xy + y 2 on the circle x 2 + y 2 = 1. 


g Find the ellipse x 2 /a 2 + y 2 /b 2 = 1 that passes through 
the point (4, 1) and has the smallest area. Hint: The area 
of this ellipse is 7 Tab. 

*9 Find the ellipsoid x 2 /a 2 + y 2 /b 2 + z 2 lc 2 = 1 that passes 
through the point (1,2,3) and has the smallest volume. 
Hint: The volume of this ellipsoid is rabc. 

JO Find the maximum value of /(j c, y, z) = 2x + 2y — z 
on the sphere x 2 + y 2 + z 2 = 4. 

j j Find the minimum value of/(x, y, z) = x 2 + 2y 2 + 3z 2 
on the plane x — y - z = 1 . 

12 Find the maximum value of f(x,y , z)=*+y + zon 
the ellipsoid x 2 /a 2 + y 2 /b 2 + z 2 /c 2 = 1 . 

13 Find the maximum volume of a rectangular box if the 
sum of the lengths of its edges is 12 a. 

14 Find the maximum volume of a rectangular box if the 
sum of the areas of its faces is 6 a 2 . 

15 (a) Show that of all triangles inscribed in a given cir¬ 

cle, the equilateral triangle has the maximum 
perimeter. Hint: If a is the radius of the circle and 
a , /3, y are the central angles subtending the three 
sides, what is the perimeter? 
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(b) In part (a), show that the inscribed equilateral tri¬ 
angle also has the maximum area. 

16 Find the point on the line of intersection of the planes 
x + 2y + z = 1 and —3x — y + 2z = 4 that is closest 
to the origin. 

17 (a) Find the point on the plane ax + by + cz + d = 0 

that is closest to the origin, and use this informa¬ 
tion to write down a formula for the distance from 
the origin to the plane. 

(b) Adapt the method used in part (a) to show that the 
distance from an arbitrary point (jcq, yo, zo) to the 
given plane is 

\axp + byp + czp + d\ 

Va 2 + b 2 +c 2 

18 (a) Show that the triangle with the greatest area A for 

a given perimeter is equilateral. Hint: If ;c, y, z are 
the sides, then A = vs{s — ;c)(s - y)(s — z), where 
2 s = x + y + z. 

(b) Show that the triangle with the smallest perimeter 
for a given area is equilateral. 

19 If the sum of n positive numbers x\, X 2 , . . . , x n has a 
Fixed value s, show that their product * 1 X 2 • • • x n has 
s n /n n as its maximum value, and conclude from this that 
the geometric mean of n positive numbers can never ex¬ 
ceed their arithmetic mean: 


^ Xi 2 = 1 and ^ y , 2 = 1 . 

<= 1 /= 1 

(b) Use part (a) to prove that for any numbers a\, a 2 , 
.. . , a n and b\, b 2 ,. .. , b n . 



The inequality in (b) is an important fact in higher math¬ 
ematics called the Schwarz inequality. 

*21 Use the method of Problem 20 1 o establish Holder’s in¬ 
equality: If 1/p + Mq— 1 and the a ?s and bf s are non¬ 
negative numbers, then 



Observe that when p = q = 2, Holder’s inequality re¬ 
duces to the Schwarz inequality. 

22 Refer back to Example 4 in Section 4.4, and the nota¬ 
tion in Fig. 4.28, to obtain Snell’s law of refraction by 
minimizing the total time of travel, 


S/X1X2 ■ • • X n 2= 


X\ + x 2 + • • • + x n 
n 


T = 


a 

v a cos a 


+ 


b 

v w cos /3’ 


*20 (a) Find the maximum value of subject to the constraints tan a + b tan j3 = a constant. 

„ 23 Show that to maximize the Cobb-Douglas production 

^ Xj >7 function P =/(*, y) = Ax a yP (a + (3 = 1) subject to 

i= 1 the constraint of fixed total costs, ax + by = c, we must 

with the constraints put * = ac!a and y = (3c/b. 


19.9 

(OPTIONAL) LAPLACE’S 
EQUATION, THE HEAT 
EQUATION, AND THE 
WAVE EQUATION. 
LAPLACE AND FOURIER 


A very large part of mathematical physics is concerned with three classic partial 
differential equations: Laplace’s equation. 


the heat equation, 


and the wave equation, 


3 2 w 

d 2 w 

d 2 w 

f\. 

a? 

+ W 

+ 1?“ 

o. 

(d 2 W 

d 2 w 

d 2 w\ 

dw 

[dx 2 


9 z 2 ) 

~17’ 

d 2 w 

d 2 w 

9 2 w\ _ 

9 2 w 

dx 2 

+ a 7 

+ 1?J‘ 

= W' 


(1) 


( 2 ) 


(3) 


As the notation indicates, in (2) and (3) the variable w is understood to be a 
function of the time t and the three space coordinates x, y, z of a point P, and 
in (1) w depends only on t, y, z and is independent of t. The quantity a is a 
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constant. Each of our three equations also has simpler two- and one-dimensional 
versions, depending on whether two space coordinates are present, or only one. 
Thus, 


8 2 w 3 2 w _ 

a * 2 ay 2 ~ u 


(4) 


is the two-dimensional Laplace equation, and 


2 a 2 w aw 


and 


2 a 2 w 

a^ 


a 2 w 

~df 


are the one-dimensional heat equation and wave equation. 

A full study of these equations can occupy years, because their physical mean¬ 
ing is extraordinarily rich, and also much concentrated thought is necessary to 
master the various branches of advanced mathematics that are needed to solve 
and interpret them. In this section we consider several aspects of these equations 
that do not require too much technical background. 


LAPLACE’S EQUATION 


If a number of particles of masses m \, m 2 ,... , m n , attracting according to the 
inverse square law of gravitation, are placed at points Pi, P 2 , . . . , P n , then the 
potential due to these particles at any point P (that is, the work done against their 
attractive forces in moving a unit mass from P to an infinite distance) is 


Gm\ + Cm 2 + 


PP 1 


PP 2 


Gm n 
PP n ’ 


(5) 


where G is the gravitational constant.* If the points P, Pi, P 2 ,... , P n have rec¬ 
tangular coordinates (x, y, z), (x\,y h zi), (x 2 ,y 2 ,Z 2 ), • • • , (x n ,y n ,z n ), so that 

PPi = V(* - x\) 2 + (y - >’i) 2 + (z- Zl) 2 , 


with similar expressions for the other distances, then it is quite easy to verify that 
the potential w satisfies Laplace’s equation (1). This equation does not involve 
either the particular masses or the coordinates of the points at which they are lo¬ 
cated, so it is satisfied by the potential produced in empty space by an arbitrary 
discrete or continuous distribution of mass. 

The function w defined by (5) is called a gravitational potential If we work 
instead with electrically charged particles of charges q\, q 2 , ... , q n , then their 
electrostatic potential has the same form as (5) with the m’s replaced by q's and 
G by Coulomb’s constant, so it also satisfies Laplace’s equation. In fact, this 
equation has such a wide variety of applications that its study is a branch of math¬ 
ematics in its own right, known as potential theory. 


THE HEAT EQUATION 

When we study the flow of heat in thermally conducting bodies, we encounter 
an entirely different type of problem leading to a partial differential equation. 


'"See Example 2 in Section 7.7. In this example we show that if two particles of masses M and m are 
separated by a distance a, then the work done in separating them to an infinite distance is GMm/a. 
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In the interior of a body where heat is flowing from one region to another, the 
temperature generally varies from point to point at any one time, and from time 
to time at any one point. Thus, the temperature w is a function of the space co¬ 
ordinates jc, y, z and the time r, say w = /(;t, y, z, t). The precise form of this func¬ 
tion naturally depends on the shape of the body, the thermal characteristics of its 
material, the initial distribution of temperature, and the conditions maintained on 
the surface of the body. The French physicist-mathematician Fourier studied this 
problem in his classic treatise of 1822, Theorie Analytique de la Chaleur (Ana¬ 
lytic Theory of Heat). He used physical principles to show that the temperature 
function w must satisfy the heat equation (2).* We shall retrace his reasoning in 
a simple one-dimensional situation, and thereby derive the one-dimensional heat 
equation. 

The following are the physical principles that will be needed. 

(a) Heat flows in the direction of decreasing temperature, that is, from hot re¬ 
gions to cold regions. 

(b) The rate at which heat flows across an area is proportional to the area and 
to the rate of change of temperature with respect to distance in a direction 
perpendicular to the area. (This proportionality factor is denoted by k and 
called the thermal conductivity of the substance.) 

(c) The quantity of heat gained or lost by a body when its temperature changes, 
that is, the change in its thermal energy, is proportional to the mass of the 
body and to the change of temperature. (This proportionality factor is de¬ 
noted by c and called the specific heat of the substance.) 



Figure 19.19 


We now consider the flow of heat in a thin cylindrical rod of cross-sectional 
area A (Fig. 19.19) whose lateral surface is perfectly insulated so that no heat 
flows through it. This use of the word “thin” means that the temperature is as¬ 
sumed to be uniform on any cross section, and is therefore a function only of the 
time and the position of the cross section, say w =/(;c, t). We examine the rate 
of change of the heat contained in a thin slice of the rod between the positions 
x and x + Ax 

If p is the density of the rod, that is, its mass per unit volume, then the mass 
of the slice is 


Am = pA Ax. 


Furthermore, if Aw is the temperature change at the point jc in a small time in¬ 
terval A t, then (c) tells us that the quantity of heat stored in the slice in this time 
interval is 

AH = c Am Aw = cp A Ax Aw, 


so the rate at which heat is being stored is approximately 


AH 

At 


cp A Ax 


Avr 
Ar' 


( 6 ) 


We assume that no heat is generated inside the slice—for instance, by chemical 
or electrical processes — so that the slice gains heat only by means of the flow 


*The same partial differential equation also describes a more general class of diffusion processes, 
and is sometimes called the diffusion equation. 
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of heat through its faces. By (b) the rate at which heat flows into the slice through 
the left face is 



X 


The negative sign here is chosen in accordance with (a), so that this quantity will 
be positive if dw/dx is negative. Similarly, the rate at which heat flows into the 
slice through the right face is 





so the total rate at which heat flows into the slice is 


kA^- 

OX 


kA 


3w 

3* 


If we equate the expressions (6) and (7), the result is 


kA 


3w 

dx 


-kA^ 

x + Aa OX 


= cp A Ax 


Aw 


(7) 


or 


k [ 9w/3.x| ;c +Aa: ~ dvr/d-yl* 
cp Ax 


Aw 

~a7' 


Finally, by letting Ax and At —> 0 we obtain the desired equation, 

9 3 2 w 3w 

a?““a? 


where a 2 = k/cp. This is the physical reasoning that leads to the one-dimensional 
heat equation. The three-dimensional equation (2) can be derived in essentially 
the same way. 


THE WAVE EQUATION 


All phenomena of wave propagation, for example, of light or sound or radio 
waves, are governed by the wave equation (3). We shall consider the simple case 
of a one-dimensional wave described by the one-dimensional wave equation 


, a 2 w = a 2 w 

dx 2 dt 2 


( 8 ) 


Such a wave involves some property w = f(x, t), such as the position of a parti¬ 
cle, the intensity of an electric field, or the pressure in a column of air, that de¬ 
pends not only on the position jc but also on the time t. 

In order to understand the connection between waves and equation (8), we 
consider a function w = F(x — at). At t = 0, it defines the curve w = F(x ), and 
at any later time t = 1 1 , it defines the curve w = F(x - at\). It is easy to see that 
these curves are identical except that the latter is translated to the right through 
a distance at i, and therefore with velocity 
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This shows that the function w = F(x — at) represents a traveling wave that moves 
to the right with velocity a , as suggested in Fig. 19.20. If we assume that w = 
F(u) has a second derivative, then by the chain rule applied to w = F(u) where 
u = x — at, we have 




dw 

3r 


= F'(u)-(-a) = ~aF\u\ 


B 2 w 

dx 2 


= F "(u), 


3 2 w 

"aJ 5 " 


— aF"(w) ■ (“«) = a 2 F"(u). 


It is clear from this that w = F(x - at) satisfies the one-dimensional wave equa¬ 
tion (8). 

Similarly, the function w = G(x + at) represents a traveling wave that moves 
to the left with velocity a , and it is equally easy to show that this function is a 
solution of (8). By the linearity of differentiation, it follows that the sum 

w = F(x — at) + G(x + at) (9) 


is also a solution. In fact, it can be shown (see Problem 8) that if F and G are 
arbitrary twice-differentiable functions, then (9) is the general solution of (8), 
in the sense that every solution of (8) has the form (9). It is fairly clear that the 
function (9) represents the most general one-dimensional wave, and this result 
confirms it. 


NOTE ON LAPLACE 

Pierre Simon de Laplace (1749-1827) was 
a French mathematician and theoretical astronomer who was 
so famous in his own time that he was known as the New¬ 
ton of France. His main scientific interests throughout his 
life were celestial mechanics and the theory of probability. 

At the age of 24 he was already deeply engaged in the 
detailed application of Newton’s law of gravitation to the 
solar system as a whole, in which the planets and their satel¬ 
lites are not governed by the sun alone but interact with one 
another in a bewildering variety of ways. Even Newton had 
been of the opinion that divine intervention would occa¬ 
sionally be needed to prevent this complex mechanism from 
degenerating into chaos. Laplace decided to seek reassur¬ 


ance elsewhere, and succeeded in proving that the ideal so¬ 
lar system of mathematics is a stable dynamical system that 
will endure unchanged for all time. This achievement was 
only one of the long series of triumphs recorded in his mon¬ 
umental treatise Mecanique Celeste (published in five vol¬ 
umes from 1799 to 1825), which summed up the work on 
gravitation of several generations of illustrious mathemati¬ 
cians. Many anecdotes are associated with this work. One 
of the best known describes the occasion on which Napoleon 
tried to get a rise out of Laplace by protesting that he had 
written a huge book on the system of the world without once 
mentioning God as the author of the universe. Laplace is 
supposed to have replied, “Sire, I had no need of that hy- 
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pothesis.” The principal legacy of the Mecanique Celeste to 
later generations lay in Laplace’s wholesale development of 
potential theory, with its far-reaching implications for a 
dozen different branches of physical science ranging from 
gravitation and fluid mechanics to electromagnetism and 
atomic physics. Even though the concept of the potential is 
due to Lagrange, Laplace exploited it so extensively that ever 
since his time the fundamental differential equation of po¬ 
tential theory has been known as Laplace’s equation. 

His other masterpiece was the treatise Theorie Analytique 
des Probability (1812), in which he incorporated his own 


discoveries in probability from the preceding 40 years. This 
book is generally agreed to be the greatest contribution to 
this part of mathematics ever made by one man. In the in¬ 
troduction he says, “At bottom, the theory of probability is 
only common sense reduced to calculation.” This may be 
so, but the following 700 pages of intricate analysis—in 
which he freely used Laplace transforms, generating func¬ 
tions, and many other highly nontrivial tools—has been said 
by some to surpass in complexity even the Mecanique 
Celeste. 


NOTE ON FOURIER 

Jean Baptiste Joseph Fourier (1768-1830), 
an excellent mathematical physicist, was a friend of 
Napoleon (so far as such people have friends) and accom¬ 
panied his master to Egypt in 1798. On his return he be¬ 
came prefect (governor) of the district of Is&re in south¬ 
eastern France, and in this capacity built the first real road 
from Grenoble to Turin. He also befriended the boy Cham- 
pollion, who later deciphered the Rosetta Stone as the first 
long step toward understanding the hieroglyphic writing of 
the ancient Egyptians. 

During these years he worked on the theory of the con¬ 
duction of heat, and in 1822 published his famous Theorie 
Analytique de la Chaleur, in which he made extensive use 
of the series that now bear his name. These series were of 
profound significance in connection with the evolution of 
the concept of a function. The general attitude at that time 
was to call f(x) a function if it could be represented by a 
single expression like a polynomial, a finite combination of 
elementary functions, a power series 2~=o a n* n i or a trigono¬ 
metric series of the form 

^0 + X cos nx 4- b n sin nx). 


If the graph of f(x) were “arbitrary”—for example, a poly¬ 
gonal line with a number of comers and even a few gaps— 
then/(x) would not have been accepted as a genuine func¬ 
tion. Fourier claimed that “arbitrary” graphs can be repre¬ 
sented by trigonometric series and should therefore be 
treated as legitimate functions, and it came as a shock to 
many that he turned out to be right. It was a long time be¬ 
fore these issues were completely clarified, and it was no 
accident that the definition of a function that is now almost 
universally used was first formulated by Dirichlet in 1837 
in a research paper on the theory of Fourier series. Also, the 
classical definition of the definite integral due to Riemann 
was first given in his fundamental paper of 1854 on the sub¬ 
ject of Fourier series. Indeed, many of the most important 
mathematical discoveries of the nineteenth century are di¬ 
rectly linked to the theory of Fourier series, and the appli¬ 
cations of this subject to mathematical physics have been 
scarcely less profound. 

Fourier himself is one of the fortunate few: his name has 
become rooted in all civilized languages as an adjective that 
is well known to physical scientists and mathematicians in 
every part of the world. 



PROBLEMS 


1 (a) Verify that the function 

1 

w = 

V(* - JC]) 2 + (y - >>i) 2 + (z ~ Zi) 2 
satisfies Laplace’s equation (1). 


(b) A differential equation is called linear if the sum of 
two solutions is a solution, and any constant times a 
solution is a solution. Verify that Laplace’s equation 
(1) is linear, and conclude from part (a) that the po¬ 
tential (5) is a solution. 
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2 (a) Determine whether or not the function 

1 

w= , ' . -= 

V(jc - x\) 2 + (y ~y\) 2 

is a solution of Laplace’s equation (4) in two di¬ 
mensions. 

(b) Show that the function w = In [(jc — x\) 2 + (y — 
yi) 2 ] is a solution of Laplace’s equation (4) in two 
dimensions. 

3 Verify that each of the following functions satisfies 
Laplace’s equation (1): 

(a) w = jc 2 + 2y 2 — 3z 2 ; 

(b) w = x 2 — y 2 + 57z; 

(c) w = 4z 3 - 6(jc 2 + y 2 )z; 

(d) w = e~ 2x sin2y + 3z; 

(e) w = e 3x e 4y cos 5z; 

(f) w = £ 13;c sin 12ycos5z. 

*4 If w = f{x, y) is transformed into w = F{r, 6) by the 
equations jc = rcos 0, y = r sin 6 (these are the trans¬ 
formation equations from rectangular to polar coordi¬ 
nates), show that the two-dimensional Laplace equation 

d 2 w d 2 w _ 

al 2 " ay 2 

becomes 

a 2 w 1 1 3 2 w _ 

aJ r + 7'a7 + 

5 Use Problem 4 to show that each of the functions xv\ = 
r n sin nd and W 2 = r n cos satisfies Laplace’s equation 
in two dimensions. 

6 Suppose that a solution w = /(jc, f) of the one-dimen¬ 
sional heat equation 

9 a 2 w 3w 


has the form /(jc, r) = g(x)h(t), that is, is the product of 
a function of x and a function of t. 

(a) Show that a 2 g"(x)h(t) = g(x)h’(t). 

(b) Part (a) implies that 

g'Xx) _ h\t) 
g(x) a 2 h(t ) ’ 

where the left side is a function of jc alone and the 
right side is a function of t alone. Deduce that there 
is a constant A such that g"lg = A and ti/(a 2 h) = A. 

(c) Assume that the constant A in part (b) is negative, 
and can therefore be written in the form A = — k 2 for 
some positive number k. Show that g(jc) = 
ci sin kx + C 2 cos fc c is a solution of the equation 
g"!g — -k 2 for every choice of the constants cj and 
c 2 . 

(d) Show that h(t) = ce~ a2k2t is a solution of the equa¬ 
tion h! / (a 2 h) = —k 2 for every choice of the constant 
c. Thus all of the functions 

w = /(jc, t) = ce~ a2k2{ {c i sin kx + c 2 cos kx) 

are solutions of the heat equation, and any sum of 
such solutions is a solution. 

7 A steady-state solution of the heat equation (2) is one 
that does not depend on t, and in this case the heat equa¬ 
tion reduces to Laplace’s equation (1). Solve the one¬ 
dimensional Laplace equation. 

*8 Use the chain rule to show that under the change of vari¬ 
ables specified by u = jc - at, v = x + at, the equation 

9 d 2 w 8 2 w 3 2 w 

a 1 -r-^r = -r-^- becomes -r—— = 0. 

dx 2 dt 2 dudv 

Hint: See Example 3 in Section 19.6. Use this result to 
show that w = F(x - at) + G(jc + at) is the most gen¬ 
eral solution of the one-dimensional wave equation. 


19.10 

(OPTIONAL) IMPLICIT 
FUNCTIONS 


In Section 3.5 we stated that when we are given an equation 

F{x,y) = 0, 

there usually exists at least one function 

y =/W 


( 1 ) 

( 2 ) 


that “solves” (1), in the sense that (2) reduces (1) to an identity in jc. With the 
idea in mind that y in (1) stands for this function of jc, we then differentiated the 
identity (1) with respect to jc and went on to solve the resulting equation for dy/dx , 
calling the process “implicit differentiation.” For instance, if we have the equa¬ 
tion 

jc 2 y 5 — 2jty +1=0, 


then by differentiating with respect to jc we obtain 


(3) 
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5v 4 ^ + 2xf 


lx^j--2y = V>, 
ax 


(4) 


so 


dy _ 2y - 2xf 
dx 5x 2 y 4 -2x’ 

Most students feel slightly uncomfortable about implicit differentiation, and with 
good reason. For one thing, in this particular case we have no idea whether (3) 
actually defines y as a function of x or not; and if it doesn’t, then the subsequent 
calculation leading to (5) has no meaning at all. Also, the procedure itself is a 
bit clumsy, because it requires us to keep in mind the different roles played by 
the variables x and y. We are now in a position to clarify the meaning of this 
process, and also to give a precise statement of the conditions under which an 
equation of the form (1) defines a differentiable function (2). 

We broaden the discussion slightly, and instead of (1) consider an equation of 
the form 


F(x,y) = c, (6) 

whose graph is a level curve of the function z = F(x, y). For example, the graph 
of 

x 2 +y 2 =l (7) 

is a circle about the origin (Fig. 19.21, left), and this is a level curve of the func¬ 
tion F(x, y) = x 2 + y 2 . Generally, as in this case, the graph of (6) will be some 
sort of curve that is not the graph of a single function. However, even though the 
entire graph of (7) is not the graph of a single function, it is clear that every point 
(x 0 , yo) on this graph with y 0 ^ 0 lies on a portion of the graph that is the graph 
of a function—indeed, of a differentiable function. Specifically, if yo > 0 then 
(x 0 , yo) lies on the graph of the function 

?=/,(*) = Vl x 2 , (8) 

and if yo < 0 then (jc 0 , yo) lies on the graph of the function 

y =h(x) = -Vl -jt 2 . (9) 

Similarly, the graph of (6) might consist of the graphs of two or more differen¬ 
tiable functions y = f(x), as suggested on the right in the figure. 



Figure 19.21 
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PARTIAL DERIVATIVES 


We next point out that the function z = F(x, y) has the constant value c along 
the graph of any such function y = /(jc), 

z = F[xJ(x)] = c. 


As usual, we assume that F(j c, y) has continuous partial derivatives, so it is per¬ 
missible to write 


or equivalently, 


dz _ 3F + dF_ dy_ _ 
dx dx dy dx 


dz 


dy 


— = F x {x,y) + F y (x,y) — = 0. 


dx 


( 10 ) 


The middle term here is just the chain rule evaluation of dz/dx when z = F(jc, y) 
and y is a function of jc, and the result is zero because z is constant as a function 
of jc. If F y (x, y) =£ 0, equation (10) can be solved for dy/dx. 


dy_ = F x (x, y) 
dx F y (. x , y) ’ 


(ID 


In the language of Section 3.5, any differentiable function y = /(jc) with the prop¬ 
erty that 

F[x,f(x)] = c 


is an implicit function defined by (6), and (11) provides a general formula for the 
derivative of such a function. 

If we apply formula (11) to equation (7), where F(j c, y) = jc 2 + y 2 , we obtain 
F x = 2x and F v = 2y, so ~ y i= 0. (12) 

y dx Fy y 

In this case we know from (8) and (9) that (7) actually determines two implicit 
functions y = /(jc), so the calculations (12) are legitimate and apply to either func¬ 
tion as long as we avoid points where y = 0. However, suppose that instead of 
(7) we have one of the equations 

.r 2 + >’ 2 = — 1 or x 2 + y 2 = 0. (13) 


By acting blindly without thinking, we can write down the calculations (12) just 
as before and “find” dy/dx. The difficulty with this is obvious: Since the graph 
of (13) is either empty or consists of a single point, no implicit function y = /(jc) 
exists, and these calculations would be nothing more than a kind of mathemati¬ 
cal doubletalk, which seems to be saying something but really says nothing at 
all. 

In order to avoid committing such nonsense, it is necessary to have definite 
knowledge that implicit functions exist. This is the purpose of the 


Implicit Function Theorem Let F(x, y) have continuous partial derivatives through¬ 
out some neighborhood of a point (*o, yo), and assume that F(x o, yo) = c and F y (: Co, yo) =£ 
0. Then there is an interval I about xq with the property that there exists exactly one 
differentiable function y =/(;c) defined on I such that yo = f{xd) and 

F[xJ(x)] = c. 
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Further, the derivative of this function is given by the formula 

dy = _F± 

dx Fy 

and is therefore continuous. 


It should be understood that this theorem is a purely theoretical statement to 
the effect that the specified implicit function y = f(x) does in fact exist, and it 
has no bearing on the issue of whether a simple formula can be found for this 
function. A proof is given in Appendix A.20. 

Example 1 We consider once more the equation mentioned earlier, 

F(x,y) = x 2 y 5 - 2xy +1=0. (3) 

It is clear that the point (1,1) lies on the graph, so the graph is not empty. Since 
F x = 2xy 5 — 2 y and F y = 5x 2 y 4 — 2x , our theorem guarantees that equation (3) 
determines an implicit function y =/(;c) about any point of the graph where F y — 
5x 2 y 4 — 2x^=0, for instance the point (1, 1). It is instructive to write down equa¬ 
tion (10) for this case, 

(2 xy 5 - 2y) + (5x 2 y 4 -2x)^ = 0, (14) 

and to compare the result with (4), where implicit differentiation is carried out 
by the old method. Equation (14) evidently yields 

dy _ 2 y — 2.n ,s 
dx 5 x 2 y 4 - 2x' 

just as before. 


The simplicity of our present method is even more clearly visible when there 
are three variables in the given equation. 

Thus, suppose an equation F(x, y, z) = c defines a certain implicit function z = 
f(x , y\ and let us find dzJdx in terms of the function F(x , y, z). The equations 

w = F(x, y, z), 

x = x, y = y, z=f(x,y), 
give w as a composite function of jc and y. Also, 

w = F[x,y,f(x,y)\ = c , 


so if we differentiate this with respect to x, the chain rule yields 

dw _ dF dx dFdy dF dz _ dF dF dz _ ~ 

dx dx dx dy dx dz dx dx dz dx 

We therefore obtain 

dz _ dF/dx 
37" ~dF/dz' 


(15) 


and this formula is valid wherever dF/dz A 0. In just the same way, we also have 


dz _ dF/dy 
Ity- ~ dF/dz' 


( 16 ) 
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PARTIAL DERIVATIVES 


As students have surely guessed, there is also an Implicit Function Theorem that 
covers this situation. Briefly, it says that if dF/dz =£ 0 at a point (jco, yo, Zo) on a 
surface F(jc, y, z) = c, then in a neighborhood of this point the surface defines a 
unique implicit function z =/(*, y) such that zo =/Uo> and furthermore the 
partial derivatives of this function are given by (15) and (16). 

Example 2 j t ls eaS y t0 verify that the point (1,2, —1) lies on the graph of the 
equation 

x 2 z + yz 5 + 2*y 3 = 13, (17) 

so this graph is not empty. If the equation defines an implicit function z = /(jc , y) 
in a neighborhood of this point, then we can calculate dz/dx by implicit differ¬ 
entiation in the old way. This means we differentiate (17) implicitly with respect 
to jc, thinking of y as a constant, which gives 

x 2 -p- 4- 2 xz + y • 5z 4 + 2y 3 = 0, 

ox ox 


SO 

dz _ 2 xz + 2 y 3 

~ x 2 + 5yz 4 ‘ 

This procedure is unsatisfactory because we don’t know in the beginning whether 
any such function z = /(jc, y) actually exists—after all, (17) is a fifth-degree equa¬ 
tion in z —and also because in the implicit differentiation each of the three vari¬ 
ables has to be treated in a different way, and it is quite easy to lose track of what 
is going on. Our present ideas provide a much better method. We have 

F(x, y, z) = x 2 z + yz 5 + 2xy 3 , 


so 

= 2 xz + 2y 3 and = x 2 + 5yz 4 . 

ox dz 

It is easy to see that dF/dz = 11 =£ 0 at (1, 2, — 1), so the Implicit Function The¬ 
orem guarantees that z = f(x , y) exists. Also, by (15) we have 

dz _ dF/dx _ 2 xz + 2 y 3 
dx dF/dz x 2 + 5yz 4 ’ 

which avoids the messy implicit differentiation. 


Remark two-variable version of the Implicit Function Theorem enables us 
to complete a long-standing piece of unfinished business. In the earlier chapters 
of this book we gave quite a bit of attention to the important problem of finding 
the inverse function of a given function g(y) = jc, in other words, the problem of 
solving the equation 

F(x,y) = g(y)-x = 0 (18) 

for the variable y. Specifically, this is the way the familiar functions y = lnx, 
y = sin -1 x, and y = tan -1 jc were defined. Each of these inverse functions was 
discussed earlier in an ad hoc but perfectly legitimate way. We are now in a po- 
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sition to draw the general inference that when g(y) has a continuous derivative 
and dF/dy = g\y) =£ 0, then (18) can indeed be solved for y, y = f{x\ and also 
that this function has a continuous derivative given by 

dy _ dF/dx _ — 1 _ 1 

dx dF/dy g'(y ) dx/dy' 

This completes the line of thought that was briefly described in Remark 2 of Sec¬ 
tion 9.5. 


PROBLEMS 

In Problems 1-6, use formula (11) to compute dy/dx. 

1 y 2 — 3x 2 —1=0. 

2 * 6 + 2y 4 = 1. 

3 x sin y = x + y. 

4 sin y + tan y = x 2 4- x 3 . 

5 e xy = 2xy 2 . 

^ e x sin y = e y cos x. 

In Problems 7-10, use formulas (15) and (16) to compute 
dz/dx and dz/dy. 

^ In z = z + 2y - 3x. 

^ tan -1 jc + tan -1 y + tan -1 z = 9. 

9 

* z = xy sin xz. 

^ sin xy + sin yz + sin xz = 1. 


Use formulas (15) and (16) to find the largest value of z 
on the ellipsoid 2x 2 + 3y 2 + z 2 + yz _ xz = 1. 

I 2 The folium of Descartes (Problem 16 in Section 17.1) 
has x 3 + y 3 = 3 axy as its equation. Use formula (11) to 
find the highest point on the loop. 

1^ If F(x, y) has continuous second partial derivatives and 
the equation Fix, y) = c defines y = f(x) as a twice-dif- 
ferentiable function, show that if F y i= 0, 

d 2 y F xx Fy 2 — 2 F^F x F y T F yy F 2 
~dx 2= F 3 ‘ 

1^ Compute d 2 yldx 2 by using Problem 13 if 
(a) x 4 y 5 = 1; (b) e y = x + y. 


CHAPTER 19 REVIEW: DEFINITIONS, METHODS 


Think through the following. 

1 Domain, continuity, and level curves for z = /(;c, y). 

2 Definition and geometric meaning of partial derivatives of 
z =/(*, y)- 

3 Equality of mixed partial derivatives. 

4 Equation of tangent plane to z = fix, y). 


3 Directional derivative and gradient. 

6 The del operator. 

7 The chain rule. 

8 Method of Lagrange multipliers for constrained maxima 
and minima. 
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20.1 

VOLUMES AS ITERATED 
INTEGRALS 


MULTIPLE 

INTEGRALS 


A continuous function/(x, y) of two variables can be integrated over a plane re¬ 
gion R in much the same way that a continuous function of one variable can be 
integrated over an interval. The result is a number called the double integral of 
fix , y) over R and denoted by 

JlfOc. y) dA or II f(x , y) dx dy. 

R R 

A different but closely related concept is that of an iterated (or repeated ) inte¬ 
gral. We discuss iterated integrals in this section, and in the next section return 
to the topic of double integrals and explain what they are and how they are re¬ 
lated to iterated integrals. 

In Section 7.3 we discussed the “method of moving slices” for finding vol¬ 
umes. Thus, if A(x) is the area of the section cut from a solid by a plane per¬ 
pendicular to the x-axis at a distance x from the origin, then the formula 

V= ( b A{x) dx (1) \ 

Ja 

gives the volume of the solid between the planes x = a and x—b. The essence 
of this formula lies in the idea that 

dV = Aix) dx 

is the volume of a thin slice of the solid of thickness dx. The total volume (1) is 
then found by adding together (or integrating) these elements of volume as our 
typical slice sweeps through the complete solid, that is, as x increases from a 
to b. 

However, if the section itself has curved boundaries—as happens in many 
cases—then the determination of A(x) also requires integration. For instance, the 
section shown in Fig. 20.1 extends from the ry-plane z = 0 up to the curved sur¬ 
face z = fix, y). By considering x to be arbitrary but momentarily fixed between 
a and b, we see that the area of this section is 

Aix) = f y f * fix , y) dy, (2) 

■O', (x) 

where y = y\ (jc) and y = y2ix) are the equations of the curves that bound the base 
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z 



Z = fix , y) 


■Aix) 



= y 2 ix) 


y 


Figure 20.1 


on the left and right. To find the total volume V we now insert (2) in (1) and ob¬ 
tain the iterated integral 



(3) 


Students should notice particularly that in (3) we first integrate f(x, y ) with re¬ 
spect to y, holding jc fixed. The limits of integration depend on this fixed but ar¬ 
bitrary value of x, and so does the resulting value of the inner integral. This in¬ 
ner integral is precisely the function A(x) given by (2), which we then integrate 
with respect to x from a to b to obtain the iterated integral (3). To summarize, 
we start with a positive function f(x, y) of two variables; we first “integrate y 
out,” which gives a function of jc alone; and then we “integrate x out,” which 
gives a number—the volume of the solid. 

On the other hand, in some cases it may be more convenient to cut the solid 
by a plane perpendicular to the y-axis and to form the iterated integral in the 
other order, first integrating x and then y, 



(4) 


These two possible orders of integration are suggested in Fig. 20.2, representing 
the base of the solid, with (3) shown on the left and (4) on the right. The iter¬ 
ated integrals (3) and (4) are usually written without brackets, as 


y 


x^iy) <.x <>x 2 {y\ c <,y <d 


y\ (x) <; y <> y 2 ix ), a <x <b 


x = *iO) 



| y =>’iU) 

4- 



1 A 


a 


x b 


x 


Figure 20.2 
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rr- 

Ja Jy 


y 2 (x) 

h'S*) 


f(x, y) dy dx and 


d f* 2 C;y) 

(*,00 


n 


f(x, y) dx dy\ 


however, we can always retain the brackets for additional clarity if we wish to 
do so. The order in which the integrations are carried out (first with respect to 
y and then with respect to jc, or the reverse) is determined by the order in which 
the differentials dx and dy are written in these iterated integrals: We always work 
from the inside out. 


Example 1 Use an iterated integral to find the volume of the tetrahedron bounded 
by the coordinate planes and the plane x + y + z = 1. 



Figure 20.3 


Solution section in the plane jc = a constant is the triangle shown in Fig. 
20.3, with base extending from y = 0 to the line y = 1 — jc. Its area is 

r i —jc ri —jc 

A{x) = J o z dy = J o (1 ~ x ~ y)dy. 

We now find the desired volume by integrating this from jc = 0 to jc = 1, 



The correctness of this result can be verified by elementary geometry, from the 
fact that the volume of any tetrahedron is one-third the area of the base times the 
height. 


Example 2 Determine the region in the jry-plane over which the iterated inte¬ 
gral 

j [ 2 fix, y) dydx 


extends. 



Solution j n the i nn er integral, with jc fixed between -1 and 2, y varies from the 
curve y = j c 2 up to the line y = 4 (see Fig. 20.4). In the second integration jc in¬ 
creases from — 1 to 2. The region is that shown in the figure, and is bounded by 
the curve y = x 2 and the lines y = 4 and jc = — 1. Students should notice partic¬ 
ularly how we determine what the region is by examining the limits of integra¬ 
tion. 


Example 3 Th e iterated integral 


lo & 2ydydx (5) 

extends over a certain region in the jcy-plane. Write an equivalent integral with 
the order of integration reversed, and evaluate both integrals. 
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Solution \y e see that the given integral extends over the region shown in Fig. 
20.5, between the curves y = x 2 and y = x, where 0 < x < 1. With the order of 
integration reversed, y is first held fixed between y = 0 and y = 1, and x increases 
from x = y to x = y ]/2 . The required integral is therefore 

If* 2 y dxdy = f [2 .xy] y y ' a dy = f (2>> 3/2 - 2>> 2 ) dy = -jy. 

The given integral (5) has the same value, 

L‘ £ 2ydydx = 1' iy 2 & dx = f <* 2 -^ 4 ) * = T 

because both iterated integrals give the volume of a certain solid, and this vol¬ 
ume must be the same regardless of how it is calculated. In computational prob¬ 
lems of this kind, we are naturally free to use any methods of integration we wish 
from our past experience—trigonometric substitution, integration by parts, etc. 



PROBLEMS 


Determine the regions over which the iterated integrals in 
Problems 1 and 2 extend. 

rVx 


1 


to £ f(x - y) ^ d y 2 foL f(x ' y) dy dx - 


Evaluate each of the iterated integrals in Problems 3-14. Also 
sketch the region R over which the integral extends. 

3 4 

5 
7 

9 

11 

13 

14 


Jo £ ( 2x + 2>) d y dx- 

L fo 3V > 2 + 9 dx dy - 

J r 7t/2 r cos x 

0 Jo ydydx - 

(3 fin y 

Ji Jo dx dy ‘ 

f 0 f 0 * cos y dx. 

f 2 f 2 * dy dx 
Ji Jx (x + y ) 2 * 

rr 

Jo Jo 


6 

8 

10 

12 


nv**. 

fe y City 

J t e*y dx dy. 

[f^y^dy- 

J f 7r f sin x 

0 Jo 


y 2 dy dx. 


sin (x 4- y ) dx dy. 


In Problems 15-18, write an equivalent iterated integral with 
the order of integration reversed. 


15 io \y ^ dy - 


16 

17 


2-2x 2 


f(x, y) dy dx. 


/■ r 

Jo Jo 

fi le- ^ X ' ^ dy - 


18 Ut^j ( ^ dydx - 

In Problems 19-24, write an equivalent iterated integral with 
the order of integration reversed, and evaluate both integrals. 


19 

fo fv-y 2x3 * dy ■ 20 

[2 C4~x 2 

Jo Jo ^ dy dx - 

21 

IT (5-2 x-y)dydx. 


22 

\ [ dx dy. 23 

J 1 Jin >> 

f 4 f}(y+2) 

LLv^ dxdy - 

24 

rV2 r\4-2x 2 


__ x dy dx. 

Jo J-V4^27 



In Problems 25-28, use iterated integrals to find the volumes 
of the given regions of space. Sketch each region. 

25 The region in the first octant bounded by the coordinate 
planes and the plane 



where a, b, c are positive numbers. 

26 The region in the first octant bounded by the plane 
x + y = 1 and the cylinder z = 1 — x 2 . 

27 The region in the first octant bounded by the plane 
y = x and the cylinder z = 4 — y 2 . 

28 The region in the first octant bounded by the surface 
z = 4 — x — y 2 . 

*29 Use any method to find the volume of the region 
bounded by the surface x 2/3 + y m + z m = a 2/3 . 
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MULTIPLE INTEGRALS 


20.2 

DOUBLE INTEGRALS 
AND ITERATED 
INTEGRALS 


The double integral of a function of two variables is the two-dimensional analog 
of the definite integral of a function of one variable. It is convenient here to call 
this latter type of integral a single integral , in contrast to the term double inte¬ 
gral. 

As we know, the value of the single integral /£/(*) dx is determined by the 
function f(x) and the interval [a, b\ In the case of a double integral, the interval 
[a, b] is replaced by a region R in the xy-plane, and the double integral of 
/(x, y) over R is denoted by the symbol 


If f(x, y) dA. (1) 

R 

The reason for the dA notation will be explained below. 

We recall that in Section 6.4 a single integral was defined as the limit of cer¬ 
tain sums. We now define the double integral (1) in much the same way. 

Consider a continuous function/(x, y ) defined on a region R in the ry-plane. 
It is necessary to assume that R is bounded , in the sense that it can be enclosed 
in a sufficiently large rectangle and doesn’t go off to infinity in any direction; 
otherwise, just as in the case of a single integral where a or b is infinite, the dou¬ 
ble integral will be improper. 

We begin by covering R with a network of lines parallel to the axes, as shown 
in Fig. 20.6, where the distances between consecutive parallel lines are permit¬ 
ted to be equal or unequal. These lines divide the plane into many small rectan¬ 
gles. Some rectangles will lie entirely or partly outside of /?, and these we ig¬ 
nore. Other rectangles will lie entirely inside R, and if there are n of these 
altogether—we assume there is at least one—then we number them in any or¬ 
der from 1 to n, denoting by AA* the area of the A:th rectangle. We now choose 
an arbitrary point fe, y*) in the kth rectangle and form the sum 

n 

X>’*) ^A k . (2) 

k= 1 

Finally, suppose that many more parallel lines are added to produce a network 
that divides the given rectangles into even smaller rectangles, and consider the 
sum (2) corresponding to this finer partition of the plane. If these sums approach 
a unique limit as n becomes infinite and the maximum diagonal of the rectan¬ 
gles (that is, the longest diagonal of any of the rectangles) approaches zero—in- 



Figure 20.6 
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dependent of the choice of dividing lines and the points (x■*, y k ) in the rectangles 
—then the double integral (1) is defined to be this limit: 



(3) 


R 


So far, it appears that the definition (3) differs very little from the corresponding 
definition of a single integral. However, there are certain technical difficulties in 
two dimensions that do not arise in one dimension. For one thing, plane regions 
can be much more complicated than intervals [a, b]. Nevertheless, the existence 
of double integrals can be rigorously proved under assumptions that are general 
enough for all practical purposes. In particular, it is enough to assume that the 
regions we consider contain their boundaries and that these boundaries consist 
of a finite number of smooth curves. 

We shall not attempt a careful theoretical treatment of double integrals. This 
is a difficult subject, and is best left to courses in advanced calculus.* Instead, 
we prefer to emphasize the intuitive meaning of double integrals, and to con¬ 
centrate our attention on their geometric and physical applications. 

As an illustration of this point of view, suppose that z = /(*, y) is the equa¬ 
tion of a surface in xyz-space that lies above the region R, so that/(x, y) > 0 in 
as shown in Fig. 20.7. Then/(;c*,y*) AA* is approximately the volume (height 
times area of base) of the thin column in the figure; the sum (2) is the sum of 
many such volumes and therefore approximates the total volume of the solid un¬ 
der the surface; and the limit (3), which is the double integral 



( 1 ) 


R 

gives the exact volume of this solid. 1 " 


*Even at this level, one needs an advanced calculus course of the traditional kind. For example, see 
Philip Franklin, A Treatise on Advanced Calculus (Wiley, 1940); or Angus E. Taylor, Advanced Cal¬ 
culus (Ginn, 1955). 

f The double integral (1) is actually the volume of a region in three-dimensional space, but it seems 
to be more natural to speak of the volume of a solid. 


z 
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Figure 20.7 
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It is clear that if /(jc, y) has a constant value, say /(jc, y) = c, then 

If fix, y) dA = cA, 

R 

where A is the area of the region R. In particular, if /(x, y) = 1 we have 

JJ dA = A. 

R 

We also point out that in the definition (3) there is no requirement that/(jc, y) 
must be positive. If/(;c, y) takes both positive and negative values, then the dou¬ 
ble integral represents an algebraic volume instead of a geometric volume; that 
is, the volume between the surface z = fix, y) and the jcy-plane counts positively 
when /(;c, y) > 0 and negatively when /(jc, y) < 0. 

Since the area of a rectangle with sides parallel to the axes can be written as 
AA = Ajc Ay, it is reasonable to use 


s 


f(x, y) dx dy 


(4) 



as an alternative notation for the double integral (1). In this form the double in¬ 
tegral resembles an iterated integral, and in fact, as we next explain, when the 
region R has a certain simple shape the double integral (1) is always equal to a 
suitably chosen iterated integral. This equality often misleads students into think¬ 
ing that double integrals are essentially the same as iterated integrals, but they 
are not. We shall say more below about the distinction between these two types 
of integrals. 

A region R is called vertically simple if it can be described by inequalities of 
the form 


a<x<b, yi (x) < y < y 2 U), (5) 

where y = y! (jc) and y = y 2 (jc) are continuous functions on [a, b\ A region of 
this kind is shown in Fig. 20.8. Similarly, a region R is called horizontally sim¬ 
ple if it can be described by inequalities of the form 


c < y < d, jci(y) < x < x 2 {y). 


(6) 



where x = *i(y) and jc = jc 2 (y) are continuous functions on [c, d]. The region in 
Fig. 20.9 has this property. 

The following are the basic facts about the use of iterated integrals to com¬ 
pute double integrals: if R is the vertically simple region given by (5), then 

II fix, y) dA = I Ij 2 ^ f(x, y) dy dx\ (7) 

R ' 

and if R is the horizontally simple region given by (6), then 

U f(x, y ) dA = | j[//U, y) dx dy. (8) 

R C V] 

In addition to their obvious practical value for the computation of double in¬ 
tegrals, these equations also serve to clarify the conceptual distinction be¬ 
tween double integrals and iterated integrals. A double integral is a number 
associated with a function /(jc, y) and a region R , and this number exists and 


Figure 20.9 
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has a meaning independently of any particular method of computing it. On 
the other hand, an iterated integral is a double integral plus a built-in compu¬ 
tational procedure . Thus, every iterated integral is a double integral, but not 
vice versa. 


Example 1 Compute the double integral ///? 2xy dA in two different ways, where 
R is the region bounded by the parabola x = y 2 and the straight line y = x. 


Solution h * s essen ti a l t0 always sketch the region R of integration before try¬ 
ing to evaluate a double integral In this case the region is shown in Fig. 20.10. 
It is clear that R is vertically simple with a = 0, b = 1, yjU) = jc, y 2 (x) = jc 1/2 , 
so by (7) 

II 2xydA = l o Jf 2 xy dy dx = J q [xy 2 ]f n dx 

R 


The region R is also horizontally simple with c = 0, d = 1, x\ (y) = y 2 , jc 2 (y) = 
y, so by (8) 



II 2xy dA = f Q j\ 2 xy dx dy = ^ [x 2 y ] y y 2 dy 



Figure 20.10 


Example 2 Compute ff R ( 1 + 2jc) dA, where R is the region bounded by 
x = y 2 and x — y = 2. 


Solution This region is shown in Fig. 20.11. In order to integrate first with re¬ 
spect to y and then with respect to x, we would need to compute two separate 
integrals, one to the left of the line x = 1 and the other to the right, because the 
limits of the y-integration are different in these two parts of the region: 

If (1 + 2x) dA = II If- (1 + 2x) dy dx + f ff (1 + 2x) dy dx. 


The other order is easier, and yields 


II (1 + 2x) dA = II If 2 (1 + 2x) dx dy = £ [x + x 2 F/ 2 


dy 





Example 2 shows that even when the region R is both vertically and horizon¬ 
tally simple, it may be easier to integrate in one order than in the other, and we 
naturally prefer to do things in the easiest way. Sometimes the choice of the or¬ 
der of integration is determined by the nature of the integrand /(jc, y), for it may 
be difficult—or even impossible—to compute an integral in one order, but easy 
to do so if the order of integration is reversed. 
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Example 3 Compute 




Figure 20.12 


Solution We cannot integrate in this order because / e* 2 dx is not an elemen¬ 
tary function. We therefore try the other order. This requires us to sketch the re¬ 
gion R by examining the limits on the given iterated integral. R is shown in Fig. 
20.12, and in the other order the underlying double integral has the value 


JJ 4e* 2 dA 

R 



dx 


2xe xl dx = e x2 \ = e 
o 


1 . 


PROBLEMS 

In Problems 1-6, use double integrals to find the areas of the 
regions bounded by the given curves and lines. 

1 The parabola x = y 2 and the line y = x — 2. 

2 The parabola y — x — x 2 and the line x + y = 0. 

3 The axes and the line 2x + y = 2a (a > 0). 

4 The y-axis, the line y = 3x, and the line y = 6. 

5 The x-axis, the curve y = e ~ x , and the lines x = 0, 
x = a (a> 0). 

6 The parabolas y = x 2 and y = 2x — x 2 . 

In Problems 7-10, find the volumes above the Jty-plane 
bounded by the given surfaces. 

7 The paraboloid z = x 2 + y 2 and the planes x = ± 1, 

y = ± 1 - 

8 The cylinder x 2 + y 2 = 1 and the plane x + y + z = 2. 

9 The cylinder y = 4 — x 2 and the planes y = 3x, z = 
x + 4. 

10 The cylinder x 2 + y 2 = a 2 and the paraboloid az = 
x 2 + y 2 . 

11 Find the volume of the solid bounded by the coordinate 
planes, the planes x — 2 and y — 5, and the surface 
2 z = xy. 


12 Find the volume of the solid in the first octant bounded 
by the cylinder 4 y — x 2 and the planes x — 0, z — 0, 
y — 4, and x - y + 2z = 2. 

In Problems 13-16, set up a double integral whose value is 

the stated volume, express this double integral in two ways as 

an iterated integral, and evaluate one of these. 

13 The volume under the plane z — 2y and above the first- 
quadrant region bounded by y = 0, ;t = 2, x 2 + y 2 = 16. 

14 The volume under the plane z = x + y and above the 
first-quadrant region inside the ellipse 9x 2 + 4y 2 = 36. 

15 The volume under the cylinder x = z 2 and above the 
region in the ry-plane bounded by x = 0 and y 2 + 
9x = 9. 

16 The volume in the first octant bounded by the cylinder 
z = 4 — y 2 and the planes x = 0, y = 0, z = 0, 3x + 
4y= 12. 

17 Calculate the value of///? ^ dA if R is the first-quadrant 
part of the ring between the circles x 2 + y 2 = a 2 and 
x 2 + y 2 = b 2 , where a < b. Do this two ways, corre¬ 
sponding to the two possible orders of integration. 

18 Compute the double integral in Example 2 in the other 
order, requiring two separate iterated integrals. 


20.3 

PHYSICAL 
APPLICATIONS 
OF DOUBLE 
INTEGRALS 


We have seen that the double integral 

jjf(x,y)dA (1) 

R 

gives the volume of a certain solid if /(x, y) > 0. This integral has many other 
useful interpretations that arise by making special choices of the function/(x, y). 
Before we discuss these, it will be convenient to return to the way of thinking 
about integration that was described and extensively illustrated in Chapter 7. 
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Figure 20.13 Two orders of 
integration. 


The limit-of-sums definition of (1) that was given in Section 20.2 is necessary 
from the point of view of logic and mathematical legitimacy. However, for work¬ 
ing with applications it is better to think of the volume given by (1) as composed 
of infinitely many infinitely thin columns, as suggested in Fig. 20.13a. A typical 
column stands on an infinitely small rectangular element of area dA whose sides 
are dx and dy, so that 

dA = dx dy = dy dx. (2) 

The height of this column is f(x, y), so its volume is 

dV=f(x, y) dA. 

The total volume V is now obtained by adding together—or integrating—all of 
these infinitely small elements of volume, 

v = jj dV = jj f(x, y) dA. (3) 

R 

We understand here that the complete double integral (1) is produced by allow¬ 
ing dA to sweep in any manner over the whole of the region R. In parts ( b ) and 
(c) of Fig. 20.13 we indicate the two ways of calculating (3) as an iterated inte¬ 
gral: in ( b ), we first allow dA to move across R along a thin horizontal strip, cor¬ 
responding to integrating first x and then y; and in (c), we first allow dA to move 
across R along a thin vertical strip, integrating first y and then x. As suggested 
by formula (2), the double integral (3) can be written in either of the forms 

/( /(*. y) dx dy or JJ f(x, y) dy dx, 

R R 

depending on which iterated integral we wish to consider; and to apply these 
ideas to a particular problem, all that remains is to insert suitable limits of inte¬ 
gration and carry out the calculations. 

This description of the intuitive meaning of the double integral (1) expresses 
the essence of the Leibniz approach to integration: to find the whole of a quan¬ 
tity, imagine it to be judiciously divided into a great many small pieces, and then 
add these pieces together. This is the unifying theme of the following applica¬ 
tions, and also of many further developments in the rest of this chapter. And here 
again, as so often before, the superb Leibniz notation almost does our thinking 
for us. 
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In Chapter 11 we discussed the concepts of moment, center of mass, and mo¬ 
ment of inertia for a thin plate of homogeneous material that occupies a given 
region R in the Jty-plane. The word “homogeneous” meant that the density <5 of 
the material (= mass per unit area) was assumed to be constant, that is, to have 
the same value at every point P = (x, y) in R. We are now in a position to allow 
8 to be a function of x and y, 8 = 8(x, y), so that thin plates of varying density 
can be brought within the scope of our methods. 


I. MASS 

If 8 = 8(x, y) is the density of our thin plate, then 5(x, y) dA is the mass con¬ 
tained in the element of area dA , and the total mass of the plate is 

M = If S(x, y) dA. (4) 

R 


II. MOMENT 

The moment of the element of mass 5(x, y) dA with respect to the x-axis is the 
mass multiplied by the “lever arm” y, namely, y5(x, y) dA , and the total moment 
of the plate with respect to the x-axis is 



Mx = If y % x ' ^ dA - 

R 

See Fig. 20.14. Similarly, the total moment with respect to the y-axis is 

M y = JJ x8(x, y) dA. 

R 


III. CENTER OF MASS 

This is the point (x, y) whose coordinates are defined by 

- _ My_ _ Hr x8(x , y) dA 
X M f/ R 8(x t y) dA 

and 


- = m x = Ur y) dA 

M fJ R 8(x, y) dA ' 


(5) 


(6) 


(7) 


( 8 ) 


Physically, this is the point at which the total mass of the plate could be con¬ 
centrated without changing its moment with respect to either axis. When the den¬ 
sity 8 is constant so that the mass of the plate is uniformly distributed, then the 
5’s can be removed from the integrals in (7) and (8) and canceled away. In this 
case the center of mass becomes the geometric center of the region R, and for 
this reason is usually called the centroid. 


IV. MOMENT OF INERTIA 

When the square of the lever arm distance is used instead of its first power [as 
in (5) and (6)], we get the moment of inertia of the plate about the correspond¬ 
ing axis. Thus, the moment of inertia l x about the x-axis is defined by 
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4 = JJ v 2 S(x, v) dA. (9) 

R 

Similarly, the moment of inertia 7 V about the y-axis is 

4 = If x 2 8(x, y) dA. (10) 

R 

Also of interest is the moment of inertia of the plate about the z-axis. This is of¬ 
ten called the polar moment of inertia , and is defined by 

I z = jj r 2 8(x, y) dA, (11) 

R 

where r 2 = x 2 + y 2 . As we explained in Section 11.4, the moment of inertia of 
a body about an axis is its capacity to resist angular acceleration about that 
axis; this quantity plays the same role in rotational motion as mass does in lin¬ 
ear motion. 

Students should explicitly notice that in each of the formulas (4), (5), (6), (9), 
(10), (11) we obtain the total quantity under discussion by adding together—or 
integrating—the “infinitesimal” parts of it associated with the element of area 
dA , as dA sweeps over the region R. 

Example A thin plate of material of variable density occupies the square R whose 
vertices are (0, 0), ( a , 0), (a, a ), (0, a). The density at a point P = (x, y ) is the 
product of the distances from P to the axes, 8 = xy. Find the mass of the plate, 
its center of mass, and its moment of inertia about the Jt-axis. 


Solution A sketch of the situation is shown in Fig. 20.15. We have 


M 


= If SdA= lolo xydydx = lo 

R 

=i° 2 r 

2 Jo 


■xy 


dx 


x dx — — a 4 . 
4 


The ^-coordinate of the center of mass is 




= ^ff x8dA = ffj fjx 2 ydydx 

R 

4 [° T 1 2 2? A 2 P 2J 2 

= ~zx y dx = —j\ x A dx = — a, 

d 4 Jo [2 7 Jo a 2 Jo 3 

and by symmetry we have x = y = ja. The desired moment of inertia is 


4 = 


II y 2SdA = lolo xy3 dydx = l 0 a 

= M 


jxy 4 


dx 


x dx = — a 6 . 

o 


y 

to,*)'* 



( 0 , 0)1 

Figure 20.15 


(x,y) 


It is customary to express the moment of inertia of a body about an axis in terms 
of its total mass A/, which in this case gives 


(a, a) 


x 

(a, 0) 


4 = \Ma 2 . 
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Remark 1 We emphasize that the symbols dA, dx, and dy in formula (2) do not 
designate differentials in the sense discussed in Section 19.4. Instead, they are 
merely notational aids that enable us to write down appropriate double integrals 
directly, without repeatedly going back to the complicated limit-of-sums defini¬ 
tions of these integrals. 

Remark 2 A surprising application of our present ideas is given in the Appen¬ 
dix at the end of this chapter, where Euler’s formula 

J_ - — 

n 2 6 

is obtained by evaluating a certain double integral. 


PROBLEMS 

In Problems 1-8, find the total mass M and the center of mass 
(x, y) of the thin plate of material that lies in the given region 
R and has the given density 8. Use symmetry wherever pos¬ 
sible to simplify calculations. 

1 R is the square with vertices (0, 0), ( a , 0), ( a, a ), (0, a)\ 
8 = x + y. 

2 R is the first-quadrant region bounded by the axes and 
the circle x 2 + y 2 = 1; 8 = xy. 

3 R is the region bounded by the parabola x = y 2 and the 
line x = 4; 8 = x. 

4 R is the region bounded by the axes and the line x + y = 
a, 8 = x 2 + y 2 . 

5 R is the region bounded by x = 0 and the right half of 
the circle x 2 + y 2 = a 2 \ 8 = x. 

6 R is the region bounded by the parabola y = x 2 and the 
line y = r, 8 = Vx. 

7 R is the region between y = sin x and the *-axis from 
x = 0 to x = 7 r, 8 — x. 


8 R is the region bounded by the parabola y = x 2 and the 
line y = x 2\ 8 = x 2 . 

9 Find the moment of inertia I x for the square plate con¬ 
sidered in the text if the density 8 is constant. 

10 Show that I z = I x + I y . Use this and the result of Prob¬ 
lem 9 to find the moment of inertia of a uniform (con¬ 
stant density) cube of edge a and mass M about one of 
its edges. 

11 If the density 8 is constant, find the moment of inertia l x 
of the thin triangular plate bounded by the line x + y = 
a and the axes x = 0, y = 0. 

12 Solve Problem 11 for the triangular plate bounded by the 
lines x + y = a, x = a, y = a. 

13 Solve Problem 11 if the density is 8 = xy. 

14 Find the polar moment of inertia l z of the circular plate 
bounded by x 2 + y 2 = a 2 if the density 8 is constant. 


4 It is often more convenient to describe the boundaries of a region by using po¬ 
lar coordinates r, 6 than by using rectangular coordinates jc, y. In these circum¬ 
stances we can usually save ourselves a lot of work by expressing a double 
DOUBLE INTEGRALS IN int ral 

POLAR COORDINATES 

[Jf(x,y)dA (I) 

R 

in terms of polar coordinates. The integrand is easy to transform by using the 
equations x = r cos 6, y = r sin 6 to write /(;c, y) as a function of r and 6 , 

/(*, y) = f(r cos 6 , r sin 6). 

For example, if f(x, y) = x 2 + y 2 , this becomes (r cos 0) 2 + (r sin 6) 2 = 
r 2 (cos 2 6 + sin 2 6) = r 2 . But what do we do about the element of area dA2 
The answer to this question is suggested by Fig. 20.16. We recall that the el¬ 
ement of area in rectangular coordinates, 

dA = dx dy, 
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is intended to remind us of the small rectangles with sides parallel to the axes 
that were used to define the double integral (1) in Section 20.2. In working with 
polar coordinates it is natural to subdivide the plane in another way, by a series 
of circles with centers at the origin and a series of rays emanating from the ori¬ 
gin. These circles and rays form many small cells that resemble rectangles, as 
shown by the shaded part of the figure. The double integral (1) can now be given 
an equivalent definition by means of a limit-of-sums process that uses these small 
“polar rectangles.” However, we omit the details and use Fig. 20.16 only to sug¬ 
gest the line of thought we should follow, as we now explain. 

The element of area dA = dx dy in rectangular coordinates is the area of the 
small rectangle swept out by an increase dx in x and an increase dy in y (see Fig. 
20.17a). Figure 20.16 suggests the approach to be used with polar coordinates: 
If r increases to r + dr and 0 increases to 0 + dQ (Fig. 20.17 b), then a small po¬ 
lar rectangle is swept out whose sides are dr, the change in r, and r dO .* The area 
of the small polar rectangle is therefore approximately 



dA = (dr)(r dO) = r dr dQ. 


( 2 ) 




Figure 20.17 The polar element of 
area. 


This is the basic formula of this section. It gives the element of area in polar co¬ 
ordinates, and it enables us to write the double integral (1) in polar form, as 


JJ fix, y) dA = JJ f(r cos 6 , r sin 6) r dr dQ. (3) 

R R 

Many of the regions R we deal with are radially simple, in the sense that they 
can be described by inequalities of the form 

a < 0 < j3, r\(Q) < r < r 2 (0). 


Figure 20.18 shows a region of this kind, and also suggests how the figure can 
be used to write the double integral (3) as an iterated integral, 

JJ fix, y) dA = J^ J 2( } f(r cos 0, r sin 0) r dr dQ. 

Here we integrate first r and then 0, working from the inside out as always. 
We visualize the element of area dA as first moving out across R along the in¬ 
dicated radial strip, from the inner curve r = r\(Q) to the outer curve r = r 2 (0). 
The resulting strip is then rotated from 0 = a to 0 = (3 in order to sweep 


’"The second side of this polar rectangle is a short arc of a circle of radius r that is cut off by a cen¬ 
tral angle dQ , and its length s is given by the formula s = r • dQ, because the angle is measured in 
radians. 
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over all of R. Iterated integrals can also be set up in the other order, but these 
are seldom used. 


Example 1 Find the area of the region R enclosed by the cardioid r = 
a{ 1 + cos 0). 


Solution This cardioid is shown in Fig. 20.19, and we find the area by inte¬ 
grating the element of area dA = r dr dO over the region, 


A = JJ dA = JJ r dr dd. 

R R 

For fixed 6 , we allow r to increase from r = 0 to r = a (1 + cos 6). As usual, we 
exploit all available symmetry, so we next allow 0 to increase from 0 to tt and 
obtain the total area by multiplying by 2, 


= 2 J n J a( 

Jo Jo 


a( 1 +cos 8) 


r dr d6 = 



«(l+cos 9) 

do 

o 


= 2 y a 2 ( 1 + cos 8) 2 dd = a 2 fj (1 


+ 2 cos 0 + cos 2 6) dd 




1 + 2 cos 6 + — [1 + cos 


20 ]) 


dd 


= a 2 \ 6 + 2 sin 6 + -j 6 + sin 28 


3 2 
= —- ira L . 

• 2 


This problem can also be solved by the method of Section 16.5, which would 
have started with the third integral in our calculation. However, our present 
method has much greater flexibility. It allows us, for example, to find the cen¬ 
troid of the region R by thinking of it as a thin plate of material of constant den¬ 
sity 8 = 1. It is clear by symmetry that y = 0, and we find x by writing 


.X 




We ask students to complete the details of this calculation in Problem 25. 



Example 2 Derive the formula for the volume of a sphere of radius a by our 
present methods. 


Solution ^ the sphere has center at the origin, its equation is jc 2 4- y 2 + z 2 = 
a 2 or r 2 + z 2 = a 2 , and the equation of the upper hemisphere is z = V a 2 — r 2 . 
By symmetry, we calculate the volume in the first octant (Fig. 20.20) and mul¬ 
tiply by 8. The region R over which we integrate is defined by 0 < 6 < 7r/2 and 
0 < r < a, so 


V=8ffzdA = 8 I”' 2 £ \V - 7 2 r dr dd 

R 

f77-/2 fa 1 , 

= 8 Jo J 0 ~ j (a 2 ~ r 2 ) U2 (-2r dr) dd = 


f TT 

-4 

Jo 


± (a 2 _ 7 * 2)372 


dO 
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Students should notice particularly how the presence of the r in the inner inte¬ 
gral makes this calculation work out smoothly. 


Example 3 As we learned in Section 12.5, the improper integral 

[°° e"* 2 dx 

Jo 

is important in the theory of probability and elsewhere. We shall find its value 
by a clever device that depends on an improper double integral in polar coordi¬ 
nates. Write 


/ = 


r 

Jo 


e y2 dy. 


Since it doesn’t matter what letter we use for the variable of integration, we have 

/2 = (f e ~ x2dx ){i e ~ y2dy } 

By moving the first factor past the second integral sign, this can be written in 
the form 

/2 = f (f dx ) e ~ y2 dy = f (f *) dy 

= fo fo e ~ (x2+y2) dx dy- 


This double integral is extended over the entire first quadrant of the jcy-plane. In 
polar coordinates it becomes 


^ c 7t/ 2 roo r 77/2 

I 2 = e r r dr dd = 

Jo Jo Jo 

so I = ^ Vir or 





f e * 2 dx = V7 ~r. 

h 2 


(4) 


This formula is especially remarkable because it is known that the indefinite 
integral 

J e ~ x2 dx 

is impossible to express as an elementary function.* 


There is a famous story about the nineteenth-century Scottish physicist Lord Kelvin. “Do you know 
what a mathematician is?” Kelvin once asked a class. He stepped to the blackboard and wrote 

e - * 2 dx = Vtt, 

J—oo 

which is clearly equivalent to (4). “A mathematician,” he continued, “is one to whom that is as ob¬ 
vious as twice two makes four is to you.” As a matter of fact, this formula is not obvious, either to 
the present writer or to any of the many mathematicians he has known. The conclusion seems to be 
that Kelvin was both showing off and trying to put down his class in a rather mean-spirited way. 
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PROBLEMS 


In Problems 1-13, use double integrals in polar coordinates 
to find the areas of the indicated regions. 

1 The circle r = a. 

2 The circle r = 2a cos 6. 

3 The region common to the circles r = a and r = 
2 a cos 6. 

4 One loop of r — a cos 26. 

5 The right loop of the lemniscate r 2 = 2 a 2 cos 26. 

6 The region inside the curve r = 2 + sin 36. 

7 The region inside the lemniscate r 2 = 2a 2 cos 26 and 
outside the circle r = a. 

8 The region inside r = tan 6 and between 6=0 and 
6 = 7t/4. 

9 The region inside the cardioid r = a( 1 + cos 6) and out¬ 
side the circle r = a. 

10 The region inside the circle r = a and outside the car¬ 
dioid r = a( 1 + cos 6). 

11 The region inside the cardioid r = 2a(\ + cos 6) and 
outside the circle r = 3 a. 

*12 The region between r = 7t/ 4 and r = tt!2, between r = 
9 and r = ±9 (9 > 0). 

13 The region inside the cardioid r = 1 + cos 6 and to the 
right of the line x = \. 

*14 If R is the region bounded by the lines y = x, y = 0, 
x = 1, evaluate the double integral 


ii 


dx dy 


(1 + x 2 + y 2 ) 


2 \ 3/2 


by changing to polar coordinates. 
15 Evaluate the integral 


2 a fV 2ax-x 2 


n 

Jo Jo 


(x 2 4- y 2 ) dy dx 


by changing to polar coordinates. 

In Problems 16-22, write the given integral in the form 



r dr d6. 


16 

18 

20 

22 


[2 rvTT? 

Jo Jo zdydx - 

r o 

J-i J-vt^p zdxd y- 

n V4-(jc-2) 2 

, 2 dy dx - 

[2 rVIy-y* 

Jo Jo z dx dy 


17 

19 

21 


p rV9-x 2 

Jo J-vrp z dy dx - 

L'L' zdydx - 

V2/2 fVl-y 2 

z dx dy. 


1 ; 


23 A cylindrical hole of radius b is drilled through the cen¬ 
ter of a sphere of radius a. 


(a) Find the volume of the hole. Notice that this for¬ 
mula gives the volume of the sphere when b = a. 

(b) Find the volume of the ring-shaped solid that re¬ 
mains. Express this volume in terms of the height 
h of the ring. Notice the remarkable fact that this 
volume depends only on h, and not on either the ra¬ 
dius a of the sphere or the radius b of the hole. 

*24 Find the centroid of the region enclosed by the loop of 
r = a cos 26 that lies in the first and fourth quadrants. 

25 Find the centroid of the region enclosed by the cardioid 
r = a( 1 + cos 6). 

*26 Find the centroid of the region enclosed by the right 
loop of the lemniscate r 2 = 2 a 2 cos 26. 

27 Find the centroid of the semicircular disk x 2 + y 2 < a 2 , 
y > 0. 

28 Find the volume of the solid cone 0 < z < h(a — r)/a. 

29 Find the volume of the solid under the cone z = 
2a — r whose base is bounded by the cardioid r = 
a( 1 + cos 6). 

30 Find the volume cut out of the sphere x 2 + y 2 + z 2 = 
4 a 2 by the cylinder x 2 + y 2 = 2ax. 

31 For the solid bounded by the xy-plane, the cylinder 
x 2 + y 2 = a 2 , the paraboloid z = b{x 2 + y 2 ) with b > 
0, find (a) the volume, (b) the centroid. 

32 Find the polar moment of inertia l z of the circular plate 
bounded by r = a if the density 8 is constant. (Com¬ 
pare this very easy calculation with the work needed to 
solve the same problem using rectangular coordinates, 
in Problem 14 in Section 20.3.) 

33 Find the polar moment of inertia I z of a thin plate of 
constant density 8 that has the shape of the circle r = 
2 a cos 6. 

34 Solve Problem 33 for a plate that has the shape of the 
cardioid r = a( 1 + cos 6). 

35 A thin plate is bounded by the circle r = a and has den¬ 
sity 8 = a 2 /(a 2 + r 2 ). Find its mass M and polar mo¬ 
ment of inertia l z . 

36 The center of a circle of radius 2 a lies on a circle of ra¬ 
dius a. Find the centroid of the region between the two 
circles. 

37 A thin plate of constant density 8 has the shape of a cir¬ 
cular sector of radius a and central angle 2a. Find the 
moment of inertia about the bisector of the angle. 

38 Find the centroid of the circular sector described in 
Problem 37. Obtain the result of Problem 27 as a spe¬ 
cial case of this. 

*39 Use the fact that /o e~ x2 dx = {Vtt to show that 

(a) \ e~ 2x2 dx = \ V27t; 

Jo 4 

(b) f e~ 3x2 dx = 

Jo 6 
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77-/2 p-tan* 1 r- 

-n— dx = — V 7r; 

cos^ 2 : 2 


e r- 

—~ dx = V7r; 

Vx 

1 


<«r 

mf 

(g) l Vx e~ x dx = j\Vr; 

(h) J — = V^r; 

Jo V-ln x 

(i) V-ln x dx = y Vtt. 

40 Use the method of Example 3 to evaluate 


r°° f°° dx dy 

Jo Jo (1 + x 2 + y 2 ) 2 ' 

41 There is a slight difficulty with the calculation of f 2 
in Example 3, because we have not discussed improper 
double integrals. In this problem we outline a somewhat 
less cavalier approach to formula (4). In Fig. 20.21 we 
show a quadrant of a circle of radius a , which is inside 
a square of side a, which in turn is inside a quadrant 
of a circle of radius V2a. Denote these regions by R\ y 
Ri, R?>' 

(a) Show that 


I! 


e rl dA = -^(1 — e ° 2 ) 


and 


j! e- r2 dA =j( 1 - e~ 2 ° 2 ). 

*3 

(b) Show that 

II e~ r2 dA = lj lj e - {x2+ * 2) dx dy 

=(f ^) 2 

(c) Use (a) and (b) to show that 

f (1 - e-*) < e"* 2 < f 0 - « _2 “ 2 )- 

(d) Use (c) to conclude that 




The definition of a triple integral follows the same pattern of ideas that was used 
to define a double integral in Section 20.2. We shall therefore confine ourselves 
to a very brief explanation. 

A triple integral involves a function /(*, y, z) defined on a three-dimensional 
region R. We divide R into many small rectangular boxes (and parts of boxes) 
by planes parallel to the coordinate planes, and we denote the volume of the kih 
box that lies wholly inside R by AV*. Next, we evaluate the function at a point 
(Xk, yk, Zk ) in the kth box and form the product f(Xk, y *, Zk) AV*. Finally, we form 
the sum of these products over all the boxes that lie inside R , 
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n 


X 

!c= 1 


/(**. yb zk) a v k . 


The triple integral of/(x, y, z) over R is now defined to be the limit of these sums 
as n becomes infinite and the maximum diagonal of the boxes (that is, the longest 
diagonal of any of the boxes) approaches zero, 

[[[ fix, y, Z) dV = lim ^ f(x k , yb Zk) A V k - 
r *=1 


( 1 ) 
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Sometimes we use the alternative notation 


HI fix, y, z) dx dy dz, (2) 

R 

with no implication intended about the order of integration. This arises from the 
fact that since the volume of a box with faces parallel to the coordinate planes 
can be written as AV = Ax Ay Az, we have the element of volume formula 


dV = dx dy dz. 


(3) 



Figure 20.22 suggests the way the triple integral can be formed directly from the 
function /(;t, y, z) and the element of volume dV, in the manner explained in the 
previous two sections: that is, we multiply dV by f(x, y, z) and integrate (or add 
together) the quantities /(;t, y, z) dV as the element of volume dV sweeps over 
the entire region R. As before, this way of thinking is merely a convenient ab¬ 
breviation of the complex limit-of-sums process that constitutes the actual defi¬ 
nition of the triple integral. 

The main theoretical fact is that the triple integral (1) [or (2)] exists if 
f(x, y, z) is continuous and the boundary of R is reasonably well behaved. We 
shall not pursue this issue any further. And the main practical fact is that triple 
integrals can often be calculated as iterated integrals. 

Before we discuss iterated triple integrals, we quickly extend the ideas of Sec¬ 
tion 20.3 to the present context. First, if the region R is thought of as a solid body 
of variable density 8 = 8(x, y, z) [= mass per unit volume], then 8dV is the ele¬ 
ment of mass—that is, the mass contained in the element of volume—and the 
total mass is 


M = HI 8dV. 

R 

Similar considerations lead to formulas for the moments with respect to the var¬ 
ious coordinate planes, denoted by M yz , M xz , and M^\ and also to formulas for 
the moments of inertia about the various axes, denoted by / x , / v , and I z . These 
formulas (see Fig. 20.22) are 

M yz = Iff x 8dV, M xz = Uly SJV, M xy = JjJ z 8dV\ 


lx = III (y 2 + z 2 ) 8dV , ly = III (x 2 + Z 2 ) 8dV , f z = jjj (x 2 + y 2 ) 8dV. 

R R R 

Also, the equations 

— Id yz — M XZ - Mxy 

X ~ ~M' y ~ "aT* Z ” ~M~' 

define the center of mass of the body, or the centroid if 8 is constant. 

Just as we did with double integrals, we usually evaluate triple integrals by it¬ 
eration. For example, if R is described by inequalities of the form 

a<x< b, y j(x) < y < y 2 (x), z\(x, y) < z < Zi(x, y), 


as shown in Fig. 20.23, then 
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fllf(x>y> z) 


dV = 


r[c 


dx. 


We usually omit the parentheses and brackets, and write this in the form 


( b fcOw) 
L Jy } (x ) iz^y) 


f(x, y , z) dz dy dx. 


As always, we integrate from the inside out, hereuitegrating first with respect to 
z, then with respect to y, and finally with respect to x. Other orders of integra¬ 
tion are often possible, and the order we choose in any specific problem is de¬ 
termined by a little foresight and our preference for easy calculations over hard 
ones. 


Example 1 Find the centroid of the tetrahedron bounded by the coordinate planes 
and the plane x + y + z = 1. 

Solution We can treat the tetrahedron (Fig. 20.24) as a solid of density 8 = 1 
so that mass equals volume. By geometry the volume of the tetrahedron is V = 
and z is defined by 

l= ^\\\ zdV - 

R 



Figure 20.23 


0 , 0 , 0 ) 


Figure 20.24 
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If we integrate first z, then y, then x, this means that we must write 


Z = ±ffjzdz dy dx , 

R 

with suitable limits of integration inserted. To find the z-limits we use the indi¬ 
cated equation of the slanting plane and imagine that the element of volume 
shown in the figure—like an elevator car in an elevator shaft—moves up from 
z = 0 to z = 1 - x - y. Next, the resulting column generates a slice by moving 
across the solid from left to right, fromy = 0 to y = 1 — x. And finally, the slice 
moves through the solid from back to front, from x = 0 to jc = 1. Thus, 


1 f 1 fi-x fl-x-y 

1 Jo Jo Jo 


z dz dydx = 




\-x-y 


dy dx 


= 3 Io'Io X(1 -x-y) 2 dydx = 3 f o '[-j 

= Jo 1(1 -* ): 


(1 - * - y ) 3 


dx 


^ dx= --(1 -x) 4 


1 = ]_ 
o 4 ‘ 


By the symmetry of the situation we see that the centroid is the point ({, {). 

We could also have found z by integrating in any other order, for instance, first 
x, then y, then z, 


1 n n-z ri-y~z 
Z= 1 i I Jo z dx dy dz, 

6 

where the limits of integration are determined as they are above, that is, by ex¬ 
amining the figure. Students should verify that this integral gives the same result 
as before. 


Example 2 Use a triple integral to find the volume of the sphere x 2 + y 2 + z 2 = 


Solution The total volume is 8 times the volume in the first octant, so by inte¬ 
grating in the order z, y , x (see Fig. 20.25) we have 


V= 8 



f\ / 'a 2 —x 2 —y 2 
1 dzdydx 


= 8 


fa fVa 2 -x 2 

Jo Jo 


Va 2 — x 2 — y 2 dy dx. 


(4) 


To calculate the inner integral here, we use the method of trigonometric substi¬ 
tution with y = A sin 6, dy = A cos 6 dO to obtain the auxiliary formula 


(A . /— -- _ Ctt/2 ^ 1 a f 7 r/2 

J o VA 2 — y 2 dy = A 2 J q cos 2 6 d6 = — A 2 J q (1 + cos 26) 

= j 2 A2 [ e+ h i] 


dd 


■ sin 26 


71/2 1 

= d- tc4 2 . 
o 4 


With A = V a 2 — x 2 , this enables us to write (4) as 
V = 8 J o v(a 2 — x 2 ) dx = 2 tt 


4 3 

= 3 
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Figure 20.25 


and the calculation is complete. Of course, we are thoroughly familiar with this 
result, which we have already obtained by a number of different methods. Our 
purpose here is to provide another illustration of the technique of triple integra¬ 
tion. 


PROBLEMS 


In Problems 1-10, evaluate the given iterated integral. 


2 

3 

4 

5 

6 

7 

8 
9 

10 


Iff I 8 xYzdzdydx. 

loCIo Xxd ^ d y- 

!o !o io SmJ f dzdydx ' 


n n-y p 2 +>' 2 

Jo Jo Jo 

ITf 

n fi-x [2-x 

Jo Jo Jo x >’ z dzdydx - 

C 2 CX' 4 -z 2 C 4 —y 7 -:?- 

Jo Jo U,- 4 ^ dydz - 

rr 

Jo Jo 


y dz dx dy. 


4 y 

x 3 cos -y e z dz dy dx. 


xyzVx 2 + y 2 + z 2 dx dy dz. 


C2 C4-x 2 -y 

Jo Jo Jo 

I' r £ (x+2z) dz dx dy - 


y dz dx dy. 


11 Change the order of integration by putting suitable lim¬ 
its on the right side: 

l l fo /(*> y. z) dz dy dx = jjf f(x , y, z) dx dy dz. 


14 Evaluate both integrals in Problem 11 if f{x, y, z) = yz. 

15 Change the order of integration by putting suitable lim¬ 
its on the right side: 


>dx 


Cl fV l-x 2 f\-x 2 -y 2 

J-iJ-VT^Jo /U- y. z)dzdyc 

= Iff fix, y, z) dxdy dz. 


16 Same directions as Problem 15: 


'6 C6—x C6 — x—y 


f 0 ’ fix, y, z) dz dy dx 

= ffj fix, y, z) dx dy dz. 


In Problems 17-24, use triple integration to find the volumes 
of the given regions. 

17 The region in the first octant bounded by the cylinder 
x = 4 — y 2 and the planes y = z, x = 0, z = 0. 

18 The region above the ry-plane bounded by the surfaces 
z 2 = 16 y, z 2 = y, y = x, y = 4, and x = 0. 

19 The region bounded by the paraboloids z = x 2 + 9y 2 
and z = 18 - x 2 - 9y 2 . 

20 The region bounded by the paraboloids z = 8 — x 2 — 
y 2 and z = x 2 + 3y 2 . 

21 The region bounded by the ellipsoid 



12 Evaluate both integrals in Problem 11 if/(jc, y, z) = 1 . 

13 Evaluate both integrals in Problem 11 if f(x, y, z) = x. 


22 The region bounded by the cylinder z = 4 — y 2 and the 
paraboloid z = x 2 + 3y 2 . 
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23 The tetrahedron bounded by the coordinate planes and 
the plane 



where a, b, c are positive numbers. 

24 The region bounded by the cylinder x 2 + y 2 = Ax, the 
xy-plane, and the paraboloid 4z = x 2 + y 2 . 

25 The density of a cube is proportional to the square of 
the distance from one corner. Show that the mass is what 
it would be if the density were constant and equal to 
the original density at another comer adjacent to the 
first. 

26 If the density 8 = xy\ find the moment with respect to 
the ry-plane of the part of the sphere x 2 + y 2 + z 2 ^ a 2 
that lies in the first octant. 

27 The cube bounded by the coordinate planes and the 
planes x = a, y = a, z = a has density 8 = cz where c 
is a constant. Find its moment of inertia l z about the 
z-axis. 


*28 Show that 


1 L L cos ( x + y + z)dz d y dx 

n . a . b . c a + b + c 

= 8 sin — sin — sin — cos---. 

2 2 2 2 

*29 Show that the four-dimensional “sphere” x 2 + y 2 + 
z 2 + u 2 = a 2 has volume 

fa fVa 2 -x 2 fVa 2 -x 2 -y 2 fXa 2 -x 2 -y 2 ~z 2 

V= 16 Jo Jo Jo Jo dudzdydx 


Hint: Notice that the inner triple integral is the volume 
of the first octa nt of a three-dimensional sphere of ra¬ 
dius Va 2 — x 2 . 

*30 Use the result of Problem 29 to find the volume of the 
five-dimensional “sphere” jc 2 + y 2 + z 2 + u 2 + v 2 = 


20.6 

CYLINDRICAL 

COORDINATES 


If a solid has axial symmetry—that is, symmetry about a line in space—it is of¬ 
ten convenient to place its axis of symmetry on the z-axis and use cylindrical co¬ 
ordinates r, 6, z (Fig. 20.26) for the calculation of triple integrals. Instead of the 
element of volume in rectangular coordinates, 

dV = dx dy dz. 



Figure 20.26 Cylindrical co¬ 
ordinates. 


we use the element of volume in cylindrical coordinates, 

dV=rdrd6dz . (1) 

It is easy to understand this formula by starting at a point (a; ft z) and giving the 
coordinates small increments dr, dO, dz .. These increments sweep out a small cell 
in space which is approximately a rectangular box with edges r dd, dr, and dz, 
as shown in Fig. 20.27, and dV as given by (1) is simply the product of these 
edges. Triple integrals now have the form 

/// z) ~ jjj /( r cos ft r s i n ft z) r dr d6 dz. 

R R 



Figure 20.27 The cylindrical element 
of volume. 
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We can often calculate such an integral by writing it as an iterated integral, in 
the manner illustrated in the following examples. 

Example 1 Use a triple integral in cylindrical coordinates to find the moment 
of inertia of a uniform solid cylinder of height h, base radius a , and mass M about 
its axis. 


Solution Place the cylinder in the position shown in Fig. 20.28. The word “uni¬ 
form” in this context means that the density 8 is constant. The mass contained 
in dV is 8dV, and the moment of inertia of this mass about the z-axis is r 2 8dV. 
The total moment of inertia of the cylinder about its axis is therefore 

HI r 2 8dV = HI r 2 8 r dr dd dz 

R R 

= 8 \ 27T \° f* r 3 dz dr dO 
Jo Jo Jo 

f 2 tt fa 1 [2nT 

= 8h r 3 dr dd = 8h ■ — a 4 d6 

Jo Jo 4 Jo 

= 8 ■ y TTcdh - J Ma 2 , 


z 



y 


Figure 20.28 


since M — 8 • ira 2 h. That the limits on these integrals are all constants is a con¬ 
sequence of the fact that cylindrical coordinates are perfectly suited to this prob¬ 
lem. 


Example 2 Use a triple integral in cylindrical coordinates to find the volume of 
the sphere x 2 + y 2 + z 2 = a 2 . 


Solution The cylindrical equation of the sphere is r 2 + z 2 = a 2 , so the equation 
of the upper hemisphere is z = va 2 — r 2 . We multiply the volume above the xy- 
plane by 2, and find this volume by integrating in the order z , r, 0 , as suggested 
in Fig. 20.29: 


V= 2 

= 2 


C2tt Ca n 

Jo Jo Jo 


Va^? 


r dz dr dd = 2 




■2\3/2 


r 2 ) 


de = 


f f rV a 2 ~ r 2 dr d6 
Jo Jo 

( 2 it i o 4 

2 — a? d6 = — it a 3 . 

Jo 3 3 


Of course, we obtain the same result as in Example 2 in Section 20.5, but the 
calculation is much easier here because cylindrical coordinates are better than 
rectangular coordinates for working with spheres. 



Example 3 Find the moment of inertia of a uniform solid sphere of radius a and 
mass M about a diameter. 

Solution We may assume that our present sphere occupies the region bounded 
by the sphere r 2 + z 2 — a 2 in Example 2. If the constant density is denoted by 
8, then the moment of inertia about the z-axis is 

















738 


MULTIPLE INTEGRALS 


4 = 2 j*” £ r 2 ■ Sr dz dr dO = 25 j“ r 3 Va 2 - r 2 dr dd 

= 8 • 477 f r 3 \/a 2 — r 2 Jr. 

Jo 

(In the last step here we integrated out of the indicated order for the purpose of 
disposing of the simple ^-integral so that we could concentrate our attention on 
the harder r-integral. Students will become accustomed to this type of shortcut.) 
To evaluate this integral we use the substitution r = a sin </>, dr = a cos dcf) to 
write 


| r 3 Va 2 — r 2 dr = a 5 j sin 3 </> cos 2 </> 

= a 5 j (cos 2 (f> — cos 4 0) sin $ d$ 


= fl 5 f y COS 5 0 — y COS 3 (f )). 


This gives 


I z = 8 • 4mz 5 


since M = 8 - T7ra 3 . 


— COS 5 (j) - y cos 3 </> 


7T/2 Q 9 

= 5 * — 7Tfl 5 = — Mfl 2 , 

0 15 J 


PROBLEMS 


Use cylindrical coordinates to solve the following problems. 

1 Find the volume of the solid bounded above by the pa¬ 
raboloid z = 1 - x 2 - y 2 and below by the ;cy-plane. 

2 Find the mass of the solid in Problem 1 if the density 
is 

(a) proportional to the distance from the ;ty-plane, 8 = 

cz; 

(b) proportional to the distance from the z-axis, 8 = cr; 

(c) proportional to the square of the distance from the 
origin, 8 = c(r 2 + z 2 ). 

3 A uniform solid cone of height h and base radius a rests 
on the ^y-plane with its vertex on the positive z-axis. 
Find its center of mass. 

4 If the mass of the cone in the preceding problem is M, 
find its moment of inertia l z about the z-axis 

(a) by integrating first with respect to z; 

(b) by integrating first with respect to r. 

5 A cylindrical hole of radius b is bored through the cen¬ 
ter of a uniform solid sphere of radius a. If the density 
is denoted by 8, find the mass of the ring-shaped solid 
that remains, and also its moment of inertia about the 
axis of the hole. Notice that this result generalizes the 
result of Example 3. 

6 A wedge is cut from a uniform solid cylinder of radius 


a by a plane tangent to the base and inclined at a 45° 
angle to the base. Find its moment of inertia about the 
axis of the cylinder. 

7 A uniform solid cone has height h , radius of base a, and 
mass M. Find its moment of inertia about an axis 
through the vertex and parallel to the base. Hint: Let 
the cone have its vertex at the origin and its axis on the 
z-axis, and find I x . 

8 A uniform solid cone has height h, radius of base a, and 
mass M. Find its moment of inertia about a diameter of 
the base. Hint: Let the cone have its base in the xy-plane 
and its axis on the z-axis, and find I x . 

9 A uniform solid hemisphere is bounded above by the 
sphere x 2 + y 2 + z 2 = a 2 and below by the ^y-plane. 
Find its center of mass. (The result of this problem is 
a theorem of Archimedes.) 

10 Find the mass of a cylindrical solid of height h and base 
radius a if the density at a point is proportional to the 
distance from the axis of the cylinder. 

11 A cylindrical hole of radius a i s bored through the cen¬ 
ter of a solid sphere of radius 2 a. Find the volume of 
the hole. 

12 Find the volume of the region bounded above by the 
plane z = 2x and below by the paraboloid z = x 2 + y 2 . 
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13 Find the volume of the solid bounded above b y the plane 
z = x and below by the paraboloid z = x 2 + y 2 . 

14 Find the mass of the solid in Problem 13 if the density 
at each point is proportional to the square of the dis¬ 
tance from the z-axis. 

15 Find the volume of the region bounded above by the 
plane z = x + y, below by the ry-plane, and on the sides 
by the cylinder x 2 + y 2 = a 2 and the planes x = a, y = 
a. 

*16 Find the volume of the region bounded above by the 
paraboloid z = x 2 + y 2 , below by the .ry-plane, and on 
the side by the hyperboloid 



17 Find the volume of the region bounded above and be¬ 
low by the sphere x 2 + y 2 + z 2 = 4a 2 and on the side 
by the cylinder (x - a) 2 + y 2 = a 2 . 

*18 If the region i n Problem 17 i s filled with matter o f con¬ 
stant density 6=1, find the moment of inertia of this 
solid about the z-axis. 

19 Find the moment of inertia of a uniform solid cylinder 


of radius a and mass M about a generator. Hint: Place 
the cylinder so that a generator lies on the z-axis. 

20 Find the volume of the region bounded above by the 
sphere x 2 + y 2 + z 2 = 2 a 2 and below by the paraboloid 
az = x 2 + y 2 . 

*21 Find the moment of inertia of a uniform solid sphere of 
radius a and mass M about a tangent line. Hint: Place 
the sphere with its center at the origin and let the tan¬ 
gent line be the line of intersection of the planes x = a, 

y = o. 

22 Find the volume of the region inside the cylinder r = 
a sin 6 which is bounded above by the sphere x 2 + y 2 + 
z 2 = a 2 and below by the upper half of the ellipsoid 
x 2 /a 2 + y^/a 2 + z 2 /b 2 = 1 where b < a. 

23 Find the volume of the region bounded above by the 
sphere x 2 + y 2 + z 2 = a 2 and below by the cone z = r 
cot a. Use this result to find the volume of a hemisphere 
of radius a. 

*24 Find the volume of the spherical segment of height h 
which is cut from a sphere of radius a by a plane at a 
distance a — h from the center. 


Just as cylindrical coordinates help us deal with problems involving symmetry 
about a line, spherical coordinates are designed to fit situations with symmetry 
about a point, as in the case of a solid sphere whose density is proportional to 
the distance from its center. We became acquainted with the spherical coordi¬ 
nates p, </>, 6 (see Fig. 20.30) in Section 18.7. We now put them to use in the cal¬ 
culation of certain triple integrals. 

In order to express a triple integral 


20.7 

SPHERICAL 

COORDINATES. 

GRAVITATIONAL 

ATTRACTION 


jJj/U, y, z) dV 

R 

in spherical coordinates, we need to be able to write x, y, z as functions of p, </>, 
6. This is easy to do by simply looking at Fig. 20.30: 

z = p cos 4 >» 
r — p sin 4>> 
x = p sin 4> cos 6, 
y = p sin 0 sin 6. 

We must now find a formula for the element of volume dV in terms of p, </>, 6. 
To do this we start at a point P = (p, </>, 6) and give small increments dp , d4 >, dd 
to its spherical coordinates. As we see in Fig. 20.31, the displacement of P in the 
p-direction has length dp, that in the (^-direction has length p d(p, and that in the 
0-direction has length p sin (frdd. These three lengths are the edges of the “spheri¬ 
cal box” shown in the figure, so the volume of this box is {dp) (p d(f>) (p sin </> dd) 
and we have 


z 



Figure 20.30 Spherical coordinates. 


dV = p 2 sin 4> dp d<f) dd. 
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Figure 20.31 The spherical element 
of volume. 



To calculate a triple integral in spherical coordinates we therefore write 
JJJ/U z) dV = jjjfip sin </> cos 6 , p sin </> sin 6 , p cos </>)p 2 sin (f> dp dcf) dd. 

R R 

In any particular problem we try to express this as an iterated integral in such a 
way that dV sweeps over the region R in a convenient manner. In most cases the 
nature of the region R will suggest an appropriate order of integration, together 
with corresponding limits of integration. 


Example 1 Use a triple integral in spherical coordinates to find the volume of 
the sphere x 2 + y 2 + z 2 = a 2 . 


Solution The equation of this sphere in spherical coordinates is p = a. We cal¬ 
culate the integral 



Figure 20.32 


V = III dV = HI p 2 sin 4> dp d<t> dd 

R R 

by integrating in the order p, </>, 6. The first integration, as p increases from 0 to 
a, adds the elements of volume dV to give the volume of the “spike” shown in 
Fig. 20.32; the second, as </> increases from 0 to 77, adds the volumes of these 
spikes to give the volume of the wedge in the figure; and the third, as 6 increases 
from 0 to 277, adds the volumes of these wedges around the z-axis to give the 
volume of the entire sphere. The actual calculation is 

v = l l l P 2 sin <{> dp d<t> dd 

= [/ o v^[/; sin ,^[r- 

= a 3 • 2 • 2tt = ^- na 3 , 

3 3 

as expected. This problem is perfectly suited to spherical coordinates, as we see 
from the simplicity of this calculation compared with those given in the corre¬ 
sponding examples in Sections 20.5 and 20.6. 


Example 2 Find the centroid of the region bounded by the sphere p = a and the 
cone </> = a. 
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Solution This region (Fig. 20.33) is shaped like a filled ice cream cone. Its vol¬ 
ume is 

v = l l J“ P 2 sin (t> dp d<t> dG 


1 _ f a 

= ~ a 3 • 2n J o sin $ d(fr 

2 

= — ua 3 {\ — cos a). 


As a check, this gives yraz 3 as the volume of the sphere when a = it. Now for 
the centroid. It is clear by symmetry that x = y = 0. To find z, we must first find 
the moment of the region with respect to the xy-plane, 


M*y = 


III zdV - l o l 0 l 0 (p COS </>) p 2 sin </> dp d(\> d6 

R 

l o sin (f) cos (f) d<& 


= y TTCl 


A 4 • n 

= — 7 tot sm z a. 
4 


Finally, we have 

- Mxy_ 

V 


z = • 


2 7Tfl 3 ( 1 - cos a) 


1 3 

■ sin 2 a = — ^(1 + cos a). 


When a = tt!2 this specializes to z — |«, which is the result of Problem 9 in 
Section 20.6. 


In our next example we discuss an idea with important implications for sev¬ 
eral branches of physical science. 


Example 3 The gravitational attraction of a thin spherical shell. Suppose that 
matter of total mass M is uniformly distributed on the surface of a sphere of ra¬ 
dius a centered at the origin (Fig. 20.34). Show that the gravitational force F ex¬ 
erted by this thin spherical shell on a particle of mass m located at a point (0, 0, 
b ), with b > a, is exactly what it would be if all the mass of the shell were con¬ 
centrated at its center. That is, show that 


\F\ = G 


Mm 


(1) 


where G is the constant of gravitation. 


Solution By symmetry it is clear that the vector F is directed downward, so 
F = F z k where F z is negative. By Fig. 20.31 the element of area on the surface 
of the sphere is 

dA = a 2 sin </> d(f> d6\ (2) 

and since the mass per unit area on the surface is M/4ira 2 , the mass contained in 
dA is 

dM = ^~ sin 4> d<f> dG. 



Figure 20.33 
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Figure 20.34 



z 



Newton’s law of gravitation states that the magnitude of the force this element 
of mass exerts on m (see Fig. 20.34) is 


G 


dM • m 


xv^ 


GMm 
477 W 2 


sin c/> dcf) d6. 


with downward component 

dM • m GMm , ,, ,n 

G - 5 — cos a = — - y cos a sin </> d(p d6. 

W- 47TW z 


We now find the magnitude of the total force the shell exerts on m by integrat¬ 
ing this expression over the surface of the sphere, 

_ ( 2tt f ” GMm 

|F|_ Jo Jo wT COS a sm <}> d<t> d6 


GMm ( 77 1 . , . 

—-— —7 cos a sin <p do. 

2 Jo w 2 ^ ^ 


(3) 


To calculate this integral we change the variable of integration from 0 to w and 
integrate from w = b — atow = b+a (see the figure). The reason for this strat¬ 
egy will become clear as we proceed. To accomplish the necessary transforma¬ 
tion of the integral in (3), we first use the law of cosines to write 

w 2 = a 2 + b 2 — lab cos </>, (4) 


so 


2 w dxv = lab sin d(f) 


or 

± J± w dw 

sin 0 d(p = —-—. (5) 

ab 

To write cos a as a function of w, we use the fact that 

w cos a + a cos (f> = b 


or 


cos a = 


b — a cos (ft 
w 
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With the aid of (4), this becomes 

b — [( a 2 + b 2 — w 2 )/2b] b 2 — a 2 + w 2 

cos a = - =- — -. 

w 2 bw 

When (5) and (6) are substituted in (3), we obtain 

GMm fb+a l / b 2 — a 2 + w 2 \ w dw 


|F| = - 


-r a -L 

Jb—a VV Z 


2 Jb-a w 2 \ 2bw 

GMm [b+a / b 2 — a 2 


ab 


4 ab 2 

The value of the integral here is 

(Jb 2 - a 2 ) V +a 


Cb+a 

Jb-a 


dw. 


+ W 


(6) 


(7) 


= [ — (b — a) + (b + a) + (b + a) — (b — a)] 


= 4a, 


so (7) becomes 


|F| 


GMm 
4 ab 2 


• 4 a = 


G 


Mm 

b 2 


and the proof of (1) is complete. 


The conclusion reached in this example implies one of Newton’s greatest the¬ 
orems in mathematical astronomy: Under the inverse square law of gravitation, 
a uniform solid sphere attracts an outside particle as if its mass were concen¬ 
trated at its center, for such a sphere can be thought of as if it were composed 
of a great many concentric thin spherical shells, like the layers of an onion, and 
each shell attracts in this way. Indeed, our discussion proves even more, namely, 
that the same statement holds for a solid sphere of variable density, provided that 
the density depends only on the distance from the center. Newton’s theorem shows 
that in computing the mutual gravitational attraction of various bodies in the so¬ 
lar system, like the sun, the earth, and the moon, it is legitimate to replace these 
huge bodies by equal point masses—that is, particles—located at their centers. 
It is believed by some historians of science that Newton delayed the publication 
of his theory of the solar system for 20 years until he was able to prove this the¬ 
orem. 


PROBLEMS 


Use spherical coordinates to solve the following problems. 

1 If the region in Example 2 is filled with matter of con¬ 
stant density 8, find the moment of inertia of the result¬ 
ing solid about the z-axis. Use this result to show that 
the moment of inertia of a uniform solid sphere of ra¬ 
dius a and mass M about a diameter is jMa 2 

2 In Example 2, z —» f a as a 0. Explain this, in view of 
the fact that the region approaches a line segment as 
a 0 and the centroid of a line segment is its midpoint. 

3 Find the volume of the torus p = 2a sin </> (see Fig. 
18.42). 

4 If 0 < b < a and 0 < a < tt, find the volume of the re¬ 


gion bounded by the concentric spheres p = b, p = a and 
the cone (f) = a. 

5 Find the centroid of the hemispherical shell 0 < /? < p < 
a, z ^ 0. 

6 Find the moment of inertia about the z-axis of the shell 
in Problem 5 if it is a solid of constant density 8. 

1 A wedge is cut from a solid sphere of radius a by two 
planes that intersect on a diameter. If a is the angle be¬ 
tween the planes, find the volume of the wedge. 

8 Find the mass of a solid sphere of radius a if the density 
at each point equals the distance from the surface. 

*9 Use a triple integral (in spherical coordinates) to verify 
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that the volume of a cone of height h and base radius r 
is \irr 2 h. 

10 If the density of a solid sphere of radius a is proportional 
to the distance from the center, 8 = cp , show that its mass 
is CTra A . 

11 Let n be a nonnegative constant, and consider a solid 
sphere of radius a centered at the origin whose density 
is proportional to the nth power of the distance from the 
center, 8 = cp n . 

(a) Find the moment of inertia of this sphere about the 
z-axis. 

(b) Show how the result obtained in (a) yields the con¬ 
clusion that the moment of inertia, about a diameter, 
of a uniform solid sphere of radius a and mass M is 
\Ma 2 . 

12 In Problem 11, allow the exponent n to be negative and 
determine what restriction must be placed on n if the 
mass of the sphere is to be finite. Hint: Find the mass 
between concentric spheres p = b and p = a with 0 < 
b < a, and then let b —» 0. 

13 Sketch the region bounded by the surface p = a(\ - 
cos </>), and find its volume. 

14 Find the mass of a solid sphere of radius a centered at 
the origin if the density at a point P equals the product 
of the distances from P to the origin and to the z-axis. 

15 Consider a solid sphere of radius a centered at the ori¬ 
gin with variable density 8 = 8(p , </>, 6). 

(a) Set up an iterated integral for the mass M with the 
integrations in the order 6 , </>, p. 

(b) Simplify the integral in (a) as much as possible for 
the special case in which the density is a function of 
p alone, say <5(p, </>, 6) = f{p). 

(c) Show that the formula in (b) can be obtained directly 
by using thin spherical shells, without any use of it¬ 
erated integrals. 

16 Apply formula (2) to find the area of the polar cap on a 
sphere of radius a which is defined by 0 < 4> < and 
use this result to find the total surface area of the sphere. 

17 In Example 3, assume that the particle m lies inside the 
spherical shell, so that b < a, and show that in this case 


the integral in (7) has the value zero. This proves the re¬ 
markable fact that a uniform thin spherical shell of mat¬ 
ter exerts no gravitational force whatever on bodies lo¬ 
cated inside its cavity. Further, the same conclusion is 
also true for any nonthin spherical shell in which matter 
of variable density fills the space between two concen¬ 
tric spheres, provided that the density depends only on 
the distance from their common center. 

18 Assume that the earth is spherical and o f constant den¬ 
sity, and imagine that a small tunnel is bored through the 
center. Neglecting the effect of this removal of matter, 
show that the gravitational attraction of the earth on a 
particle in the tunnel is directly proportional to the dis¬ 
tance from the particle to the center of the earth. Is this 
necessarily true if the density is variable but depends only 
on the distance from the center? 

19 Assume that the region discussed in Example 2 is filled 
with matter of constant density 8, and find the gravita¬ 
tional attraction it exerts on a particle of mass m placed 
at the origin. 

20 If the rounded top is cut off the solid in Problem 19, leav¬ 
ing a cone of height h = a cos a, what now is the grav¬ 
itational attraction exerted on a particle of mass m placed 
at the origin? 

21 Assume that matter of constant density 8 (= mass per 
unit area) is spread over the entire xy-plane and that a 
particle of mass m is located at the point (0, 0, b) on the 
z-axis. Show that the gravitational attraction exerted by 
the planar mass on the particle is given by the following 
improper integral in polar coordinates, 

ff GM8b 

JJ (r 2 + b 2 ) 3 ' 2 r dr d °’ 

where R is the entire xy-plane. Evaluate this integral by 
computing it over a circle of radius a centered at the ori¬ 
gin and then letting a —>Notice the remarkable fact 
that the value of this integral does not depend on b, so 
that the attractive force of the infinite plane on the par¬ 
ticle is independent of the distance from the plane. Why 
is this obvious without calculation? 


20.8 

AREAS OF CURVED 
SURFACES. LEGENDRE’S 
FORMULA 


In Section 7.6 we discussed the problem of finding the area of a surface of rev¬ 
olution. We now consider the area problem for more general curved surfaces, 
specifically, those that have equations of the form 

^ =/(*, y), 

where both partial derivatives f x (x, y) and f y (x, y) are continuous functions. 

The method we describe rests on the simple fact that if two planes intersect at 
an angle y (see Fig. 20.35), then all areas in one plane are multiplied by cos y 
when projected on the other, 


A = S cos y. 
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This is clearly true for the area of a rectangle with one side parallel to the line 
of intersection of the planes, and it follows for other regions by a limiting process. 
In just the same way, we project an element of surface area dS down from the 
given curved surface z = /(a, y) onto an element of area dA in the Ay-plane, as 
shown in Fig. 20.36. Here we have 

dA = dS cos y, 

where y is the angle between the vertical line in the figure and the upward-point¬ 
ing normal to the surface. This equation yields 

dS = — 
cos y 

so the total area of the curved surface is given by the formula 

dA 


-n«-s 


cos y 


(1) 


VfS +fy 


+ 1 


is the upward-pointing unit normal, and therefore cos y is its k-component, 

1 

cos y = ■ 


V /* 2 +/ v 2 + 

This enables us to write (1) in the form 




+/v 2 + 1 dA, 



Figure 20.35 


where R is the region in the Ay-plane that lies under the part of the surface z = 
/(a, y) whose area we wish to find. The element of area dA in Fig. 20.36 is drawn 
without any special shape, because the double integral (1) is sometimes used with 
rectangular coordinates and sometimes with polar coordinates. 

In order to make (1) into a practical tool for actual calculations, we need a for¬ 
mula for cos y. We find this formula from the fact that the vector f x \ + f y j - k 
is normal to the surface, as we saw in Section 19.3. This particular normal vec¬ 
tor points downward, because its k-component is negative. If we reverse the di¬ 
rection and divide by the length, then we see that the vector 

“/J ~fyi + k 



( 2 ) 


which is the basic formula of this section. 


Example 1 Find the area of the upper half of the sphere a 2 + y 2 + z 2 = a 2 (Fig. 
20.37). 

Solution The upper hemisphere is represented by the equation z = 
V a 2 — a 2 - y 2 , so we have 

fx = 


Va 2 - x 2 -y 2 ' 


with a similar formula for f y . The integrand in (2) is therefore 


A _.+_zL 


a 2 — a 2 - y 2 a 2 — x 2 — y 2 


+ 1 


S/a 2 - x 2 — y 2 


so the area of the hemisphere is 


5 = 


[[ , a 

r S/a 2 - x 2 — y 2 


dA , 



(3) 
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where R is the region in the xy-plane bounded by the circle x 2 + y 2 = a 2 . [It is 
worth noticing that in this particular case the figure tells us directly that cos y = 
z/a , so the integrand in (1) is 


1 _ a_ _ _ a _ 

cos y ~ Z ~ Va 2 -x 2 - y 2 

and the integral (3) can be written down at once, without calculation.] We now 
evaluate the integral (3) by introducing polar coordinates, 


' = a f 2 ir f a 

Jo Jo 


r dr d6 

Vfl 2 - r 2 


= a Jo |^-Va 2 — r 2 

= a 2 f dO = 2 7TO 2 . 
Jo 


dO 


This result is in agreement with Archimedes’ formula from elementary geome¬ 
try, which states that the surface area of a sphere of radius a is 47ra 2 . 


Example 2 Find the area of the part of the paraboloid z = x 2 + y 2 that lies in¬ 
side the sphere x 2 + y 2 + z 2 = 6. 


Solution The boundary of the base region R is the projection on the xy-plane 
of the curve of intersection of the two surfaces. See Fig. 20.38. This is most eas¬ 
ily determined by writing the surfaces in cylindrical coordinates, z = r 2 and r 2 + 
z 2 = 6. When z is eliminated, we find that the boundary of R is the circle r 2 = 2 
or r = V2. In this case we have/(jc, y) = x 2 + y 2 , so f x = 2jc and f y = 2 y, and 
therefore the desired surface area is 


1 = Jj V4 jc 2 + 4y 2 + 1 dA. 


Again we carry out the calculation by using polar coordinates, which gives 

C2tt fV2 


•=r f V4^n 

Jo Jo 


r dr dO 


dO 


-2*. ± W 





Figure 20.38 
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Formulas (1) and (2) are the standard formulas of calculus for actually find¬ 
ing the areas of specific curved surfaces. They work well, and we hope they seem 
reasonable to students. Nevertheless, the theory of surface area is very difficult, 
in particular, the problem of giving a fully satisfactory definition of the concept 
itself. This problem has occupied the attention of mathematicians for almost a 
hundred years, and research on these matters continues to this day. Anyone who 
wishes to understand the nature of the difficulty should study the classic exam¬ 
ple of H. A. Schwarz (1890), which jolted the mathematical world of the time 
out of its complacency. Schwarz’s example is a simple and familiar curved sur¬ 
face whose area can be computed in several equally reasonable ways to yield 
wildly different results.* 

*This example is described in many places. See, for instance, p. 204 of D. V. Widder, Advanced Cal¬ 
culus, 2nd ed. (Prentice-Hall, 1961). 


PROBLEMS 

Solve Problems 1 -6 by using the ideas of this section but with¬ 
out integration. 

1 Find the area of the triangle cut from the plane x + 
2 y + 3z = 6 by the coordinate planes. 

2 Find the area above the .xy-plane cut from the cone z 2 = 
x 2 + y 2 by the cylinder x 2 + y 2 = 2 ax. 

3 Find the area cut from the plane x + y + z = 7 by the 
cylinder x 2 + y 2 = a 2 

4 Find the area cut from the plane z = by by the cylinder 
x 2 + y 2 = a 2 . 

5 Find the area of the part of the cone z 2 = x 2 + y 2 that 
lies between the .xy-plane and the plane 2z + y = 3. 
Hint: What is the area of an ellipse? 

6 In Problem 5, find the area of the ellipse in which the 
plane intersects the cone. 

7 Find the area of the part of the sphere x 2 4- y 2 + z 2 = 
a 2 that lies above the xy-plane and inside the cylinder 
x 2 + y 2 = ax. 

8 In Problem 7, find the area of the part of the cylinder 
above the xy-plane that lies inside the sphere. Hint: Find 
J7z ds, where h is the height of the cylinder and ds is 
the element of arc length in the xy-plane. 

9 Find the area cut from the paraboloid z = x 2 + y 2 by 
the plane z = 1. 

10 Find the area of the part of the surface z 2 = 2 xy that 
lies above the .xy-plane and is bounded by the planes 
x = 0, x = 2 and y = 0, y = 1. 

11 Find the area cut from the saddle surface az = x 2 - y 2 
by the cylinder x 2 + y 2 = a 2 . 

12 Find the area of the part of the sphere x 2 + y 2 + z 2 = 
la 2 that lies inside the upper half of the cone z 2 = 
x 2 + y 2 . 

13 U R is any region in the xy-plane, show that the area of 
the part of the paraboloid z = ax 2 -I- by 2 that lies above 
R is equal to the area of the part of the saddle surface 
z = ax 2 - by 2 that lies above (or below) R. Show that 


this statement is also true for the pairs of surfaces z = 
x 2 + y 2 , z = 2xy and z = In (x 2 + y 2 ), z — 2 tan -1 x/y. 

14 Find the area o f the part of the paraboloid z — 4 - x 2 - 
y 2 that lies above the xy-plane. 

* 15 Find the area o f the part of the cylinder z = 1 - x 3 that 
is cut out by the planes y = 0, z = 0, and y = ax where 
a > 0. 

16 Find the area of the part of the cylinder x 2 + y 2 = a 2 
that is cut out by the cylinder y 2 /a 2 + z 2 /b 2 = 1. 

17 Find the area of the part of the cylinder x 2 + z 2 = a 2 
that lies in the first octant and between the planes 
y = 3x and y = 5x. 

18 Find the area of the part of the cylinder y 2 + z 2 = a 2 
that lies inside the cylinder x 2 + y 2 = a 2 . 

19 The cylinder r 2 = 4 cos 26 intersects the xy-plane in a 
lemniscate. Find the area of the part of the paraboloid 
4 z = x 2 + y 2 that lies inside this cylinder. 

*20 In Section 7.6 we used the formula 



to find the area of the surface of revolution obtained 
when the curve y = f(x) is revolved about the .x-axis. 
Show that our new method is consistent with the old, 
by deriving this formula from (2). Hint: The equation 
of the surface of revolution is y 2 + z 2 = f(x) 2 . 

*21 A spherical triangle is the figure on the surface of a 
sphere which is bounded by arcs of three great circles 
(Fig. 20.39). If a is the radius of the sphere, then Le¬ 
gendre's formula t for the area of such a triangle is 


f A. M. Legendre (1752-1833) was an able French mathematician 
who had the bad luck to see most of his life’s work rendered obso¬ 
lete by the discoveries of younger and more brilliant men. In spite 
of this, he retained his amiable and generous disposition. 
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Figure 20.39 A spherical triangle. 


5 = a 2 (a + (3 + y - tt), 

where a, /3, y are the angles between the sides. (The 
quantity a + + y — 7r is called the spherical excess 

of the triangle.) Thus, on a given sphere the area of a 
triangle depends only on its angles. Prove Legendre’s 
formula by the following steps. 

(a) In the special triangle cut from the sphere x 2 + y 2 + 
z 2 = a 2 by the planes y = 0, y = x tan a, z = 


x tan p., show that the angles between the sides are 
a , 7t/2, and /3, where cos /3 = sin a sin pu. Hint: Use 
vectors. 

(b) Show that the projections on the xy-plane of the 
sides of the right triangle in (a) are 0 = 0, 0 = a , 
and r = a/V 1 + tan 2 pi cos 2 6, and that the area of 
this triangle is 


5 = 




sin /.i cos 0 
Vl — sin 2 pi sin 2 6- 


d6. 


(c) Carry out the integration in (b) and thereby show 
that the area of the right triangle is 


a 2 (a + p- yj. 

(d) Complete the proof of Legendre’s formula by di¬ 
viding an arbitrary triangle into two right triangles 
and using (c). 

Apply Legendre’s formula to show that the area of the 
complete sphere is 47ra 2 . Hint: Divide the surface into 
convenient triangles. 


CHAPTER 20 REVIEW: METHODS, FORMULAS 


Think through the following. 

1 Iterated integrals and double integrals: dA = dx dy. 

2 Mass, moment, center of mass, and moment of inertia for 
thin plates of variable density. 

3 Polar coordinates: dA = r dr d6. 

4 Triple integrals: dV = dx dy dz. 


5 Cylindrical coordinates: dV = r dr dO dz. 

6 Spherical coordinates: dV = pi 1 sin 4) dp d4> d6* 

7 dS = = V/ x 2 + // + 1 dA. 

cos y Jy 

*In Appendix A.21 we describe some general ideas that provide a uni¬ 
fied view of the three formulas dA = r dr dd , dV = r dr dd dz, and 
d V = p 2 sin (j) dp d(f) dd. 


APPENDIX: 
EULER’S FORMULA 

Y _L = zi 

BY DOUBLE 
INTEGRATION 


The geometric series 1/(1 — r) = 1 + r + r 2 4- ■ • * enables us to write 

io r rY ■ i: r »+■>>'+/+■■■> * * 


-r 

=r 


1 ' X + jx 2 y + J*y + • ■ •) 


dy 


, + y- + y- + 


9 


y + Y- + y- + 


dy 

1 1 1 
= l+^2+T2 + 


jo 2 2 3 2 

The sum of Euler’s series Si In 2 is therefore the value of the double integral 

f 1 dx dy 


I 


-p p- 

Jo Jo 1 


xy 


We evaluate this integral — and thereby determine the sum of the series—by 
means of a rotation of the coordinate system through the angle 6 = 7t/4. 













APPENDIX: EULER’S FORMULA f X = X BY DOUBLE INTEGRATION 

„-i " 2 6 

If we rotate the xy-system into the wf-system through an arbitrary angle 6, as 
shown in Fig. 20.40, then the transformation equations are 

x — u cos 6 — v sin 6 , 

y = u sin 6 + v cos 6. 

When 6 = tt/4 these equations become 

X = jVl(u - v\ 
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so we have 


y — \\Zl(u + v\ 


xy = ~ (u 2 - v 2 ) and I — xy = 


Figure 20.40 


2 — u 2 + v 2 


(V 2 [V2-u dv du 


/ = 4 \ V2 f“ - dvdu -^ 4 f V2 _ 

Jo Jo 2 — u 2 + J\fii2 J° 2 — u 2 + v 2 

If we denote the integrals on the right by I\ and U, then 

h 



Figure 20.41 


rV2/2 

r r w 

du. 

Jo 

[Jo 2 - M 2 + V 2 

r V2/2 

1 

( v ' 

Jo 

__ tan 

V2 - « 2 

\ V2 — u 2 

cVl /2 

1 / 

V \ 

Jo 

V2 — « 2 l 



du 



du. 


To continue the calculation, we use the substitution 

u = V2 sin 6 , V2 - u 2 = V2 cos 6 , du = V2 cos 6 dO , 

/ V2 sin 0 N 


tan 


= tan 


V2 - m 2 / \V2cos0 


= 6 . 


Then 


/] =4 Jo 


7t/6 1 /— 1 7t/6 -tj-2 

—=-0- V2cos 0 d 0 = 26 2 = 

V2 cos 0 0 IS 


To calculate /2 we write 

f V2 fVi-K dv 

/2_4 Jv 2/2 [Jo 

r V2 


2 - m 2 + 




.4/ 

- 4 / 


— . tan I ,-- 

V2 - W 2 V V2 - M 2 


V2-m 


V2 

V2/2 


V2 - a 

V2 - a 2 lV 2 - m 2 


tan 


->( 
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To continue the calculation, we use the same substitution as before, with the ad¬ 
ditional fact that 


V2 -u \ , 

■ ~ =tan~ ] 

V2 - u 2 ) 


l V2 - V2sin 0 \ 

\ V2 cos 6 / 


= tan 1 


/ 1 - sin 6 \ 
\ cos 6 J 


= tan 1 


I cos 6 \ 
y 1 + sin 0 j 


= tan 1 


/ sin (77/2 - 0 ) \ 

\ 1 + cos (77/2 - 0 ) j 


= tan 1 


2 sin 1(77/2 - 6) cos 1(77/2 - 6) 
2 cos 2 1(77/2 - 6) 


-<f- 


This enables us to write 


h = 4 f^ 2 — ^ ^ 0 ) V 2 cos e de = 4 

U V2cos0\ 4 2 / 


T°~J 02 
4 4 


tt/2 

77/6 



We complete the calculation by putting these results together, 





It is interesting to observe that 


n P P dx dy dz _ y 1 
Jo Jo Jo 1 — xyz “"i n 3 ’ 


so that any person who can evaluate this triple integral will thereby discover the 
sum of the series on the right—which has remained an unsolved problem since 
Euler first raised the question in 1736. 














'-) 'I LINE AND 

Z1 SURFACE 

INTEGRALS. 
GREEN’S THEOREM, 
GAUSS’S THEOREM, AND 

STOKES’ THEOREM 


This chapter brings together into a unified package several topics in multivari¬ 
able calculus that are important for physical science and engineering, as well as 
for mathematics itself. The main focal points of our work are the concepts of line 
integral and surface integral, which provide yet other ways (in addition to dou¬ 
ble and triple integrals) of extending ordinary integration to higher dimensions. 
Line integrals are used, for example, to compute the work done by a variable 
force in moving a particle along a curved path from one point to another. In their 
origin and applications, these integrals are therefore associated with mathemati¬ 
cal physics as much as they are with mathematics. The main result of the first 
part of this chapter (Green’s Theorem) uses partial derivatives to establish a con¬ 
necting link between line integrals and double integrals, and this in turn enables 
us to distinguish those vector fields that have potential energy functions from 
those that do not. Here again, as so often in our earlier work, mathematics and 
physics constitute a seamless fabric in which neither ingredient has much mean¬ 
ing or value without the other. 

Throughout this chapter we assume that the functions under discussion have 
all the continuity and differentiability properties that are needed in any given sit¬ 
uation. 

Our first problem is to formulate a satisfactory concept of work. If we push a 
particle along a straight path with a constant force F (constant in both direction 
and magnitude), then we know that the work done by this force is the product 
of the component of F in the direction of motion and the distance the particle 
moves. It is convenient to use the dot product to write this in the form 

W= FAR, (1) 

where A R i s the vector from the initial position of the particle to its final posi¬ 
tion (Fig. 21.1). Now suppose that the force F is not constant, but instead is a 
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vector function that varies from point to point throughout a certain region of the 
plane, say 

F = ¥(x, y) = Mix, y) i + Nix, y)j. (2) 

Suppose also that this variable force pushes a particle along a smooth curve C 
in the plane (Fig. 21.2), where C has parametric equations 

x = x(t ) and y = y(r), t\ ^ t < t 2 . (3) 

What is the work done by this force as the point of application moves along the 
curve from the initial point A to the final point B? 

Before answering this question, we remark that the vector-valued function (2) 
is usually called a force field. More generally, a vector field in the plane is any 
vector-valued function that associates a vector with each point (x, y) in a certain 
plane region R. In this context a function whose values are numbers (scalars) is 
called a scalar field. For example, the function fix, y) = x 2 y 3 is a scalar field de¬ 
fined on the entire xy-plane. Every scalar field fix , y) gives rise to the corre¬ 
sponding vector field 

V/U, y) = grad f(x, y) = ■— i + j. 

(Remember that the symbol V/ is pronounced “del /”) This is called the gradi¬ 
ent field of/; its intuitive meaning was described in Section 19.5. For the func¬ 
tion just mentioned, we have V/ = 2ry 3 i + 3x 2 y 2 j. Some vector fields are gradi¬ 
ent fields, but most are not. We shall see in the next section that those vector 
fields that are also gradient fields are of special importance. 

We now return to the problem of calculating the work done by the variable 
force 


F = Mix, y )i + N(x, y )j (2) 

along the smooth curve C. This leads to a new kind of integral called a line in¬ 
tegral and denoted by 

j F • dR or | Mix, y) dx + Nix, y) dy. 



We begin the definition by approximating the curve by a polygonal path as shown 
in Fig. 21.3. That is, choose points P 0 = A, P u P 2 , .. . , P n -u P n = B along C 
in this order, let R* be the position vector of P k , and define the n incremental 
vectors shown in the figure by AR* = R *+1 — R*-, where k = 0 , 1 , ... , n — 1, 
If we now denote by F* the value of the vector function F at P k and form the sum 

Xf*-AR*, (4) 

k =0 

then the line integral of F along C is defined to be the limit of sums of this form, 
and we write 

r n — 1 

F • dR = lim Y F* • AR*. (5) 

J C k =0 


In this limit the polygonal paths are understood to approximate the curve C 
more and more closely, in the sense that the number of points of division 
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increases and the maximum length of the incremental vectors approaches 
zero.* 

The idea behind the definition (5) is that F (being continuous) is almost con¬ 
stant along the short path segment AR fo so by formula (1) we see that F* • AR* 
is approximately the work done by F along the corresponding part of the curve, 
and therefore the sum (4) is approximately the work done by F along the entire 
curve C, with the limit (5) giving the exact value of this work. 

A quick intuitive way of constructing the line integral (5) is illustrated in Fig 
21.4. If JR is the element of displacement along C, then the corresponding ele¬ 
ment of work done by F is dW = F • JR. The total work is now obtained by in¬ 
tegrating (or adding together) these elements of work along the entire curve C, 



W = jdW = j F • dR. 


( 6 ) 


For additional insight into the meaning of this formula, we think of the position 
vector R as a function of the arc length s measured from the initial point A. Since 
we know that dRJds is the unit tangent vector T (Section 17.4), we can write 

f F • dR = f F • -^5- ds = f F • T ds. (7) 

J c J c ds J c 


The line integral (6) can therefore be thought of as the integral of the tangential 
component of F along the curve C. It can be seen from (7) that line integrals in¬ 
clude ordinary integrals as special cases; for if the curve C lies along the x-axis 
between x- a and x—b, and if F = f(x) i, then (7) reduces to f%f(x) dx. 

If the variable vector F is given by F = M(x, y)i + N(x, y)j, then since R = 
x\ + yj and JR — dx i + Jyj, the formula for computing the dot product of two 
vectors yields 

F • JR = M(x, y) dx + N(x, y) dy. 

The line integral (6) can therefore be written in the form 

j F • dR = j M(jc, y)dx + N(x, y) dy. 


The parametric representation x = x(t) and y = y(r), t\ < t < for the curve C 
allows us to express everything here in terms of t, 



JR = 


M(x, y) dx + N(x, y) dy 
'c 


n r dx dy 

= { r (x ’y ) dF +mx ’ y) ir_ 


dt. 


This is an ordinary single integral with t as the variable of integration, and it can 
be evaluated in the usual way. 

So much for generalities. We now turn our attention to getting some practice 
in the actual calculation of line integrals. 


*The term line integral for the limit (5) is perhaps unfortunate, because the curve C need not be a 
straight line segment. Curve integral would be more appropriate, but the terminology is well estab¬ 
lished and cannot be changed now. 
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Example 1 Evaluate the line integral 

/ = f x 2 y dx + (x ~ 2y) dy , 

J c 

where C is the segment of the parabola y = x 2 from (0, 0) to (1, 1) (see Fig. 21.5). 


Solution We may parametrize the curve in any way that seems convenient. (It 
is not difficult to show that the value of the line integral does not depend on what 
parametric equations are used, provided that the orientation or direction is kept 
the same.) The simplest parametric representation of this curve is 

x = t, y = t 2 , where 0 < t < 1. 


Here we have dx = dt and dy — It dt, so the line integral is 

1= \'t 2 - t 2 dt + (t - 2t 2 )2t dt 
J o 

= [ 1 [r 4 + 2 1 2 - 4 1 3 ] dt = 


1 - 2 

- 1 5 + - 1 3 - r 4 
5 3 


11 = __L 

0 ” 15 ’ 


To illustrate the fact that the value of the line integral is independent o f the choice 
of parameter, let us use the representation 

x = sin t, y = sin 2 t, where 0 < t < 7t/2. 

This time we have dx = cos t dt and dy = 2 sin t cos t dt, so 

CTTll 

1 = sin 2 t • sin 2 t • cos t dt + (sin t - 2 sin 2 t)2 sin t cos t dt 
J o 

r 7J-/2 

= [sin 4 t + 2 sin 2 t - 4 sin 3 t] cos t dt 
J o 


= | — sin 5 t + — sin 3 t - sin 4 t 


-nil 2 

o - ”7? 


as before. 


Every curve C that we use with line integrals is understood to have a di¬ 
rection, from its initial point to its final point. Even though the value of a 
line integral does not depend on the parameter, it does depend on the direc¬ 
tion. If - C denotes the same curve traversed in the opposite direction, then we 
have 


I F dR = - f F-dR, 

or equivalently, 

J Af dx + N dy = —J M dx + N dy. 

That is, integrating in the opposite direction changes the sign of the integral. This 
can be seen at once from Fig. 21.3 and the definition (5), because the directions 
of all the incremental vectors AR* are reversed. 
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Example 2 Evaluate the line integral. 

/ = f x 2 y dx + (x — 2 y) dy , 

J c 

where C is the straight line segment y = t from (0, 0) to (1, 1). 


Solution This is the same integrand as in Example 1, and the initial and final 
points of the curve are the same, but the curve itself is different (see Fig. 21.5). 
Using x as the parameter, so that the parametric equations are x = x and y = x, 
we have dx — dx and dy — dx, so 


/ = f x 2 • x dx + (x - 2x) dx 
J o 

= [ ft 3 - x] dx = 

J o 


1 4 1 2 

— x^ — — x* 

4 2 


V 


which we observe is different from the value — y 5 obtained along the parabolic 
path. 


The integral in this example can be written as 

I F • dR, where F = x 2 yi + (jc - 2y)j. 

J c 

If F is thought of as a force field, then the work done by F in moving a particle 
from (0, 0) to (1, 1) is different for the two curves in Examples 1 and 2. This il¬ 
lustrates the fact that in general the line integral of a given vector field from one 
given point to another depends on the choice of the curve, and has different val¬ 
ues for different curves. 

If a curve C consists of a finite number of smooth curves joined at corners, 
then we say that C is a piecewise smooth curve, or a path. The value of a line 
integral along C is then defined as the sum of its values along the smooth pieces 
of C. This is illustrated in the first part of our next example. 


Example 3 Evaluate the line integral 

I y dx + (x + 2 y) dy 
J c 


from (1, 0) to (0, 1), where C is (a) the broken line from (1, 0) to (1, 1) to 
(0, 1); (b) the arc of the circle x = cos t, y = sin t\ (c) the straight line segment 
y = 1 — jc. See Fig. 21.6. 

Solution (a) Along the segment from (1, 0) to (1, 1) we have x = 1 and dx = 
0; and along the segment from (1, 1) to (0, 1) we have y = 1 and dy = 0. Since 
the complete line integral is the sum of the line integrals along each of the seg¬ 
ments, we have 

f y dx + (x + 2 y) dy = ( (1 + 2 y) dy + [ dx 
J c J o J i 


y 



o 


Figure 21.6 
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(b) Here we have jc = cos t and y = sin t for 0 < t < tt/2, so dx = —sin t dt 
and dy = cos t dt, and therefore 

f y dx + (x + 2 y) dy = [ - sin 2 t dt + (cos t + 2 sin t) cos t dt 

J c J o 

r7r/2 

= (cos 2 t — sin 2 t + 2 sin t cos t) dt 
J o 

rir/2 

= (cos 2t + 2 sin t cos t) dt 


— sin 2 1 + sin 2 t 
2 


7t/2 


= 1. 


(c) To integrate along the segment y = 1 — jc we can use jc as the parameter, 
so that dy = —dx. Since jc varies from 1 to 0 along this path, the integral is 

f y dx + (x + 2y) dy = f (1 — x) dx + [x + 2(1 - x)]( — dx) 

J c h 

= (-1 )dx= 1. 

J \ 




In this example all three line integrals have the same value, and we might sus¬ 
pect that perhaps with this integrand we get the same value for any path from 
(1, 0) to (0, 1). This is indeed true, as we shall see in Section 21.2, where we 
investigate the underlying reasons why some line integrals from one point to 
another have values that are independent of the path of integration, and others 
do not. 

It will often be necessary to consider situations in which the path of integra¬ 
tion C is a closed curve, which means that the final point B is the same as the 
initial point A (Fig. 21.7). For the sake of emphasis, in this case a line integral 
is usually written with a small circle on the integral sign, as in 

F • */R or M dx + TV dy. 

Example 4 Calculate j> c F • c/R, where F = y\ + 2;cj and C is the circle jc 2 + 
y 2 = 1 described counterclockwise from A = (1,0) back to the same point (Fig. 
21 . 8 ). 

Solution A simple parametric representation is jc = cos t and y = sin r, where 
the counterclockwise orientation means that t increases from 0 to 27 t. Since R = 
x * + yi = cos tl + si n f j> have 

<iR = (-sin t\ + cos fj) dt, 

and therefore 


F • dR = (sin ri + 2 cos rj) • ( —sin ri + cos rj) dt 
= (2 cos 2 t - sin 2 t) dt 

= fy + |cos2r|^, 


Figure 21.8 
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by the half-angle formulas. It now follows that 


ri 3 . ~] 2rr 

= — t + — sin 2t = it. 
2 4 o 


PROBLEMS 


1 Evaluate the line integral I = Jc xy 2 dx - (jc + y) dy, 
where C is 

(a) the straight line segment from (0, 0) to (1, 2); 

(b) the parabolic path y = 2 jc 2 from (0, 0) to (1, 2); 

(c) the broken line from (0, 0) to (1, 0) to (1, 2). 

Sketch all paths. 

2 Evaluate the line integral I = Jc x 2 y dx - xy 2 dy, where 
C is the broken line joining the points (0, 0), (1, 1), 
(2, 1) in this order. 

3 Evaluate f c dx/y + dy/x, where C is the part of the hy¬ 
perbola xy = 4 from (1, 4) to (4, 1). 

4 Show that (a) J c (x 2 - 2y) dx = -f, (b) Jc2xy 2 dy = 
and (c) J c (jc 2 — 2 y) dx + 2jcy 2 dy = ~ f if C is the 
straight line segment y = x, 0 < jc < 1. 

5 Show that Jc(x 2 + 3jcy) dx + (3jc 2 - 2 y 2 ) dy = if 
C is the segment of the parabola x = t, y = t 2 from t = 
1 to t = 2. 

6 Evaluate Jc(dx + dy)/(x 2 + y 2 ), where C is the upper 
half of the circle x 2 -\- y 2 = a 2 from (a, 0) to (— a , 0). 

7 Find the values of the line integral J c (jc — y) dx + 
Vjcdy along the following paths C from (0, 0) to (1, 1): 
(a) jc = t, y = V, (b ) x = t, y = t 2 \ (c) x = t 2 , y = r, (d) 
x ~ t, y = t 3 . Sketch all paths. 

8 Show that J c (x 2 + y 2 ) dx = —f, if C is the broken line 
from (0, 0) to (1, 1) to (0, 1). 

9 Evaluate J c x dx + x 2 dy from ( —1, 0) to (1, 0) 

(a) along the jc-axis; _ 

(b) along the semicircle y = Vl - x 2 ; 

(c) along the broken line from (-1, 0) to (0, 1) to (1, 1) 
to (1, 0). 

Sketch all paths. 

10 Evaluate <J> C (3jc + 4 y) dx + (2jc + 3y 2 ) dy, where C is 
the circle jc 2 + y 2 = 4 traversed counterclockwise from 
(2, 0). 

11 Find the values of the line integral J c 2 xy dx + (jc 2 + 
y 2 ) dy along the following paths C from (0, 0) to (1, 1): 

(a) y = jc; (b) y = jc 2 ; (c) jc = y 2 ; (d) y = jc 3 ; (e) jc = y 3 ; 
(f) the broken line from (0, 0) to (1, 0) to (1, 1). Sketch 
all paths. 

12 Find the values of the line integral J c x 2 dx + y 2 dy along 
the following paths C from (0, 1) to (1, 0): (a) the cir¬ 


cular arc jc = cos t, y = sin r; (b) the straight line seg¬ 
ment; (c) the segment of the parabola y = 1 — jc 2 . 

13 If F = (yi - jcj)/(jc 2 + y 2 ), find Jc F • dR from (—1, 0) 
to (1,0) 

(a) along the semicircle y = Vl — jc 2 ; 

(b) along the broken line from (-1,0) to (0, 1) to (1, 1) 
to (1,0). 

14 Compute <J C F * dR if F = (jc + y)i + (y 2 - jc)j, where C 
is the closed curve that begins at (1, 0), proceeds along 
the upper half of the unit circle to (—1, 0), and returns 
to (1, 0) along the jc-axis. 

15 Evaluate J c yVy dx + jcVy dy, where C is the part of 
the curve jc 2 = y 3 from (1, 1) to (8, 4). 

16 If F = (2jc + y)i + (3jc - 2y)j, evaluate Jc F • </R along 

(a) the straight line from (0, 0) to (1, 1); 

(b) the parabola y = jc 2 from (0, 0) to (1, 1); 

(c) y = sin ( tt!2 ) jc from (0, 0) to (1, 1); 

(d) jc = y n {n > 0) from (0, 0) to (1, 1). 

17 Show that <J> C (—y dx + jc dy)/(x 2 + y 2 ) = 2tt where C is 
the circle jc 2 + y 2 = a 2 traversed counterclockwise from 
(a, 0). 

18 If F = jcyi + (y 2 + l)j, calculate J c F ■ dR, where C is 

(a) the line segment from (0, 0) to (1, 1); 

(b) the broken line from (0, 0) to (1, 0) to (1, 1); 

(c) the parabola jc = y 2 from (0, 0) to (1, 1). 

19 Find the values of f c y dx + jc dy along the following 
paths C from (— a, 0) to {a, 0): 

(a) the upper half of the circle jc 2 + y 2 = a 2 ; 

(b) the broken line from (— a, 0) to (— a, a) to (a, a) to 

(a, 0); 

(c) the straight line segment joining the points. 

20 Evaluate Jcxy 2 dx + jc 3 y dy, where C is the broken line 
consisting of the segments from (- 1, -1) to (2, - 1) and 
from (2, -1) to (2, 4). 

21 A particle i s moved around a square path from (0, 0) to 
(1, 0) to (1, 1) to (0, 1) to (0, 0) under the action of the 
force field F = (2jc + y)i + (jc + 4y)j. Find the work 
done. 

22 Calculate <J C 2xy dx + (jc 2 + y 2 ) dy, where C is the 
boundary of the semicircular region jc 2 -L y 2 < 1, y > 0 
described counterclockwise. 
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21.2 

INDEPENDENCE OF 
PATH. CONSERVATIVE 
FIELDS 


y> 



(l, o) * 


Figure 21.9 


In Example 3 of Section 21.1, we calculated the line integral 

\ c f-dR 


( 1 ) 


of the vector field 


F(x, y) = y\ + (x + 2y)j (2) 

along each of the three different paths from (1, 0) to (0, 1) shown in Fig. 21.9, 
and we obtained the same value 1 for the integral along all of these paths. This 
was not an accident. The underlying reason for this result is the fact that the vec¬ 
tor field (2) is the gradient of a scalar field, namely, of the function 

f(x, y) = xy + y 2 , (3) 


because clearly 


v/ -s , + a; J - >i+( * +WJ -^ 

To understand the significance of this statement, recall from Section 19.5 that 
in multivariable calculus the gradient plays a similar role to that of the deriva¬ 
tive in single-variable calculus. The Fundamental Theorem of (single-variable) 
Calculus can be expressed in the form 

\ b f\x)dx=f{b)~ f (a), 

Ja 

where fix) is a function of a single variable. The corresponding result here is 

j c Vf-dR=f(B)-f(A), (4) 

where fix, y) is a function of two variables (a scalar field) and A and B are the 
initial and final points of the path C. For example, since the vector field (2) is 
the gradient of the scalar field (3), that is, F = V/ formula (4) tells us that the 
value of the line integral (1) along any of the paths C shown in Fig. 21.9 is 

| c F-dR = f c Vf- dR =/(0, 1) - /(1, 0) = 1 -0=1. 

without calculation. 

Formula (4) is called the Fundamental Theorem of Calculus for Line Integrals. 
We can state this theorem more precisely as follows: 


If a vector field F is the gradient of some scalar field f in a region R, so that F = V/ 
in R, and if C is any piecewise smooth curve in R with initial and final points A and 
B, then 

f c F-dR=/(B)-/(A). (5) 

To prove this, suppose that C is smooth with parametric equations x = xit) and 
y = yit), a < t < b. Then 
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f r Jn ( m rfRl , (bfdfdx df dy 

F • dR = Vf-d R = V/- — <* = hr- -r + ^r- -r 

•'<7 Jc | dt dt 3y dt 


dt 


L 


b d 
dt 


/MO, y(t )] dt 


= f[x(b\ y(b)] - f[(x(a ), y(a)] 


The crucial steps here depend on the multivariable chain rule (Section 19.6) and 
the one-variable Fundamental Theorem of Calculus. The argument for piecewise 
smooth curves now follows at once by applying (5) to each smooth piece sepa¬ 
rately, adding, and canceling the function values at the corners. 

This theorem has several layers of meaning. We begin by illustrating its use¬ 
fulness for evaluating line integrals. 


Example 1 Compute the line integral of the vector field F = y cosxy i + 
x cos xy j along the parabolic path y = x 2 from (0, 0) to (1, 1). 


Solution For this path it is natural to use x as the parameter, where x varies 
from 0 to 1. Since dy = 2x dx, we have 

F • dR = y cos xy dx + x cos xy dy 

= x 2 cos x 3 dx + x cos x 3 2x dx 


= 3x 2 cos x 3 dx, 


so 


| c F-J R 



cos x 3 dx = sin x 3 


l 

o 


sin 1 — sin 0 = sin I. 


This straightforward calculation of the line integral is easy to carry out, but a 
much easier method is now available. The first step is to notice that the vector 
field F is the gradient of the scalar field f(x , y) = sin Jty. (Students should verify 
this.) With this fact in hand, all that remains is to apply formula (5): 


F • dR = sin xy 


(i. i) 
( 0 , 0 ) 


= sin 1 — sin 0 = sin 1. 


The great advantage of this method is that no attention at all needs to be paid to 
the actual path of integration from the first point to the second. 


As this example shows, the Fundamental Theorem can sometimes be used in 
the practical task of evaluating line integrals. Nevertheless, its main importance 
is theoretical. First, we point out that the right side of (5) depends only on the 
points A and B, and not at all on the path C that joins them. The line integral on 
the left side of (5) therefore has the same value for all paths C from A to B. This 
can be expressed by saying that the line integral of a gradient field is indepen¬ 
dent of the path. Next, it is clear from formula (5) that if C is a closed path, so 
that the final point B is the same as the initial point A , then f(B) — f(A) = 0 and 
therefore the line integral is zero. That is, if F is a gradient field, then 

<> F • dR = 0 
Jc 

for every closed path C. 
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These arguments show that 


Gradient field 




integral is independent of path 


integral around closed path is zero 


(The symbol => means “implies.”) Actually, these three properties are equivalent, 
in the sense that each implies the other two. 

To begin the demonstration of equivalence, suppose that the line integral 
of the vector field F is independent of the path. We shall prove that the inte¬ 
gral of F around a closed path is zero. To see why this is so, we examine 
Fig. 21.10, in which two points A and B are chosen on the closed path C. 
These points divide C into paths C\ from A to B and C 2 from B to A. Since 
Ci and — C 2 are both paths from A to B, the assumption of independence of 
path implies that 

f F • dR = f F • dR = - f FdR 

Jc 1 J-C2 Jc 2 


It follows from this that 


F • dR 


-L 


F • dR + F - dR = 0, 


as asserted. Conversely, if we assume that the integral around every closed path 
is zero, then we can easily reverse this argument to show that the integral from 
A to B is independent of the path. 

To complete the proof of the equivalence of the three properties, it suffices to 
show that if F is a vector field whose line integral is independent of path, then 
F = V/ for some scalar field f To do this, we choose a fixed point (xq, yo) in the 
region under discussion and let (x, y) be an arbitrary point in this region. Given 
any path C from (x 0 , yo) to (*> y ) [wo assume there is such a path], we define the 
function /(x, y) by means of the formula 

flx,y)=( F dR = [ <X >) F • rfR. 

Jc J( x o- v o> 

See Fig. 21.11. Because of the hypothesis of independence of path, the value of 
this integral depends only on the point ( x, y) and not on the path C, and there¬ 
fore provides an unambiguous definition for /(x, y). To verify that V/= F, we 
suppose that the vector field F has the usual form, F = M(x, y)i + N(x, y)j, so 
that 

f(x, ;y) 

f(x, y)= M dx + N dy. 

J ( x o* yd> 

To show that df/dx = M, we hold y fixed and move along the straight path from 
{x, y) to (x + Ax, y), as shown in the figure. Since dy = 0 on this short path in¬ 
crement, we have 

rX+Ax 

fix + Ax, y) -f(x, y)= M dx, 

J x 

so 



d/ _ ffx ± Ax, y) - fix, y) 
dx A 1 ™!) Ax 


= lim -M’ 

Ax—^0 Ax J x 


M dx = M 
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by the Fundamental Theorem of Calculus. Similarly, dfldy = N, so V/ = F and 
the argument is complete. 

As we suggested earlier, the main significance of these ideas lies in their ap¬ 
plications to physics. In order to understand what is involved, let us suppose that 
F is a force field and that a particle of mass m is moved by this force along a 
curved path C from a point A to a point B. Let the path be parametrized by the 
time t, with parametric equations x = x(t ) and y = y(t), t\ < t < r 2 . Then the work 
done by F in moving the particle along this path is 



According to Newton’s second law of motion we have 


where v = dR/dt is the velocity. If ^denotes the speed, so that v = |v|, then we 
can write the integrand in (6) as 


dR d\ 1 d 1 d 9 * 

F • — = m— * v = ~ m — (v • v) = - m — ( v 2 ). 
dt dt 2 dt 2 dt 


Therefore (6) becomes 



where v A and v B are the initial and final speeds, that is, the speeds at the points 
A and B. Since \mv 2 is the kinetic energy of the particle, (7) says that the work 
done equals the change in kinetic energy. (A similar discussion for the case of 
linear motion is given in Section 7.7.) 

We continue this line of thought to its natural conclusion. The force field F is 
called conservative if it is the gradient of a scalar field. For reasons that will ap¬ 
pear in a moment, it is customary in this context to introduce a minus sign and 
write F = — VK so that V increases most rapidly in the direction opposite to F. 
The function V(x, y) is then called the potential energy. This function is just the 
negative of what we have been denoting by f It exists if and only if F is a gra¬ 
dient field, and when it does, the Fundamental Theorem (5) tells us that 

W = f F dR = - f VV ■ dR = —[ V(B) - V(A)\ = V(A) - V(B), ( 8 ) 

where A and B are the initial and final points of the arbitrary path C. If we now 
equate (7) and (8), we get 

V(A) - V(B) = | mv B 2 - J mv A \ 2 


or 


^ mv A 2 + V(A) = ^ mv B 2 + V(B). 


(9) 


*In this calculation we use the product rule for the derivative of the dot product of two vector func¬ 
tions of t. This is easy to prove from formula (8) in Section 18.2. 
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(We now see that the minus sign is introduced into the definition of potential en¬ 
ergy in order to make the signs here come out right.) Equation (9) says that the 
sum of the kinetic energy and the potential energy is the same at the initial point 
as it is at the final point. Since these points are arbitrary, the total energy is con¬ 
stant. This is the law of conservation of energy, which is one of the basic prin¬ 
ciples of classical physics. This law is true in any conservative force field, such 
as the earth’s gravitational field or the electric field produced by any distribution 
of electric charge. 

Our work has demonstrated that a force field is conservative if and only if it 
satisfies any one of the following equivalent conditions: 

integral is independent of path 

Gradient field 

integral around closed path is zero 

The importance of these fields justifies turning our attention to the practical prob¬ 
lem of determining whether a given force field is or is not conservative. Since 
any vector field can be thought of as a force field, these remarks apply to vector 
fields in general. 

Example 2 Show that the vector field F = xyi + ry 2 j is not conservative. 



Solution One way of doing this is to show that Jc F • JR does depend on the 
path. We choose two convenient points, say (0, 0) and ( 1 , 1 ), and integrate from 
the first to the second along any two convenient different paths, as shown in Fig. 
21 . 12 . First, along the line y = x we have 

f F • dR = [ xy dx + xy 2 dy = [ (x 2 + x 3 ) dx = — H — = —. 

J c J c 3 4 12 

On the other hand, along the broken line from (0, 0) to ( 1 , 0) to ( 1 , 1 ) we have 
I F ■ dR = ft) ■ dx + J y 2 dy = j. 

Since the values of these integrals are not equal, the field is not conservative. 
In probing for unequal line integrals in this way, we are perfectly free to 
choose paths that make the calculations easy. (Of course, if these two line 
integrals had turned out to be equal, this would not have precluded unequal 
results for two other line integrals, so nothing would have been proved one way 
or the other.) 

Another method is to assume that the field is conservative, so that F = V/ for 
some function f(x, y), and to deduce a contradiction from this assumption. The 
assumption means that there exists a function / such that df/dx = xy and df/dy = 
xy 2 . But this is impossible, because the mixed partial derivatives would then be 


J2—, 

3y 


and 


a 2 /- 

3y 


” y 


which are obviously not equal, whereas the theory of partial derivatives tells us 
that these derivatives must be equal. This contradiction tells us that no / exists, 
so F is not conservative. 
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The reasoning used in the second method of this example depends on the equal¬ 
ity of mixed partial derivatives, 


d 2 f a 2 / 

dy dx dx dy ’ 


( 10 ) 


which is valid in any region where both derivatives are continuous (Section 19.2). 
This reasoning can be extended as follows: if 


F = M(x, y)i + N(x, y)j 


(ID 


is a conservative vector field, so that an/exists with the property that V/ = F or 


— = M and 
dx 



then by (10) we know that 


dM _ dN 
3y 3jc 


( 12 ) 


Condition (12) is therefore necessary for a vector field to be conservative, and we 
have seen how this fact can be used. But is it also sufficient? That is, does (12) 
guarantee that (11) is conservative? We investigate this question in Section 21.3. 


PROBLEMS 


In Problems 1-4, use both methods of Example 2 to show 
that the vector field is not conservative. 

1 F = yi - xy 

2 F = x(y - l)i + xy 

3 F = jc 3 yi + xy 2 y 


In Problems 5-8, show that the given line integrals are not 
independent of path by integrating along two different paths 
from (0, 0) to (1, 1). 

5 f 2 xy dx + (y 2 — x 2 ) dy. 

J c 

6 f 2 xy dx -f (y — jc 2 ) dy. 

J c 

7 f (x 2 — y 3 ) dx + 3jcy 2 dy. 

J c 

8 [ (jc — y) dx + (jc + y) dy. 

J c 

9 Show that 


f ( 1 ,4) 

^ 2jcy dx + jc 2 dy 

is independent of the path, and evaluate the integral by 

(a) using formula (5); 

(b) integrating along any convenient path. 

10 Show that 


CO’Tr) . 

( sin y dx + jc cos y dy 

is independent of the path, and evaluate the integral by 

(a) using formula (5); 

(b) integrating along any convenient path. 

In Problems 11-16, show that the integral is independent of 
the path and use any method to evaluate it. 
f(l.S) 

11 2y dx + 2 jc dy. 
k- 2-D 

12 y?- e x dx + 2 ye x dy. 

Ao.o) 

f(7T/2,l) 

13 e^cos jc dx + e y sin jc dy. 

J(O.O) 

14 J ^ 3jc 2 y 2 dx + 2jc 3 y dy. 

15 [ ( 2xy dx + (jc 2 -f y 2 ) dy. 
k- 2.D 

fn.n 

16 (jc + y) dx + jc dy. 

J(0,0) J 

17 Suppose that a particle of mass m moves in the jcy-plane 
under the influence of the constant gravitational force 
F = — mgy If the particle moves from (jci, yj) to fe, yi) 
along a path C, show that the work done by F is 

W = mg(y, - y 2 ), 

regardless of the path. 
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21.3 

GREEN’S THEOREM 


As we said at the beginning of this chapter, Green’s Theorem establishes an im¬ 
portant link between line integrals and double integrals. Our purpose in this sec¬ 
tion is to reveal the nature of this link. 

Consider a vector field 


F = M(x, y)i + N(x, y)j 


(1) 


defined on a certain region in the xy-plane. We now take up the question of 
whether the condition 

dM 3A 

^T=a7 (2) 



is sufficient to guarantee that F is conservative, that is, that F is the gradient of 
some scalar field / In the light of what we have learned in Section 21.2, this is 
equivalent to asking whether condition (2) implies that the integral of F around 
every closed path is zero. We shall use our investigation of this question as a 
means of discovering Green’s Theorem, which we will then prove and apply in 
various ways. 

The simplest type of closed path C is a rectangular path like the one shown in 
Fig. 21.13. We shall calculate the integral of F around this path and see what is 
needed to make its value zero. Integrating counterclockwise as shown, and be¬ 
ginning with the path segment on the lower edge of the rectangular region R, we 
have 


F • dR = j M(x, y) dx + N(x, y ) dy 


= f M(x , c) dx + [ N(b , y) dy + [ M{x , d) dx + f N(a, y) dy 

J a J c J b J d 

= £ [ N(b , y) - N(a, y)] dy - f [M(x, d) - M(x, c)] dx. (3) 


We next make an ingenious application of the Fundamental Theorem of Calcu¬ 
lus to write these two integrands as 



N(b, y) - N{a, y) = N(x, y) 




=b _ dP 

= a = l * 


dx 


M(x, d) - M(x, c ) = M(x, y) 


y=el 

y=c 



This enables us to write (3) in the form 


F • d R = i M dx + N dy 


] c 

rd rb 3/V rb rcl m 

= U^ dxdy -Ll^r dy 


dM 
a / dy 


’dx. 


These iterated integrals can be written as double integrals over the region R en¬ 
closed by C, so we have 


F • dR = (j) M dx + N dy 

dN 
dx 




dN__dM 
dx dy 


dA. 


(4) 
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Not closed and 
not simple 


Simple but 
not closed 


Closed but 
not simple 


Simple and 
dosed 


Figure 21.14 Various types of curves. 


Now we can see what is happening. Condition (2) implies that this double inte¬ 
gral is zero, so $cF ■ dR — 0. It is tempting to infer from this that condition (2) 
implies F is conservative. However, this inference requires that §c F • JR = 0 for 
every closed path C, and we have demonstrated this only for rectangular paths 
like the one in Fig. 21.13. 

If we pluck out the essence of this argument, we see that it lies in equation 
(4), which we can write in the form 



(5) 


This statement, that a line integral around a closed curve equals a certain double 
integral over the region inside the curve, is called Green s Theorem, after the 
English mathematical physicist George Green.* 

Strictly speaking, Green’s Theorem is not merely equation (5), but rather a 
fairly careful statement of conditions under which (5) is valid. To state such con¬ 
ditions, it is necessary to introduce the concept of a simple closed curve. We al¬ 
ready know that a closed curve is one for which the final point B is the same as 
the initial point A. A plane curve is said to be simple if it does not intersect it¬ 
self anywhere between its endpoints (Fig. 21.14). Unless the contrary is explic¬ 
itly stated, we assume that simple closed curves are positively oriented, which 
means that they are traversed in such a way that their interiors are always on the 
left, as shown on the right in the figure. 

Green s Theorem can now be stated as follows: 

If C is a piecewise smooth, simple closed curve that bounds a region R, and if M(x, y) 
and N(x, y) are continuous and have continuous partial derivatives along C and 
throughout R, then 



(5) 


We have proved (5) only for rectangular regions R of the kind shown in Fig. 
21.13. We now give a similar argument for the case in which R is both vertically 
simple and horizontally simple, in the sense described in Section 20.2. Then we 
shall indicate how to extend the theorem to more general regions. 


*Green (1793-1841) was obliged to leave school at an early age to work in his father’s bakery, and 
consequently had little formal education. By assiduous study in his spare time, he taught himself 
mathematics and physics from library books, particularly Laplace’s Mecanique Celeste. In 1828 he 
published locally at his own expense his most important work. Essay on the Application of Math¬ 
ematical Analysis to the Theories of Electricity and Magnetism. Although Green’s Theorem (in an 
equivalent form) appeared in this pamphlet, little notice was taken until the pamphlet was repub¬ 
lished in 1846, five years after his death, and thereby came to the attention of scientists who had the 
knowledge to appreciate its merits. 
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Since R is assumed to be vertically simple (Fig. 21.15), its boundary C can be 
thought of as consisting of a lower curve y = yiU) and an upper curve y = V 2 U), 
possibly separated by vertical segments on the sides. The integral jM(x , y) dx 
over any part of C that consists of vertical segments is zero, since dx = 0 on such 
a segment. We therefore have 

i M(x, y) dx = \ M[x, y\(x)] dx + [ M[x, y 2 (x)\ dx, (6) 

JC J a J b 


where the lower curve is traced from left to right and the upper curve from right 
to left. By the Fundamental Theorem of Calculus, ( 6 ) can be written as 


M dx = f [M[x, yiU)] - M[x, y 2 Ml} 

JC J a 

= f [ - M(x, y) 


dx 


y=y 2 {x) 




f h dM ff dM , 

_ l i lW _ TT dy * _ II ~ T7 dA ' 


dM 


dM 


dy 


dy 


(7) 


But R is also assumed to be horizontally simple, and a similar argument, which 
we ask students to give in Problem 22, shows that 


i Ndy= l[^ dA - 


( 8 ) 



We now obtain Green’s Theorem (5) for the region R by adding (7) and ( 8 ). 

A complete and rigorous proof of Green’s Theorem is beyond the scope of this 
book. Nevertheless, it is quite easy to extend the argument to cover any region 
R that can be subdivided into a finite number of regions R\, R 2 , .. . , R n that are 
both vertically and horizontally simple. The validity of Green’s Theorem for R 
then follows from its validity for each of the regions R\, R 2 , ... , R n . 

For example, the region R in Fig. 21.16 can be subdivided into the regions R\ 
and 7?2 by introducing the indicated cut, which becomes part of the boundary of 
R 1 when traced from right to left (C 3 ), and part of the boundary of R 2 when traced 
from left to right (C 4 ). By applying Green’s Theorem separately to R\ and R 2 , 
we get 


and 


ci+c 3 


Mdx + Ndy = ff 


dJV 

dx 


dM~ 

dy _ 


dA 



M dx + N dy = 


C2+C4 


*-s 

*2 


dN dM 
dx dy 


If we add these two equations the result is 

„ . „ J r/TdW dM 

Mdx+Ndy = 


Cl+C 2 


dA. 


dA, 


which is Green’s Theorem for the region R. This occurs because the two line inte¬ 
grals along C 3 and C 4 cancel each other, since C 3 and C 4 are the same curve traced 
in opposite directions. Similarly, Green’s Theorem can be extended to the region 
in Fig. 21.17 by subdividing it into the four simpler regions shown in the figure. 


Figure 21.17 
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Example 1 Evaluate the line integral 

/ = j> (3.x — y) dx + (x + 5 y) dy 
around the unit circle x = cos t, y = sin t, 0 < t < 2tt. 

Solution The straightforward calculation of this integral gives 

cIlT 

7=1 [(3 cos t — sin r)(—sin t) + (cos t + 5 sin t )(cos t)] dt 

Jn 


rlTT 

= [2 si: 


sin t cos t + 1 ] dt = 


sin A t + t 


= 2tt. 


This is easy enough, but Green’s Theorem makes it even easier. Since M = 3x - 
y and N = x + 5y, we have 

dM _ ] 

dy 


and -r— = 1, 
dx 


so 


/ = 


= // [1 -(-1)]«M 

R 

= 2 JJ dA = 2(area of circle) = 277. 


Example 2 Evaluate the line integral 

/ = | (2y + Vl + jc 5 ) dx + (5 jc - e y 2 ) dy 
around the circle x 2 + y 2 = 4. 

Solution The actual calculation of this integral looks like a very forbidding task, 
but Green’s Theorem provides another way. since M = 2y + Vl+i 5 and N = 
5x — e y2 , 

dM dN 

= 2 and — = 5. 
dy ox 


Therefore 


/ = || (5 - 2) dA = 3 JJ dA = 3(area of circle) = 3(477) = 1277, 


R R 

since R is a circular disk of radius 2. 


Example 3 If R is any region to which Green’s Theorem is applicable, show that 
the area A of R is given by the formula 

1 


A = — <j> — y dx + x dy. 
2 Jc 


( 9 ) 
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Solution Since M = — y and N = x, and therefore 


dM 

dy 


-1 


dN 

and — = 1, 
ox 


Green’s Theorem yields 


| — y dx + x dy = JJ [ 1 — (— I)] dA = 2 JJ dA = 


2A, 


as stated. 


Example 4 Use formula (9) to find the area bounded by the ellipse x 2 la 2 + 
y 2 /b 2 = 1. 

Solution We can parametrize the ellipse by a* = a cost, y = bsint, where 0 < 
t < 27r. Then formula (9) yields 

1 f 2jr 

A = — [( — b sin t)(—a sin t) + (a cos t)(b cos f)] dt 

2 J o 

1 f 2n 

= — ah dt — TTcib. 

2 J 0 


Our original problem in this section was to determine whether the condition 

dM dN_ 
dy dx 

is sufficient to guarantee that the vector field 

F = M(x, y)i + N(x, y)j ( 1 ) 

is conservative. Green’s Theorem provides the solution. For if C is any simple 
closed path in the domain of F, and if the region enclosed by C is also in the do¬ 
main, then Green’s Theorem tells us that 

By using this equation we see that if dMIdy = dN/dx then the double integral is 
zero, and therefore the line integral is zero. If the line integral is zero around 
every simple closed path, then it is also zero around every closed path, and this 
proves that F is conservative. We emphasize that for this reasoning to work, the 
region enclosed by C must lie entirely in the domain of F. A convenient way to 
guarantee this is to require that the domain of F must be simply connected , which 
means that the inside of every simple closed path in the domain also lies in the 
domain. Roughly speaking, the domain of F is not allowed to have any holes. In 
Fig. 21.18 we show regions with one, two, and three holes, respectively; the points 
inside the inner curves do not belong to the regions R, so these regions are not 
simply connected. Our overall conclusion can be stated as follows: 

If the domain of definition of the vector field F = M(x f y)i -f N(x, y)j is simply con¬ 
nected, then F is conservative if and only i f the condition dMIdy = dN/dx is satisfied. 
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Figure 21.18 Regions not simply 
connected. 


One final question: If a given vector field F is known to be conservative, so 
that F = V/for some function/(;c, y), how do we find/? Such a function is called 
a potential function, or simply a potential , for F. One way is by inspection, but 
this only works in simple cases. A more systematic method is illustrated in the 
following example. As the student will see, it amounts to integrating the equa¬ 
tions 




and condition ( 2 ) guarantees that this can be done. 


Example 5 Find a potential / for the vector field 

F = (y 2 + l)i + 2xyj. 


Solution Here we have M — y 2 + 1 and N = 2xy. It is easy to verify that 
dM/dy = dN/dx, and therefore/exists and our only problem is to find it. We know 
that 


4^- = y 2 + 1 and = 2 ry. (10) 

ox dy 

In computing df/dx, we differentiate with respect to x while holding y constant, 
so by integrating the first of equations (10) with respect tox, we obtain/ = xy 2 + 
x + g(y), where g(y) is a function of y that is yet to be determined. By differ¬ 
entiating with respect to y, we see that df/dy = 2 xy + g'(y), and by comparing 
this with the second of equations (10) we conclude that g'(y) = 0. It follows that 
g(y) is a constant C that can be chosen arbitrarily, and therefore the potential we 
are seeking is f(x, y) = xy 2 + x + C. It is easy to check this result by verifying 
that V/ = F. 


*Recall that for physical reasons the potential energy associated with a force field F is any scalar 
function V (if one exists) such that F = — W. The concepts of potential and potential energy are 
closely related but not identical. 


PROBLEMS 

In Problems 1 -4, evaluate the line integrals directly, and also 
by Green’s Theorem. 

1 §d*y ~ y 2 ) dx + xy 2 dy , where C is the simple closed 
path formed by y = 0, x = 1 , y = x. 


2 § c x dx + xy 2 dy , where C is the simple closed path 
formed by y = x 2 and y = jc. 

3 l/y^i+ \/x dy, where C is the simple closed path 
formed by y = x, y = 4, x = 1. 
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4 §c y 2 dx + x 2 dy , where C is the simple closed path 
formed by y = 0, x = 1, y = 1, x = 0. 

In Problems 5-12, use Green’s Theorem to compute the given 

line integrals. 

5 $c *y dx 4- (x 4- y) dy , where C is the closed path (ob¬ 
viously simple) formed by y = 0, x = 0, y = 1, x = -1. 

6 $c ~ xy/( 1 + x) dx + In (1 + x) dy, where C is the closed 
path formed by y = 0, x + 2y = 4, x = 0. 

7 §c ~ x 2 y/(l + x 2 ) dx + tan -1 x dy , where C is the closed 
path formed by y = 0, x = 1, y = 1, x = 0. 

8 x dx + xy dy, where C is the closed path formed by 
y = 0, x 2 + y 2 = 1 (x, y > 0), x = 0. 

9 $c (e* 3 + y 2 ) dx + (x + Vl 4- y 7 ) dy, where C is the 
closed path formed by y = 0, x = 1, y = x. 

10 j>c ~ y 3 dx + x 3 dy, where C is the closed path formed 
by y = x 3 and y = x. 

11 $c (~y 2 + tan -1 x) dx + In y dy, where C is the closed 
path formed by y = x 2 and x = y 2 . 

12 0* 2 - y) dx + x dy, where C is the circle x 2 + y 2 = 
9. 

In Problems 13-20, use formula (9) to find the area bounded 

by the given curves. 

13 y = 3x and y 2 = 9x. 

14 y = 0,x + y = a(<3>0), x = 0. 

15 The x-axis and one arch of the cycloid x = a(6 - sin 0), 
y = a( 1 - cos 6 ). 

16 y = x 2 and x = y 3 . 

17 x = a cos 3 6 and y = a sin 3 0, 0 < 6 < 2 it (an astroid or 
hypocycloid of four cusps). 

18 y = x 2 and x = y 2 . 

19 The x-axis and one arch of y = sin x. 

20 9y = x, xy = 1, y = x. 

21 The loop of the folium o f Descartes (with Cartesian equa¬ 
tion x 3 + y 3 = 3oxy) is shown in Fig. 17.11. In Problem 
16 of Section 17.1 we asked students to introduce the pa¬ 
rameter t = y/x and obtain the parametric equations 

3 at 3 at 2 

X = TT7' y = TT7- 


29 Let Ci, C 2 , and C 3 be the simple closed curves shown in 
Fig. 21.19, and let R be the region inside C\ and outside 
C 2 and C 3 . Assume that A/(x, y) and N(x, y) are contin¬ 
uous and have continuous partial derivatives in R and 
along all the curves. Show that Green’s Theorem 


Mclx + N 


dy = \\ 


[[ 

'dN _ 

dM~ 

JJ 

R 

dx 

dy 


dA 


remains valid in this case, provided that C is understood 
to be the total boundary of R, consisting of Ci, C 2 , and 
C 3 positively oriented as shown in the figure. (The curves 
C 2 and C 3 are oriented clockwise, but nevertheless the 
orientation is positive because they are traversed in such 
a way that the region R remains on the left.) 



30 Can Green’s Theorem be used to evaluate the line inte¬ 
gral 


f c 72+^ * + dy ' 

(a) where C is the circle x 2 + y 2 = 1 ? 

(b) where C is the triangle with vertices (1, 0), (1, 2), 
(2,2)7 

31 If C| is the circle x 2 + y 2 = 1 and C 2 is an arbitrary sim¬ 
ple closed path containing Cj, as shown in Fig. 21.20, 
use the idea of Problem 29 to show that 


Use formula (9) to find the area of the loop. Hint: The 
part of the loop below the line y = x is traced out as t in¬ 
creases from 0 to 1. 

22 Give the details of the argument establishing formula (8) 
for the case in which R is horizontally simple. 

In Problems 23-28, verify that the given vector field is con¬ 
servative and find a potential for it. 

23 F = ) l3 i + 3 xy 2 ). 

24 F = e y cos xi + e y sin xj. 

25 F = (ye*>' - 2x)i + {xe xy + 2y)j. 

26 F = y cos xyi + x cos xyj. 

27 F = (sin y — y sin x)i + (x cos y + cos x)j. 

28 F = xi + yj. 


y 



Figure 21.20 
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dx + 


Jt 


Jt 2 + y 2 


dy 


and evaluate the integral on the left by calculating the in¬ 
tegral on the right. 


c, * 2 + y 2 


dx + 


x 2 + y : 


dy, 


In this section and the next, we move out of the plane into three-dimensional 
space and give a brief intuitive introduction to the two fundamental integral the¬ 
orems of vector analysis. These theorems are roughly similar to each other, for 
both make assertions of the following kind: 

The integral of a certain function over the boundary of a region is equal to the inte¬ 
gral of a related function over the region itself 


21.4 

SURFACE INTEGRALS 
AND GAUSS’S 
THEOREM 


It is possible to spend considerable time analyzing such purely mathematical is¬ 
sues as what is meant by a region and its boundary, but in this short sketch we 
shall proceed informally and concentrate instead on the physical meaning of what 
we are doing. 

The concept of gradient , as we presented it in Chapter 19, applies only to 
scalar fields, that is, functions whose values are numbers. The gradient of a scalar 
field f(x, y , z) is a vector field that represents the rate of change of f because at 
any point its component in a given direction is the directional derivative of / in 
that direction. Our purpose here is to consider the more complicated problem of 
describing the rate of change of a vector field. There are two fundamental tools 
for measuring the rate of change of a vector field: the divergence and the curl. 

We recall that the gradient of a scalar field f(x, y, z) is defined by 





where the symbol V (“del”) represents the vector differential operator 


V 


A- A- 

dx 1 + dy J 



If F = L i + Mj + /Vk is a given vector field, we can apply V to F in two ways, 
by using the dot and cross products. We interpret the dot product of V and F to 
mean 


v - F= (i ,+ i j+ i k )- (Li+Mj+ ® 

_dL dM dN 
8y + dz ' 

This scalar quantity is called the divergence of F and is often denoted by div F, 
so that 


div F 


V 


dL dM dN 
dx dy dz' 


The cross product of V and F is interpreted to mean* 


( 1 ) 


^Remember formula (II) in Section 18.3. 
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V X F = 


1 J 

A. A 

dx 3y 

L M 


This vector quantity is 


k 

d (dN dAf\. (dL dN\ . (dAi dL\ 

N 

called the curl of F and is often denoted by curl F, so that 


curl F = V X F. (2) 

Example 1 Compute the divergence and curl of the vector field F = 2;c 2 yi + 
3*z 3 j + x> ,2 z 2 k. 


Solution 


and 


By using formulas (1) and (2) we at once obtain 

div F = V • F = (2 x 2 y) + (3 xz 3 ) + -j- (xy 2 z 2 ) 

dx dy dz 

= 4xy + 2xy 2 z 


curl F = V x F = 


i j k 

AAA 

dx dy dz 

2x 2 y 3 xz 3 xy 2 z 2 

(2xyz 2 - 9xz 2 )\ + (~y 2 z 2 )j + (3z 3 - 2x 2 )k. 


There is clearly no difficulty about performing routine calculations of this kind. 
The real questions are. What do they mean and what is their value? Our purpose 
in the rest of this section is to explore the meaning of the divergence, and to do 
this we need the concept of flux. 


THE MEANING OF THE DIVERGENCE 



Figure 21.21 


We shall use an example from hydrodynamics to motivate the ideas. Suppose that 
a stream of fluid (gas or liquid) is flowing through a region of space. At a given 
point (x, y, z), let its density be the scalar function 8 = 8(x, y, z) and its velocity 
the vector function v = \(x, y, z), and consider the vector field F = Sv. Now con¬ 
sider a small flat patch of surface inside the fluid, with area A A and unit normal 
vector n, as shown in Fig. 21.21. If we think of this patch as a piece of screen 
or netting, so that the fluid can move through it without hindrance, we wish to 
find an expression for the amount of fluid that flows through the patch per unit 
time. It is clear from the figure that the fluid passing through the patch in a small 
time interval At forms a small tube of approximate volume (v At) • n A A, and 
the approximate mass of the fluid in this tube is S(v At) • n AA.* The approxi¬ 
mate mass of fluid crossing the area AA per unit time is therefore 8\ • n AA or 
F • n A A. This is called the flux of the vector field F through the area A A. 

We now put forward an alternative definition for the divergence of F and then 
show that this new definition agrees with the one given above in formula (1). The 


*We assume in this discussion that all functions are continuous, so when AA and A t are very small, 
the vector v changes very little in direction or magnitude from one point of A A to another, and the 
density 8 changes very little from one point of the tube to another. 
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purpose of this maneuver is to arrive at a way of thinking about the divergence 
that conveys an intuitive understanding of what it means. 

Consider a point P = (t, y, z) at the center of a small rectangular box with 
edges Ax, Ay, Az, as shown in Fig. 21.22. We compute the total flux of the vec¬ 
tor field F outward through the six faces of this box (i.e., on each face we choose 
n to be the outward unit normal). We then divide this total flux by the volume 
A V = At Ay Az of the box, and form the limit of this flux per unit volume as the 
dimensions of the box approach zero. This is our new definition for the diver¬ 
gence of F at the point P = (t, y, z): 


div F = (flux of F out through the faces). 



Physically, this represents the mass of fluid that emerges from a small element 
of volume containing the point P, per unit time per unit volume. 

To show that this definition agrees with formula (1), we carry out a rough 
calculation of the limit (3), where F = Li + Mj + Nk. On the front face of the 
box in Fig. 21.22 we see that the outward unit normal is i, so the flux out 
through this face is approximately L(t + {At, y, z) Ay Az. Since the outward unit 
normal on the back face is —i, the flux out through this face is approximately 
—L(t — {At, y, z) Ay Az, and therefore the combined flux out through the front 
and back faces is approximately 

[L(x + {Ar, y, z) ~ L(x - {Ar, y, z)\ Ay Az. 

Similarly, the faces in the y-direction and z-direction contribute flux of approxi¬ 
mate amounts 



Figure 21.22 


[M(t, y + {Ay, z) ~ M(t, y - {Ay, z)] At Az 


and 


[N(t, y, z + {Az) - N(x, y, z - {Az)] Av Ay. 


We next divide the sum of these three quantities—the total flux out through all 
the faces of the box—by AV = At Ay Az to obtain 


L(x + {At, y, z) - L(t - {At, y, z) 
At 


M(t, y + {Ay, z) — M(x, y — {Ay, z) 

Ay 


+ 


N( T, y, z + {Az) - N( T, y, z - {Az) 

Az 


Finally, if we take the limit of this expression as At, Ay, Az -> 0, then formula 
(3) yields the earlier definition (1), as stated,* 


div F = 



dM dN 
dy + dz 


This result permits us to consider (3) as the basic definition of the divergence 
and (1) as merely a formula for computing it in rectangular coordinates. 


*Here we use a slightly different way of defining the derivative of a function. See Additional Prob¬ 
lem 9 in Chapter 2. 
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SURFACE INTEGRALS 


Let S be a smooth surface and f(x, y, z) a continuous function defined on S. The 
surface integral of/over S is denoted by 

JJf(x,y,z)dA, (4) 

and is defined as a limit of sums in the following way. We begin by subdividing 
the surface into n small pieces with areas AAi, AA 2 , . . . , A A n . We next choose 
a point (jc/, y h zd on the ith piece, find the value f(x/, y h zd of the function at this 
point, multiply this value by the area A A/ to obtain the product /(*/, y h zd A A,-, 
and form the sum of these products, 

n 

(5) 

i=i 

Finally, we let n tend to infinity in such a way that the largest diameter of the 
pieces approaches zero; that is, we carry out a sequence of subdivisions of the 
surface S into smaller and smaller pieces, each time constructing a sum of the 
form (5). If these sums approach a limiting value, independent of the way the 
subdivisions are formed and the way the points (jc„ y h zd are chosen, then this 
limit is the definition of the surface integral (4): 

[[/(*. y, z) dA = lim £/(*, yh zd AA f . 
s '—Pi 



11 may be encouraging to students to know that only rarely do w e actually eval¬ 
uate a surface integral. It is the concept of these integrals that is important, be¬ 
cause they provide a convenient language for expressing certain basic ideas of 
mathematics and physics. 

To see what a surface integral can represent, we return to our example from 
hydrodynamics. Consider a fluid flowing through a certain region of space, and 
let 8 = y, z) and v = \(x, y, z) be its density and velocity, as before. Sup¬ 
pose that S is a smooth surface lying inside the region, and think of S as a curved 
piece of screen or netting that permits the fluid to pass through it without 
any resistance (Fig. 21.23). As we saw in our previous discussion, the mass of 
fluid crossing a surface element of area dA and unit normal n per unit time, is 
8\ • n dA or F • n dA, where F = 8\. Accordingly, the surface integral 

JJ F • n dA (6) 

gives the rate of flow of the fluid through the entire surface S in terms of mass 
per unit time. This is called the flux of F through S. 

More generally, if F is any vector field whatever, the surface integral ( 6 ) is 
still called the flux of F through the surface S. The physical meaning of this in¬ 
tegral clearly depends on the nature of the physical quantity represented by F. A 
variety of interpretations and applications arise by letting F be a vector field re¬ 
lated to heat flow, or gravitation, or electricity, or magnetism. Hydrodynamics is 
only one of many subjects in which these concepts are useful. 

We restricted ourselves to a smooth surface in the above discussion in order 
to guarantee that the unit normal vector n will be a continuous function of the 
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position of its tail, and this in turn is necessary in order to guarantee that the in¬ 
tegrand F • n in ( 6 ) is a continuous scalar function. A surface S is called piece- 
wise smooth if it consists of a finite number of smooth pieces. The surfaces we 
work with are understood to be piecewise smooth, and the value of an integral 
of the form ( 6 ) over such a surface is defined to be the sum of its values over 
the smooth pieces. 

GAUSS’S THEOREM 

Surface integrals like ( 6 ) take on special importance when they are extended over 
closed surfaces. A surface S is said to be closed if it is the boundary of a bounded 
region of space. As examples we mention the surfaces of a sphere, a cube, a cylin¬ 
der, and a tetrahedron. 

Gauss's Theorem (also called the Divergence Theorem) states that 

The flux of a vector field F out through a closed suiface S equals the integral of the 
divergence of F over the region R bounded by S, 

|| F • n dA = HI div F dV. (7) 

S R 

This is a rather crude statement, without any of the hypotheses or carefully for¬ 
mulated restrictions that characterize most respectable mathematical theorems. 
We shall provide an equally crude “proof”—which, however, has the great merit 
of showing at a glance why the theorem is true. 

First, we use planes parallel to the coordinate planes to subdivide the region 
R into a great many small rectangular boxes of the kind shown in Fig. 21.24 (we 
ignore the incomplete boxes that do not lie wholly inside R). For the box in the 
figure, with volume AK definition (3) tells us that the outward flux of F over the 
faces is given by the approximate formula 

flux of F over faces = (div F) AU (8) 

We now observe that the outward flux of F through the surface S is approxi¬ 
mately equal to the total flux over all the faces of all the boxes, since for two ad¬ 
jacent boxes the outward flux from one through their common face precisely can¬ 
cels the outward flux from the other through the same face, leaving only the flux 
through all the exterior faces. In view of ( 8 ), this tells us that 

JJf • n <A4 = 2(div F) AU 

Finally, by using the fact that the sum on the right is an approximating sum for 
the triple integral of the divergence of F over R, we obtain (7) by taking smaller 
and smaller subdivisions of R. 

Example 2 Make a direct calculation of the flux of the vector field F = ,vi + 
yj + zk out through the surface of the cylinder whose lateral surface is x 2 + y 2 = 
a 2 and whose bottom and top are z = 0 and z = b. Also find this flux by apply¬ 
ing Gauss’s Theorem. 

Solution On the lateral surface L we have n = (x\ 4- y\)!a, so the flux over L is 


Figure 21.24 






776 


LINE AND SURFACE INTEGRALS. GREEN S THEOREM, GAUSS'S THEOREM, AND STOKES' THEOREM 


JJ F * n dA = JJ- — dA — JJ a dA = a{2irab) = 2Tra 2 b. 

l l a L 

On the top T we have n = k, soon7;F*n = z = 6 and the flux is 

JJ F • n dA = JJ b dA = b(ira 2 ) — iTa 2 b. 

r T 

On the bottom B we have n = -k, so on B, F • n = — z - 0 and the flux is 

JJp-ndA = JJo dA = 0 . 

B B 

Accordingly, the flux over the whole surface is 2iTa 2 b + 7ra 2 b + 0 = 2ira 2 b. To 
find this flux by applying Gauss’s Theorem, we have only to notice that 


and therefore 


divF= i (x)+ i (>,)+ i (z) = 


3, 


JJ F • n dA = HI div F dV = JJJ 3 dV = 3(volume) = 37 Ta 2 b. 

SR R 


In Example 2 the integrations were too easy to present much of a technical 
challenge. We next consider a similar problem in which actual calculations are 
needed. 


Example 3 Let S be the surface of the cylinder described in Example 2, and find 
the surface integral of the function x 2 z over S. 



Solution As before, S is piecewise smooth and we must integrate separately 
over the lateral area L, the top T, and the bottom B: 

JJ x 2 z dA = JJ x 2 z dA + JJ x 2 z dA + JJ x 2 z dA. 

s L r B 

The third integral here is clearly zero, because z — 0 on B. For the first integral 
we have (using cylindrical coordinates as shown in Fig. 21.25) 


[\ x 2 z dA = \ [ (a cos 6) 2 z{a dO dz) 
JJ L J o J o 

n Z 77" 

z cos 2 6 dd dz 

) 




o •'o 

b 


z| 4 sin 20 


rb 

dz = TTO 3 Z 


dz 


For the second integral we use dA = r dr dd, x = r cos 6, z 


JJ x 2 z dA = J J (r cos 6) 2 b(r dr 


dd) 


-W 

4 Jn 


cos 2 6 dd 


= J- TTO?b 2 . 
2 

= b, so 


= "7 d + -j- sin 20 

4 [2 4 



o 
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The total surface integral is therefore 

Jj x 2 z dA = ^ 7 m 3 b 2 + jra 4 b + 0 

5 

= ira 3 b(2b + a). 


Finally, we consider a problem in which spherical coordinates are used for cal¬ 
culating a surface integral. 


Example 4 Let S be the surface of the solid (Fig. 21.26) bounded below by the 
xy-plane and above by the upper half of the sphere x 2 + y 2 + z 2 = a 2 . Find the 
flux of the vector field F = zk out through S by direct calculation, and also by 
using Gauss’s Theorem. 


Solution Let B denote the bottom of the solid in the .ry-plane, and T the hemi¬ 
spherical top. On B the unit normal vector n is given by n = -k, and on T we 
have n = (xi + 3 j + zk)/a , so 


-z = 0 


F • n = 


( 1 a cos <ft ) 2 


= a cos 2 </> 


on B , 
on T. 


This shows that the flux through B is zero, and since the element of area on T is 
dA = (a d(fr)(a sin 0 dO) = a 2 sin 0 dcf) d9, the total flux out through S is given 
by 



Figure 21.26 


JJf • n dA = JJ (a cos 2 2 sin </> d(f) dd) 

S T 

J r2ir r ttH 

cos 2 (p sin <p d(f) dd 

0 J o 

[ 2iT r 1 

= a 3 I —— cos 3 
J o |_ 3 

To find this flux by using Gauss’s Theorem, we merely observe that since 
div F = 1 we have 


dd 


rliT 1 

= a 3 ±dd = 

Jn j 


JTO 3 . 


JJf ■ n dA = JJJ div F dV = JJJ dV = volume = 

S R R 


Gauss’s Theorem is a profound theorem of mathematical analysis, with a 
wealth of important applications to many of the physical sciences. The cursory 
sketch of these ideas that we have given here—together with a similar sketch 
of Stokes’ Theorem in the next section—is perhaps as far as an introductory 
calculus course should go in this direction. Students who wish to learn more 
are encouraged to continue and take advanced courses (vector analysis, po¬ 
tential theory, mathematical physics, etc.) in which these themes are fully 
developed. 
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PROBLEMS 

1 Find the divergence of the vector field F if 

(a) F = (y — z)i + (z - x)j + (x ~ y)k; 

(b) F = (2 z 2 - sin e y )\ + xyj - xzk; 

(c) F = xyi + xz 3 j + (2z - yz) k; 

(d) F = e x sin yi + e A cos yj + sin xk; 

(e) F = — i + — j + — k, where r = Vx 2 + y 2 + z 2 . 

r r r 

In Problems 2-6, use Gauss’s Theorem to find the flux of the 
given vector field over the given surface S. 

2 F = xi — y j 4- zk; S is the surface of the cylinder in Ex¬ 
ample 2. 

3 F = xi + yj + zk; S is the surface of the ellipsoid 
x 2 /a 2 + y 2 /b 2 + z 2 /c 2 = 1. 

4 F = xi + yj + zk; S is the surface of the tetrahedron 
formed by the plane xla + y/b + z/c = 1 (a, b,c > 0) and 
the coordinate planes. 

5 F = x 3 i + y 3 j + z 3 k; S is the surface of the sphere x 2 + 
y 2 + z 2 = a 2 . 

6 F = yzi + xzj + xyk; S is the surface of the cone z 2 = 
m\x 2 + y 2 ), 0 ^ z — h. 

7 If R is the position vector xi + yj + zk and r = 

Vx 2 + y 2 + z 2 is its length, find the divergence 

of the central force field F = /(r)(R/r), where/(r) is 
an arbitrary differentiable function. 

8 Find the flux of the vector field defined in Problem 7 
over the sphere x 2 + y 2 + z 2 = a 2 if 

(a) /(r) = r; (b) f(r) = 

9 If n is a positive number and j{r) = \lr n in Problem 7, 
show that the divergence of the force field F is zero if 
n = 2, and only in this case. 

10 Verify Gauss’s Theorem for the vector field F = 2zi + 
(x — y)j + (2xy + z)k and the rectangular box whose 
faces are x = 0, x = 1, y = 0, y = 2, z = 0, z = 3. 

11 Use Gauss’s Theorem to find the flux of F over the sur¬ 
face of the box in Problem 10 if 


(a) F = x 2 i + y 2 j + z 2 k; 

(b) F = xzi + xyj + yzk. 

12 If F = xi + yj + k, find the value of II F • n dA over 

5 

(a) the surface of the cube whose faces are x = 0, x = 
1, y = 0, y = 1, z = 0, z = 1; 

(b) the surface of the sphere x 2 + y 2 + z 2 = 4; 

(c) the part of the plane x + 2y + 3z = 6 that lies in the 
first octant, where n points upward. 

13 If S is a closed surface, show that the flux of the posi¬ 
tion vector R = xi + yj + zkout through S is 3K where 
V is the volume of the region bounded by S. What is the 
flux through S of F = 27xi - 1 lyj + 4zk? 

14 Use Gauss’s Theorem to find the flux of the vector field 
F = x 2 i - 2xy j + xyz 2 k out throu gh the surface of the 
solid bounded by z = 0 and z = Va 2 — x 2 — y 2 . 

15 Let S be the surface of the sphere x 2 + y 2 + z 2 = a 2 . If 
F = r 2 (xi + yj + zk), where r 2 = x 2 + y 2 + z 2 , find the 
flux of F out through S by direct calculation, and also by 
using Gauss’s Theorem. 

16 If S is the surface of the cube whose vertices are (0, 0, 
0), (1,0, 0), (1, 1, 0), (0, 1, 0),(0,0, 1), (1,0, 1), (1, 1, 
1), and (0, 1, 1), verify Gauss’s Theorem for the vector 
field F = xzi + y 2 j + xk. 

17 If S is the surface of the tetrahedron bounded by the co¬ 
ordinate planes and the plane 2x + 2y + z = 6, verify 
Gauss’s Theorem for the vector field F = xi + y 2 j + zk. 

18 If S is the surface of the solid bounded by z = 0 and z = 
V a 2 — x 2 — y 2 , verify Gauss’s Theorem for the vector 
field F = 2xzi + yzj + z 2 k. 

19 Let S be the surface of the solid bounded by the cylin¬ 
der x 2 + y 2 = 4, the plane x + z = 4, and the xy-plane. 
Find the flux out through S of the vector field F = 
(x 2 + e y )\ + (xy — tan z)j + sin x k. 


21.5 

STOKES’ THEOREM 


Stokes’ Theorem is an extension of Green’s Theorem to three dimensions, in¬ 
volving curved surfaces and their boundaries rather than plane regions and their 
boundaries. Sir George Stokes (1819-1903) was an eminent British mathemati¬ 
cal physicist. He introduced the theorem known by his name in an examination 
question for students at Cambridge University in 1854. A fairly full account of 
Stokes’ personality and scientific work can be found in G. E. Hutchinson, The 
Enchanted Voyage (Yale University Press, 1962). We shall state the theorem af¬ 
ter a few preliminaries that will help us understand its meaning. 

Suppose that F = Li + A/j + Nk is a vector field defined in a certain region 
of space. It will be convenient in this section to think of F as the velocity field 
of a flowing fluid. Suppose also that C is a curve that lies in the region and is 
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specified by certain parametric equations. The line integral of F along C, denoted 
by 

I F • dR or j Ldx + M dy + N dz, 

is defined and calculated in just the same way as in two dimensions, and requires 
no further explanation. If C is a closed curve, as shown in Fig. 21.27, the line 
integral is usually written as 



[> F • dR. 


This integral measures the tendency of the fluid to circulate or swirl around C, * ,gure 21 
and is called the circulation of F around C. 

Now suppose that C is a small simple closed curve that lies in a plane with 
unit normal vector n, where the direction of n is related to the direction of C by 
the right-hand thumb rule, and let P be a point inside C (Fig. 21.27). If A A is 
the area of the region enclosed by C, then 


1 

AA 


F • dR 


can be thought of as the circulation of F per unit area around P, and the limit 


lim 


1 


aa->o AA J c 


F • dR 


is called the circulation density of F at P around n. The point of these remarks 
is that this concept is closely related to the curl of the vector field F, which was 
defined in Section 21.4 by 


i 

j 

k 

_a_ 

d_ 

_a_ 


dy 

dz 

L 

M 

N 


( 1 ) 


In fact, it can be shown that the curl of F has the property that at any point 
its component in a given direction n is precisely the circulation density of F 
around n, 


(curl F) • n = lim . . 
v aa_»o AA 


-Lf 

A Air 


F • dR. 


( 2 ) 


The proof of (2) is fairly complicated and will not be given here; it can be found 
in any good book on vector analysis. 

We can visualize the meaning of (2) in a concrete way if we imagine a small 
paddle wheel placed in the flowing fluid at the point P with its axis pointing in v. 

the direction of n (Fig. 21.28). The circulation of the fluid around n will cause 
the paddle wheel to turn, and the speed at which it spins will be proportional to 
the circulation density. The paddle wheel will spin fastest when it points in the \ 
direction in which the circulation density is largest, and (2) tells us that this hap- 
pens when n points in the same direction as curl F. We conclude that at each 
point of space the vector curl F has the direction in which the circulation den¬ 
sity is largest, with magnitude equal to this largest circulation density. The pad- Figure 21 
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Figure 21.29 


die wheel we have described here can therefore be thought of as an imaginary 
instrument for sensing the direction and magnitude of the curl. 

We are now ready for the main theorem of this section. Stokes' Theorem as¬ 
serts the following (see Fig. 21.29): 

If S is a surface in space with boundary curve C, then the circulation of a vector field 

F around C is equal to the integral over S of the normal component of the curl of F, 

FdR = If (curl F) ■ n dA. (3) 

c s 

Just as in the case of Gauss’s Theorem in the preceding section, we choose to 
keep this statement as simple as possible and not complicate it with the hypotheses 
and restrictions that would be needed to convert it into a genuine mathematical 
theorem. For instance, it is necessary to assume that S is a two-sided (orientable) 
surface with the direction of the unit normal vector n related to the direction of 
C by the right-hand thumb rule, as shown in the figure. Also, of course, n must 
be continuous, F must be continuous, L, M , and N must have continuous partial 
derivatives, and so on. 

This rough, intuitive version of Stokes’ Theorem has a rough, intuitive “proof” 
based on equation (2). First, we subdivide the surface S into a large number of 
small patches with areas A A f and boundary curves C/. By applying (2) to the ith 
patch we obtain the approximate equation 


Q 


F dR = (curl F) • n A/f. 



Figure 21.30 


If we add the left sides of these equations for all curves C [y then the line inte¬ 
grals over all interior common boundaries cancel, being calculated once in each 
direction, leaving only the line integral around the exterior boundary C (see Fig. 
21.30). This gives 


I F • dR = X (curl F) • n AA, (4) 

and by taking smaller and smaller subdivisions we obtain (3), since the sums on 
the right side of (4) are approximating sums for the surface integral on the right 
side of (3). 


Example 1 If the surface S is a region R lying flat in the jcy-plane, then n = k 
and by (1) we see that 


(curl F) • n 


dx 


dL 

Sy’ 


so (3) reduces to 


f c Ldx + Mdy = jjjff- 



dA. 


This is Green’s Theorem (Section 21.3), which is thus a special case of Stokes’ 
Theorem. 
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Example 2 Evaluate the line integral 

/ = | y 3 z 2 dx + 3 xy 2 z 2 dy + 2xy 3 z dz 

around the closed curve C whose vector equation is R = a sin ti + b cos t] + 
c cos fk, 0 < t < 27r, where abc i= 0. 


Solution This integral is the circulation around C of the vector field F = y 3 z 2 i + 
3xy 2 z 2 j + 2.ry 3 zk. An easy calculation shows that curl F = V X F = 0. If S is 
any surface whose boundary is C, then the right side of (3) has the value 0 in 
this case, and therefore Stokes’ Theorem tells us that / = 0. 

Also, for this particular F and C it is not too difficult to verify Stokes’ Theo¬ 
rem by calculating I directly. This gives 

r2ir 

I = [(b 3 cos 3 t)(c 2 cos 2 t)(a cos t) 

J o 

+ 3 (a sin t)(b 2 cos 2 t)(c 2 cos 2 t)( — b sin t) 

+ 2 (a sin t)(b 3 cos 3 r)(c cos t)( — c sin t)\ dt 


rhr 

= ab 3 c 2 [cos 6 t — 5 cos 4 t sin 2 t] dt = ab 3 c 2 sin t cos 5 

J o 


27T 
0 


= 0 . 


In Section 21.2 we proved that three properties of vector fields in the plane 
are equivalent to one another. Stokes’ Theorem makes it possible to extend these 
ideas in a natural way to three-dimensional space. Specifically, if F is a vector 
field defined in a simply connected region of space, then any one of the follow¬ 
ing four properties implies the remaining three:* 

(a) §c F • = 0 for every simple closed curve C. 

(b) Jc F • JR is independent of the path. 

(c) F is a gradient field, i.e., F = V/ for some scalar field/ 

(d) curl F = 0. 


The equivalence of (a), (b), and (c) is established in just the same way as in two 
dimensions; the fact that (c) implies (d) is a straightforward calculation; and 
Stokes’ Theorem enables us to show very easily that (d) implies (a). A vector 
field with any one of these properties is said to be conservative or irrotational 
[because of (d)]. 

For students who desire a fuller explanation of the reasons underlying the 
equivalence of these four properties, we offer the following details of the argu¬ 
ments. 

To understand the equivalence of (a) and (b) we examine Fig. 21.31, in which 
Ci and C 2 are two paths from A to B and C is the simple closed curve formed 
by tracing out Ci and — C 2 in this order, where — C 2 means C 2 traced in the op¬ 
posite direction. For these paths, property (b) tells us that 

f F • dR = f F • dR, 

J c\ J c 2 



*A region in three-dimensional space is said to be simply connected if every simple closed curve in 
the region can be shrunk continuously to a point without leaving the region. 
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which is equivalent to 


F • dR - F • dR = 0, 


r C] J c 2 


or 


f F • dR + [ F • dR = 0, 

J ci J -c 2 


or 


F • dR = 0. 


'c 



This shows that (b) implies (a), and the reasoning is clearly reversible. 

To understand why (c) implies (b), we use the above notation and write 





df=f(B) - /(A). 


Our conclusion now follows from the fact that the expression last written de¬ 
pends only on the points A and B, and not at all on the path of integration. (A 
slightly more detailed treatment of this reasoning for the two-dimensional case 
is given in Section 21.2.) 

To show that (b) implies (c), we must use independence of path to construct 
a potential function f(x, y, z). This is easy to do by choosing a fixed point (jcq, 
yo, Zo) aR d integrating F along any path from (jco, yo> Zo) to a variable point (x, y, 
z), as suggested in Fig. 21.32. Since the value of the integral is independent of 
the choice of path, this integral is a function only of the point (x, y, z ), and de¬ 
fines our potential function: 



The next step is to show that V/ = F by using the calculations given for the two- 
dimensional case in Section 21.2, but we do not repeat these details. 

To prove that (c) implies (d) by the “straightforward calculation” mentioned 
above, we have only to write 



x 


Figure 21.32 


i j k 

AAA 

dx dy dz 

AAA. 

8y dz 



curl F = curl V/ = 
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because this expression vanishes by the equality of the mixed partial derivatives. 

To establish the final implication, that (d) implies (a), we consider a simple 
closed curve C, as shown in Fig. 21.33. Since our region is simply connected, C 
can be shrunk continuously to a point without leaving the region. In this shrink¬ 
ing process, C sweeps out a surface S, and by Stokes’ Theorem we have 

j> F ■ dR = jj (curl F) • n dA. 

The integral on the right equals 0 because of our assumption that curl F = 0, and 
this tells us that 


[> F-dR = 0, 



Figure 21.33 


which completes the argument. 

One final remark: The relations among properties (a) through (d) will not be 
truly understood until we reach the stage at which the implications described 
above can be grasped as an organic whole and recalled in a few seconds of 
thought. 

We have seen that Gauss’s Theorem relates an integral over a closed surface 
to a corresponding volume integral over the region of space enclosed by the sur¬ 
face, and Stokes’ Theorem relates an integral around a closed curve to a corre¬ 
sponding surface integral over any surface bounded by the curve. As we sug¬ 
gested at the beginning of Section 21.4, these statements are very similar and are 
presumably somehow connected with each other. It turns out that both are spe¬ 
cial cases of a powerful theorem of modern analysis called the generalized Stokes 
Theorem. Students who wish to understand these relationships must study the 
theory of differential forms. 


PROBLEMS 


1 If S is a closed surface that lies in a region of space in 
which a vector field F is defined, show that 

jj (curl F) • n dA = 0. 

5 

2 If the vector field F = —yi + *j + Ok is the velocity field 
of a flowing fluid, sketch enough of this field in the 
*y-plane (that is, sketch enough of the velocity vectors 
attached to various points) to understand the nature of 
the motion. Then calculate curl F, let C be the circle R = 
r cos fli + r sin 6] + Ok (0 < 6 < 2tt) in the *y-plane, 
and verify the formula 

(curl F) • n = - 7 - F • dR 
A J C 

for this circle and its interior in the *y-plane. 

3 Repeat Problem 2 for the vector fields 

(a) F = ax j, where a is a positive constant; 

(b) F = fir) R, where r is the length of the position vec¬ 


tor R = *i + yj + zk and fir ) is an arbitrary differ¬ 
entiable function. 

In Problems 4-10, apply Stokes’ Theorem to find fcF • dR 
for the given F and the given C. In each case let C be ori¬ 
ented counterclockwise as seen from above. 

4 F = y(x - z)i + (2x 2 + z 2 )j + y 3 cos xz k; C is the 
boundary of the square 0 < * < 2, 0 < y < 2, z = 5. 

5 F = (z - y)i + yj + xk; C is the intersection of the top 
half of the sphere* 2 + y 2 + z 2 = 4 with the cylinder r = 
2 cos 6. 

6 F = yi + (x + y)j + (x + y + z)k; C is the ellipse in 
which the plane z = x intersects the cylinder x 2 + y 2 = 1. 

7 F = (y - *)i + (* - z)j + (* - y)k; C is the boundary 
of the triangular part of the plane * + y + 2z = 2 that 
lies in the first octant. 

8 F = (3y + z)i + (sin y - 3*)j + ( e z + *)k; C is the cir¬ 
cle x 2 + y 2 = 1, z = 5. 

9 F = 2zi + 6*j — 3yk; C is the ellipse in which the plane 
z = y + 1 intersects the cylinder x 2 + y 2 = 1. 
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10 F = e x2 \ + (x + z) sin y 3 j + ( y 2 — x 2 + 2yz)k; C is the 
boundary of the triangular part of the plane x + y + z = 
3 that lies in the first octant. 

In Problems 11-14, verify Stokes’ Theorem for the given F, 

S, and C. 

11 F = (z ~ y)i + (x + z)j - (x + y)k, S is the part of the 
paraboloid z = 9 — x 2 - y 2 that lies above the xy-plane, 
and C is its boundary circle x 2 + y 2 = 9 in the xy-plane, 
oriented counterclockwise as seen from above. 

12 F = xy\ + y^j + zx k, S is the part of the plane x + y + 
z = 1 that lies in the first octant, and C is its boundary 
oriented counterclockwise as seen from above. 

13 F = y\ - xj, S is the top half of the sphere x 2 + y 2 + 
z 2 = 4, and C is its boundary circle x 2 + y 2 = 4 in the 
xy-plane, oriented counterclockwise as seen from above. 


14 F = (x + y)i + (y + z)j + (z + x)k, S is the elliptical 
disk x 2 /a 2 + y 2 /b 2 ^ 1, z = 0, and C is its boundary ori¬ 
ented counterclockwise as seen from above. 

15 Let S be the top half of the ellipsoid x 2 + y 2 + z 2 /9 = I, 
oriented so that n is directed upward. If F = x 3 \ + y 4 j + 
z 3 sin xy k, evaluate 

JJ (curl F) • n dA 

s 

by replacing S by a simpler surface with the same bound¬ 
ary. 

16 Repeat Problem 15 if F = xz 2 i + x 3 j + cos xzk and S is 
the top half of the ellipsoid x 2 + y 2 + 4z 2 = 1 with n di¬ 
rected upward. 


21.6 

MAXWELL’S 
EQUATIONS. A FINAL 
THOUGHT 


To gain a slight glimpse of the significance of the ideas of this chapter, we look 
very briefly at the famous equations formulated in the 1860s by James Clerk 
Maxwell (1831-1879). These equations are remarkable because they contain a 
complete theory of everything that was then known or would later become known 
about electricity and magnetism. Maxwell was the greatest theoretical physicist 
of the nineteenth century, and an excellent account of his life and work is given 
by James R. Newman in Science and Sensibility, vol. 1, pp. 139-193 (Simon and 
Schuster, 1961). 

In Maxwell’s theory there are two vector fields defined at every point in space: 
an electric field E and a magnetic field B. The electric field is produced by charged 
particles (electrons, protons, etc.) that may be moving or stationary, and the mag¬ 
netic field by moving charged particles. 

All known phenomena involving electromagnetism can be explained and un¬ 
derstood by means of Maxwell's equations : 


1 V • E = —. 

e* 


2 


V 


X E = - 


9B 

dt ' 


3 V • B = 0. 

4 c 2 VxB = J-+^, 

e 0 it 


Here q is the charge density, 6q is a constant, c is the velocity of light, and j is 
the current density (not to be confused with the unit vector in the direction of 
the y-axis). We make no attempt to discuss the meaning of these four equations, 
but we do point out that the first two make statements about the divergence and 
curl of E, and the second two about the divergence and curl of B. Equivalent ver¬ 
bal statements of Maxwell’s equations are given by Richard Feynman (Nobel 
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Prize, 1965) on p. 18-2 in vol. 2 of his Lectures on Physics (Addison-Wesley, 
1964): 


,, _i r . . . , x , charge inside 

1 Flux of E through a closed surface =-. 

2' Line integral of E around a loop = —(flux of B through the loop). 

at 

y Flux of B through a closed surface = 0. 

„, n „ , current through the loop 

4 c~ (integral of B around a loop) = - 


+ — (flux of E through the loop). 
at 


By a “loop,” Feynman means what we have called a simple closed curve. The 
fact that these verbal statements are indeed equivalent to Maxwell’s equations 1 
to 4 depends on Gauss’s Theorem and Stokes’ Theorem. This is perhaps easier 
to grasp when these verbal statements are expressed in terms of line and surface 
integrals: 


1 " 


is 


E • n dA — — 
6o 


(.S is a closed surface). 


2 " 


E • dR 


d_ 

dt 


n 


B • n dA 


(C is a simple closed curve and S is a sur¬ 
face bounded by C). 


3" JJb • n dA = 0 

5 


(S is a closed surface). 


4" 


0 

c- 


f B • t/R = — f[j-ndA + 

C 60 JJ 


ill E -" M 


(C is a simple closed 
curve and S is a surface 
bounded by C). 


Our only purpose in mentioning these matters is to try to make it perfectly 
clear to the student that the mathematics we have been doing in this chapter has 
profoundly important applications in physical science. Feynman devotes the first 
21 chapters in vol. 2 of his Lectures to the meaning and implications of Maxwell’s 
equations. At one point he memorably remarks: 


From a long view of the history of mankind—seen from, say, ten thousand years from 
now—there can be little doubt that the most significant event of the 19th century will 
be judged as Maxwell’s discovery of the laws of electrodynamics. The American Civil 
War will pale into provincial insignificance in comparison with this important scien¬ 
tific event of the same decade. 


In making this provocative comment, perhaps Feynman was carried away by his 
ebullient enthusiasm—but perhaps not. 
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CHAPTER 21 REVIEW: CONCEPTS, THEOREMS 


Think through the following. 

1 Line integral. 

2 Fundamental Theorem of Calculus for line integrals. 

3 Conservative field in the plane: three equivalent proper¬ 
ties. 

4 Green’s Theorem. 

5 Divergence of a vector field: definition, formula. 


6 Surface integral. 

7 Gauss’s Theorem (or the Divergence Theorem). 

8 Curl of a vector field: definition, meaning. 

9 Stokes’ Theorem. 

10 Conservative (or irrotational) field in space: four equiv 
alent properties. 


APPENDIX 

A 


THE THEORY 
OF CALCULUS 


I mean the word proof not in the sense of the lawyers, who set two half proofs equal 
to a whole one, but in the sense of the mathematician, where \ proof = 0 and it is de¬ 
manded for proof that every doubt becomes impossible. 

Carl Friedrich Gauss 

Certitude is not the test of certainty. We have been cocksure of many things that were 
not so. 

Oliver Wendell Holmes 


When considered for its own sake, and apart from any uses it may have, the real number 
system appears as an intricate intellectual structure whose endless complexities are of in¬ 
terest mainly to mathematicians. However, from the practical point of view, it is the foun¬ 
dation on which virtually all other branches of mathematics rest, and as such, it underlies 
every quantitative aspect of civilized life. 

Most of us learn in school how to use real numbers for counting, measurement, and 
solving algebraic problems. Nevertheless, no matter how much skill of this kind we de¬ 
velop, few of us ever confront the question of just what the real numbers are. Our pur¬ 
pose here is to answer this question as briefly and clearly as possible. In doing so, we 
will also provide an adequate basis for the capsule discussions of the theory of calculus 
that are given in the following sections. 

There are several ways to introduce the real number system. We adopt the most effi¬ 
cient of these—the axiomatic approach—in which we start with the real numbers them¬ 
selves as given undefined objects possessing certain simple properties that we use as ax¬ 
ioms. This means we assume there exists a set R of objects, called real numbers , that 
satisfy the 10 axioms listed in the following pages. All further properties of real numbers, 
regardless of how profound they may be, are ultimately provable as logical consequences 
of these. The axioms fall into three natural groups. Those of the first group are stated in 
terms of the two operations + and • , addition and multiplication, which can be applied 
to any pair x and y of real numbers to produce their sum x + y and their product x • y 
(also denoted more simply by xy). 


A.l 

THE REAL NUMBER 
SYSTEM 


Algebra Axioms 

1 Commutative laws: x + y = y + x, xy = yx. 

2 Associative laws: x + (y + z) = (x + y) + z, *(yz) = (xy)z. 
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4 Existence of identity elements: There exist two distinct real numbers, denoted by 0 
and 1 , such that 0 + * = ;c + 0 = ;c and 1 • x = x ■ 1 = x for every x. 

5 Existence of negatives: For each x there exists a unique y such that x + y = y + 
x = 0. 

6 Existence of reciprocals: For each x =£ 0 there exists a unique z such that xz = 
zx = 1 . 


The number y in axiom 5 is customarily denoted by -x, and z in axiom 6 by \fx or 
x~K Subtraction and division can now be defined by x - y = x + (— y) and x/y = x(\ly). 
All the usual laws of elementary algebra can be deduced from these axioms and these de¬ 
finitions. We illustrate this process by giving three very brief proofs. 

Example 1 (i) x + y = x + z implies y = z (the cancellation law of addition). Proof : 

Since * + y = x + z, (-*) + (x + y) = (~x) + (x + z); by axiom 2, [(-*) + x] 4- y = 
[(-*) + x] + z; by axiom 5, 0 + y = 0 + z; and by axiom 4, y = z. 

(ii) * • 0 = 0. Proof : Axiom 4 gives 0 + 1 = 1, so *(0 + 1) = x • 1; by axiom 3, x • 
0 + x • 1 = x • 1; by axiom 4, x*0 + ;c = x = 0 + x;by axiom 1, x + x • 0 — x + 0; and 
by (i), *-0 = 0. 

(iii) (—1)(— 1) = 1. Proof Axiom 5 gives 1 + ( -1) = 0, so on multiplying by - 1 and 
using axioms 3, 4, and (ii) we obtain (-1) + (—1)(—1) = 0; and adding 1 to both sides 
of this yields (—!)(-!)= 1 after careful reduction. 


The next group of axioms enables us to establish an order relation in the real number 
system. It is convenient to introduce this relation indirectly, by basing it on a concept of 
positiveness. This means we assume there exists in R a special subset P, called the set of 
positive numbers, that satisfies the three axioms listed below. The statement that a num¬ 
ber x is in the set P is symbolized by writing 0 < ;c, or equivalently x > 0. 

Order Axioms 

7 For each x, one and only one of the following possibilities is true: x = 0, x > 0, 
-x > 0 . 

8 If x and y are positive, so is x + y. 

9 If x and y are positive, so is xy. 

We now introduce the familiar order relations < and > as follows: x <y is defined to 
mean y - x > 0, and x > y is equivalent to y < x. As usual, x < y means that x < y or 
x = y, and x > y is equivalent to y < x. All the customary rules for working with in¬ 
equalities can be proved as theorems on the basis of these axioms and definitions. 

Example 2 It is quite easy to show that for any real numbers x and y one and only one 
of these possibilities is true: x = y, x <y y x > y (proof: apply axiom 7 to the number y - 
x). We next consider the proofs of the following familiar facts: 

If x < y and y < z, then x < z. 

If x > 0 and y < z, then xy < xz. 

If x < 0 and y < z, then xy > xz. 

If x < y then x + z < y + z for any z. 

The definitions allow us to express these statements in equivalent forms that are more 
convenient from the point of view of providing proofs: 
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If y — x > 0 and z ~ y > 0, then z — x> 0. 

If jc > 0 and z ~ y > 0, then xz - xy > 0. 

If — x > 0 and z ~ y > 0, then xy — xz > 0. 

If y — x > 0, then (y + z) — (x + z) > 0 for any z. 

The first of these assertions is an obvious consequence of axiom 8, the second and third 
follow directly from axiom 9, and the fourth is trivial, since (y + z) — (x + z) = y ~ x. 


The program of carefully deducing all the algebraic and order properties of R from ax¬ 
ioms 1 to 9 is rather long and boring, and no useful purpose would be served by pursu¬ 
ing this aspect of the matter any further. It is quite enough for students to understand that 
this program can be carried out, and we omit the details. 

The nine axioms given above do not fully determine the real number system. This is 
most easily seen by noticing that the set Q of all rational numbers is a number system 
different from R that also satisfies all nine axioms. Of course, the difference between Q 
and R is simply that Q lacks the irrationals, which any workable number system ought to 
have. One more axiom is needed to guarantee that R is free from this defect, or equiva¬ 
lently that the real number system has no “gaps” or “holes.” 

Two preliminary definitions are necessary before our final axiom can be stated. Both 
refer to an arbitrary set S of real numbers. A real number b is called an upper bound 
for S if x < b for every x in S. Further, a real number b 0 is called a least upper bound 
for S if (i) b 0 is an upper bound for S, and (ii) b 0 < b for every upper bound b of S. A 
set has many upper bounds if it has one, but it can have only one least upper bound. 
The proof is easy: If b 0 and b\ are both least upper bounds for S, then b 0 < b\ (since bo 
is a least upper bound and b\ is an upper bound) and b\ < bo (since b\ is a least upper 
bound and bo is an upper bound), so bo = b \. This argument permits us to speak of 
the least upper bound of S. These concepts can be visualized in the usual way, as sug¬ 
gested by Fig. A.l. 

Example 3 The set of all positive integers has no upper bound. If S is the closed inter¬ 
val 0 < x < 1, then the numbers 1, 2, 3.74, and 513 (among others) are all upper bounds 
of S, and 1 is its least upper bound. The same statements are true if S is the open inter¬ 
val 0 < x < 1. In the first case, the least upper bound 1 belongs to the set S, but in the 
second case it does not. The set S consisting of all numbers in the sequence 

J_ 2_ 3_ n 

r r 4’ *’*’ n + r *“ 

also has 1 as its least upper bound. 


The following is the final axiom for the real number system R. 

Least Upper Bound Axiom 

10 Every nonempty set of real numbers that has an upper bound also has a least upper 
bound. 


Least upper 

bound b 0 u PP er bounds b 


l 



5 


Figure A.l 
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This axiom guarantees that the real number system has the property of “completeness” 
or “continuity” that is absolutely essential for the development of calculus. The best way 
to grasp the significance of this axiom is to observe that it is not true for the set Q of ra¬ 
tional numbers: If S is taken to be the set of positive rationals r such that r 2 < 2, then S 
has an upper bound in Q but does not have a least upper bound in Q (the least upper bound 
of S in R is V2, but this number is not in Q). 

Remark 1 We have implied, but not actually stated, that the 10 axioms given here com¬ 
pletely characterize the real number system R. The meaning of this statement can be clar¬ 
ified by formulating our ideas on a more abstract level, as follows. In modern algebra a 
set of objects that satisfies axioms 1 to 6 is called a field. There are many different fields, 
some finite and others infinite. The simplest consists of the two elements 0 and I alone, 
with addition and multiplication defined by 

0 + 0 = 0, 0 + 1 = 1 + 0 = 1, 1 + 1=0, 

0-0 = 0 , 0 - 1 = I - 0 = 0 , 1-1 = 1 . 

A field that satisfies the additional axioms 7 to 9 is called an ordered field. Both Q and 
R are ordered fields, but there are also a number of others. It can be proved that an or¬ 
dered field must have infinitely many distinct elements, so some fields—including the 
two-element field just mentioned—cannot be ordered. We use axiom 10 to narrow our 
scope still further, and an ordered field that satisfies this axiom is called a complete or¬ 
dered field. It can be proved that any two complete ordered fields are abstractly identical 
in a very precise sense, so there is really only one, namely, Rf It is therefore possible to 
define a real number very simply, as an element in a complete ordered field. However, it 
is clear that no such definition can be considered satisfactory without a good deal of pre¬ 
liminary explanation and proof. 

Remark 2 There may be a few exceedingly skeptical readers who find themselves think¬ 
ing thoughts like these: “What this writer says sounds reasonable enough, provided the 
red number system R exists in the first place. But how do we know that it does? After 
all, this number system is not a physical object that can be seen and touched, but a cre¬ 
ation of the mind—like a unicorn—and perhaps we deceive ourselves by supposing that 
it exists.” 

There are two ways to answer this objection. One is to give a concrete definition of R 
as the set of all infinite decimals, with the usual agreement that such decimals as 
0.25000 . . . and 0.24999 ... are to be considered equal. Addition, multiplication, and the 
set of positive numbers must now be given satisfactory definitions, and in this scheme of 
things our axioms 1 to 10 become theorems whose proofs lean heavily on these defini¬ 
tions. This program is surprisingly difficult to carry out. f 

A second approach is to use the much more basic positive integers as a given supply 
of building materials for the explicit step-by-step construction of the real number system 
—first the integers, then the rationals, and finally the reals. This time the axioms I to 10 
appear as theorems that can be deduced from assumed properties of the positive integers.* 

We do not encourage students to investigate these matters any further, for there is no 
part of mathematics more tedious and unrewarding than the detailed construction of the 
real number system by either of these methods. 


*For a fuller discussion, with proofs (or sketches of proofs), see pp. 1-8 in the first volume of 
E. Hille’s Analytic Function Theory (Ginn and Co., 1959). 

"^See Chapter 1 of J. F. Ritt’s Theory of Functions (King’s Crown Press, 1947). 

*The classical source for this construction is E. Landau, Foundations of Analysis (Chelsea, 1951). 
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We begin by recalling the definition of the limit of a function from Section 2.5. Consider 
a function /(x) that is defined for values of x arbitrarily close to a point a on the x-axis 
but not necessarily at a itself. Another way to express this requirement is to say that there 
are x’s in the domain of the function that satisfy the inequalities 0 < \x — a\ < 8 for every 
positive number 8. Under these circumstances, the statement that 


A. 2 

THEOREMS ABOUT 
LIMITS 


lim f(x) = L 

A'—># 


is defined to mean the following: For each positive number e there exists a positive num¬ 
ber 8 with the property that 

I/M ~ L\< e 

for every x in the domain of the function that satisfies the inequalities 

0 < \x - a\ < 8. 

In the hope of clarifying the meaning of this definition, we examine the way it is used 
in a simple special case. It is obvious by inspection that 

lim (3x — 1) = 2 . ( 1 ) 


However, to prove this by using the definition, we must start with an arbitrary positive 
number e and find a 8 > 0 that “works” for this e, in the sense that 

0 < \x - 1 1 < 8 implies |(3x - 1 ) - 2 | < e. (2) 

But the last inequality here is the same as |3x - 3| < e or — e < 3x — 3 < e; and after di¬ 
vision by 3 this becomes — ye < x — 1 < ye. This suggests that 8 = ye might work. To 
show that it does, we observe that ifO<|x-l|<ye then -ye < x - 1 < ye, which im¬ 
plies — e < 3x — 3 < e or |(3x — 1) — 2| < e. Thus, for any e > 0 the number 8 = ye ac¬ 
tually has the property stated in (2). The requirement of the definition is therefore satis¬ 
fied, and ( 1 ) is proved. 

It is natural to object to this procedure and to feel that carefully proving a transparent 

statement like (1) is empty mumbo-jumbo and a waste of time. However, the point is this: 

(I) is obviously true and doesn’t really need a proof, but many important limits are far 
from obvious and cannot be dealt with by simple inspection. For instance, it is no exag¬ 
geration to say that large parts of advanced mathematics would disappear like a puff of 
smoke without the ideas and methods that depend on the vital limits 

lim - Sm X = 1 and lim (1 + x) l/x = e. 

A ->0 x 

(The fundamental constant denoted by e is officially introduced in Chapter 8 ; its approx¬ 
imate value is 2.71828.) We clearly need powerful tools to cope with limits like these, not 
vague ideas and fuzzy concepts. We have proved (1) not for its own sake, but in order to 
illustrate the use of the definition of the limit of a function. This definition is not intended 
to be merely a passive description in the sense of most dictionary definitions. On the con¬ 
trary, it is a sharp-edged instrument of proof that is capable of being manipulated effec¬ 
tively in complex and subtle arguments where sloppy thinking brings nothing but confu¬ 
sion. We have two purposes in the theorems and proofs given below: first, to establish the 
results themselves, and thereby provide a solid logical foundation for all of our work that 
depends on limits of functions; and second, to further illustrate the use of the definition 
in the machinery of formal proofs. 

Our first theorem states a fact that most people take for granted without fully realizing 
it, namely, that a function /(x) cannot approach two different limits as x approaches a. 
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Theorem 1 If lim*-** fix) = L\ and Y\m x ^ a f(x) = L 2 , then L\ = L 2 . 


Proof Our method of proof is to show that the assumption L\ =£ L 2 leads to the absurd 
conclusion \L\ - L 2 1 < \L\ ~ L 2 [ We therefore assume that L\ =£ L 2 , so that \L\ — L 2 \ is 
positive, and we let e be the positive number \\L\ — L 2 |. By the first hypothesis there ex¬ 
ists a number 8\ > 0 such that 

0 < |jc - a\ < implies \f(x) — L { \ < e, 

and by the second hypothesis there exists a number 82 > 0 such that 

0 < \x ~ a\ < 82 implies | f(x) - L 2 \ < e. 

Define 8 to be the smaller of the numbers 8 \ and & 2 - Then 0 < |jc — a\ < 8 implies both 

\f(x)-L x \<e and \f(x) - L 2 \ < e, 

and therefore 

\L l -L 2 \ = \[L l -f(x)] + [f(x)~L 2 ]\ 

£ |L, -fix )| + | fix) - L 2 | 

< € + e = 2e = \L\ — L 2 \. 

This contradiction—that the number \L\ — L 2 \ is less than itself—shows that it cannot 
be true that \L\ — L 2 \ is positive, so L\ = L 2 . 


Theorem 2 If fix) = jc, then linv^/00 = a\ that is, 

lim x — a. 

x—ta 


Proof Let e > 0 be given, and choose 8 = e. Then 0 < |x — a| < 5 = e implies that 
1/00 - a\ < e, since fix) = x. 


Theorem 3 If fix) = c, where c is a constant, then lim X ^ a f(x) = c; that is, 

lim c = c. 

x-^a 


Proof Since \f(x) - c\ = \c — c\ = Ofor all jc, any 8 > 0 will do, because |/00 - c\ will 
be < e for any given e > 0 and all x 


Theorem 4 If \im x -> a f (*) = L and lim*^ g(;c) = M, then 

(/) lim*-^ [/00 + g(x)] = L + M\ 

(“) Hm*^ [f(x) - g(x)] = L - M\ and 
(Hi) lim fix)g(x) = LM. 


Proof For (i), let e > 0 be given, let 5j > 0 be a number such that 
0 < \x — a\ < implies | fix) — L\ < \e, 

and let 8 2 > 0 be a number such that 
0 < \x — a\ < 82 


implies 


\g(x) -M\< 
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Define 8 to be the smaller of the numbers <5j and 82. Then 0 < \x - a\ < 8 implies 
|[/M + *<*)] ~(L + M)\= \[f(x) - L ] + [g(x) - M]\ 
s |/(jc) - L\ + |gU) - M\ 


< fe + je = e. 


and this proves (i). 

The argument for (ii) is almost identical with that just given and will be omitted. 

In proving (iii), we wish to make the difference f(x)g(x) - LM depend on the differ¬ 
ences f{x) - L and g(x) - M. This can be accomplished by subtracting and adding f(x)M, 
as follows: 


\f(x)g(x) - LM\ = |[/Mg(.c) ~ f(x)M] + U(x)M - LM]\ 


=£ I f(x)g(x) ~f(x)M | + \f(x)M - LM\ 

= l/wllsw - m\ + Mm - L\ 

^ l/W||gto -M\ + (\M\ + 1 )|/W - L\. 


Let 6 > 0 be given. We know that there exist positive numbers 5], 8 2 , 83 such that 
0 < I* - a\ < < 5 j implies | f(x) — L\ < 1, which in turn implies \f(x)\ < \L\ + 1 ; 



0 <\x - a\< 82 implies 



0 < |x - a\ < 83 implies 


Define 8 to be the smallest of the numbers 8 2 , 82, 83. Then 0 < \x — a\ < 8 implies 
|/to*W - LM\ < {e + ye = e, 
and the proof of (iii) is complete. 


Theorem 5 If Y\m x ^ a f(x) = L and lirn r _, a g(x) = M where M =£ 0, then 


x ^>a g(x) M' 



Proof By Theorem 4 [part (iii)] and the fact that 



it suffices to prove that 


x ^>a g(^) M 

We begin with the fact that if g(;c) =£ 0, then 


_J_1_ |gC*) - M\ 

g(x) M |M||gU)| ' 


Choose <5j > 0 so that 

0 < |x — a\ < 8\ implies |g(x) - M\ < ^\M\. 


For these ;t’s we have 
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\gW\ > \ \M\ 

and therefore 

_|_ 1 _ 

gW M 

Let e > 0 be given and choose <$2 > 0 so that 


1 <. 2 


I*M| N’ 




0 < \x ~ a\ < 82 implies |g(;t) — M\ < 


|A/| : 


We now define 8 to be the smaller of the numbers 8 \ and 82 and observe that 


0 < \x — a\ < 8 implies 
and this concludes the argument. 


1 


1 


g(x) M 


<^.M 6 = e 

M 2 2 


Theorem 6 If there exists a positive number p with the property that 

g(x) </« ^ h(x) 

for all x that satisfy the inequalities 0 < \x — a\ < p, and if \im x ^ a g(x) = L and lim x ^ a 
h{x) = L, then 

lim f(x) = L. 

x-*a 


Proof This statement is sometimes called the “squeeze theorem,” because it says that a 
function squeezed between two functions approaching the same limit L must also approach 
L (see Fig. A.2). For the proof, let e > 0 be given, and choose positive numbers 8 \ and 
82 so that 


and 


0 < \x ~ a\ < 81 


implies L — e < g(jt) < L + e 


0 < \x - a\ < 82 implies L - e< h(x) < L + e. 

Define 8 to be the smallest of the numbers p, 8 1 , §2 . Then 0 < \x — a\ < 8 implies 
L ~ e < g(x) < f(x) < h(x) < L + e, 
so I f(x) — L\ < € and the proof is complete. 


y 



Figure A.2 
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We continue with the proofs of a few simple facts about continuous functions that fol¬ 
low almost immediately from these theorems about limits. First, however, let us recall that 
a function/(x) is said to be continuous at a point a if 

I'm fix) = f(a). 

X — 

It is sometimes convenient to use the epsilon-delta version of this statement: for each 
e > 0 there exists a 8 > 0 with the property that 

I/to - f(a) I < e 

for every * in the domain of the function that satisfies the inequality 

\x — a\< 8. 


Theorem 7 lff(x) and g(x) are continuous at a point a, thenf{x) + g(x),f(x) — g(x), 
andf(x)g(x) are also continuous at a. Further, f(x)/g(x) is continuous at a if g(a) =£ 0. 


Proof We prove only the statement about f(x) + #(*), the other arguments being simi¬ 
lar. Since f(x) and g(jc) are continuous at a, we have 

lim f{x) =f(a) and lim g(x) = g(a). 

x~^a x—>a 

Part (i) of Theorem 4 now guarantees that 

lim [f{x) + g(x)] =f{a) + g(a), 

X — 

and this proves that fix) + g(jt) is continuous at a. 


Theorem 8 The functions fix) = x and g(jt) = c, where c is a constant, are continu¬ 
ous for all values of x. 


Proof These statements follow at once from Theorems 2 and 3. 

Theorem 9 Any polynomial 

P(x) = a n x n + a n -\x n ~ l + • • • + a\x + ao (3) 

is continuous for all values of x. 


Proof By Theorem 8 and the multiplication part of Theorem 7, each of the following 
functions is continuous for all values of x: x, x 2 = x - x, x 3 = x • x 2 x k for any pos¬ 
itive integer k, and cx k where c is any constant. Since the constant term ao is continuous, 
this tells us that each term of (3) is continuous for all values of jc, and we obtain the con¬ 
clusion by repeated application of the addition part of Theorem 7. 


Theorem 10 Any rational function 


R(x) = 


P(x) 

Q(xY 


where P(x) and Q(x) are polynomials, is continuous for all values of x for which 

GW * 0. 


Proof This is an immediate consequence of Theorem 9 and the division part of Theo¬ 
rem 7. 
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We conclude this section by proving that “a continuous function of a continuous function 
is continuous.” 


Theorem 11 If g(x) is continuous at a andf(x) is continuous at g{a ), then the com¬ 
posite function f(g(x)) is continuous at a. 


Proof Let e > 0 be given. Since /(jc) is continuous at g(a), we know that there exists a 
8 ] > 0 such that 

!/(#(■*)) “ f(g(a ))| < e (4) 


if 


\g(x) - g(a )| < Si. (5) 

But g(x) is continuous at a, so there exists a 8 > 0 such that |jc — a\ < 8 implies |g(jc) — 
g(a)| < 5]. We therefore see that \x - a\ < 8 implies (5), which in turn implies (4), and 
this is all that is needed to complete the proof. 


A.3 

SOME DEEPER 
PROPERTIES OF 
CONTINUOUS 
FUNCTIONS 


We recall that a closed interval [a, b] on the jc-axis is an interval which includes its end¬ 
points a and b. A function is said to be continuous on a closed interval if it is defined and 
continuous at each point of the interval. Functions of this kind have several important 
properties that we now discuss and prove. 


Theorem 1 (Boundedness Theorem) Let f(x) be a function continuous on a closed 
interval [a , b]. Then /(jc) is bounded on [a, b]\ that is, there exists a number C with the 
property that |/(jc)| < C for all x in [a, b]. 


A good way to study a theorem like this critically is to see what happens if the hy¬ 
potheses are weakened or removed. In Theorem 1 there are two main hypotheses: (1) the 
interval [a, b] is closed; and (2) the function /(jc) is continuous at each point of the in¬ 
terval. We show by examples that if either hypothesis is weakened, then the conclusion 
of the theorem can be false. 


y 



3 2 


Figure A.3 


Example 1 The function /(jc) = \/x is clearly continuous on the closed interval [1, 2], 
so according to Theorem 1, /(jc) should be bounded on this interval. Indeed, a bound C 
is easy to find: 

|/(jc)| < 1 for all jc in [1, 2]. 

Further (see Fig. A.3),/(jc) is continuous on the closed interval [1 In, 2] for any positive 
integer n, and in this case the number n is a bound: 

|/(jc)| ^ n for all jc in [1 In, 2]. 

On the other hand, /(jc) is also continuous on the nonclosed interval (0, 2], but /(jc) is 
not bounded on this interval. For, no matter how large a value of C we take, there are 
points in the interval for which /(jc) > C; specifically, if 0 < jc < 1/C, then/(jc) = 1/jc > 
C. This shows that the hypothesis requiring that the interval [a, b] be closed is necessary. 
We now extend the definition of /(jc) to include the point jc = 0, by putting 


/« = 



if 0 < jc < 2, 
if jc = 0. 


H —►jc 

2 


This function is defined on the entire closed interval [0, 2], and it is unbounded on this 
interval for the same reason. This time the conclusion of Theorem 1 is false because the 
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function f(x) is not continuous at each point of the closed interval: it is discontinuous at 
the single point * = 0 . 

These remarks show that the hypotheses of Theorem 1 cannot be weakened, and the 
following proof demonstrates that with both hypotheses in place the conclusion of the the¬ 
orem is inescapable. 


Proof of Theorem 1* Our proof uses the fact that a nonempty set of real numbers with 
an upper bound necessarily has a least upper bound (see Appendix A. 1). Let S be the set 
of all points c in [a, b] with the property that/(;c) is bounded on [a, c\ It is clear that S 
is nonempty and has b as an upper bound, and therefore has a least upper bound which 
we denote by cq. We claim that cq = b. To establish this, suppose that cq < b. Since f{x) 
is continuous at x = cq , it is easy to see that f(x) is bounded on [co — e, Co + e] for some 
6 > 0. Since/U) is also bounded on [a, cq — e], it is clearly bounded on [a, cq + e]. This 
contradicts the fact that cq is the least upper bound of S, so c o = b. This tells us that 
f(x) is bounded on [a, c] for every c < b. One more step is needed to finish the proof. 
Since f(x) is continuous at x = b, it is bounded on some closed interval 
[b - e, b\ By what we just proved, f(x) is also bounded on [a, b — e], so it is bounded 
on all of [a, b]. 

If a function f(x) is bounded on [a, b], then its range—the set of all its values—has 
an upper bound and a lower bound. If M and m are the least upper bound and greatest 
lower bound of the range, then 

m </(*) < M for all x in [ a,b ]. 

For bounded functions in general, the numbers M and m need not belong to the range. 
However, our next theorem asserts that if f(x) is continuous, then both numbers M and m 
are actually assumed as values of the function. 


Theorem 2 (Extreme Value Theorem) Letf(x) be a function continuous on a closed 
interval [ a , b\ Then f(x) assumes a maximum value M and a minimum value m\ that 
is, there exist points x\ and X 2 in [a, b] such that 

f(M) «/(* 2 ) 

for all x in [a, b\ 


This statement is intuitively clear if we think of a continuous function on a closed in¬ 
terval as one whose graph consists of a single continuous piece, without any gaps or holes; 
for as we move along the curve from the left endpoint ( a , f(a)) to the right endpoint 
(b, f(b), we feel compelled to believe that there must be a high point on the curve where 
f(x) has its maximum value and a low point where f(x) has its minimum value. This is 
true, but the situation is again very delicate, because if either hypothesis is weakened— 
even slightly—then the conclusion of the theorem can be false. 


Example 2 Consider the function f(x) defined by f(x) = x on the nonclosed interval 
[ 0 , 1 ), and also the function g(x) defined by 


8 (x) = 


x 

0 


if 0 <*< 1 , 
if 1 < x < 2 


on the closed interval [0, 2]. Both functions are shown in Fig. A.4. The function/U) does 
not assume a maximum value even though it is continuous on the interval [ 0 , 1 ), because 


"Some of the details of the proofs in this Appendix are left for students to fill in. 
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Figure A.4 




this interval is not closed; and the function g(x) does not assume a maximum value even 
though the interval [ 0 , 2 ] is closed, because g(x) is discontinuous at the single point x = 
1. In each case the values of the function get close to the number 1 (which is the least 
upper bound M of the range) as jc —> 1 from the left, but there is no point where the func¬ 
tion actually has the value 1 . 


Proof of Theorem 2 We prove the statement about assuming a maximum value. By The¬ 
orem 1 ,/U) is bounded on [a, b], so the range has an upper bound and therefore a least 
upper bound M. We must show that there exists a point x 2 in [a, b] such that/(x 2 ) = M. 
Suppose there is no such point, that is, suppose that f(x) < M for all x in [a, b]. Then 
M — f(x) is positive on [a, b], the function 


g(x) = 


1 


is continuous on [a, b], and Theorem 1 implies that this function is bounded. This means 
that there exists a number C such that 


for all x in [a, b], so 


1 _ 

M ~f(x) 


< C 


~ /(*) or f(x)< M - — 

This contradicts the fact that M is the least upper bound of the set of all/M’s and we are 
thereby forced to the desired conclusion: there exists at least one point x 2 in [a, b] for 
which f(x 2 ) = M. The statement that f(x) assumes a minimum value at some point x\ is 
proved similarly. 

The Extreme Value Theorem says that a function continuous on a closed interval actu¬ 
ally takes on a maximum value and a minimum value. There is a companion to this the¬ 
orem which states that such a function also takes on every value between its maximum 
and minimum values. Thus, a function continuous on a closed interval has a range which 
is itself a closed interval. To put it another way, such a function does not skip any values. 
We begin with a preliminary theorem that has many applications of its own (see Sec¬ 
tion 4.6). 


Theorem 3 Let f(x) be a function continuous on a closed interval [a, b], Iff(a) and 
f{b) have opposite signs, that is, if 

f(a)<0<m or f(a) > 0 >f(b), 

then there exists a point c between a and b such that /(c) = 0 . 
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This says—in effect—that the graph of a function continuous on a closed interval can¬ 
not get from one side of the x-axis to the other side without actually crossing this axis at 
a definite point (Fig. A.5, left). However, this conclusion can be false if the function fails 
to be continuous even at a single point. This is shown (Fig. A.5, right) by the function 
fix ) defined on the interval [1, 3] by 


fix) = 


if 1 < x < 2 , 
if2<x<3. 


Proof of Theorem 3 Suppose first that f{a) < 0 < fib). Since fia) < 0 and fix) is con¬ 
tinuous at x = a, there exists a number d in the open interval ia, b) such that fix) is neg¬ 
ative on [a, d). Let c be the least upper bound of the set of all such d% and observe that 
fix) is negative for all x < c. It cannot be true that/(c) > 0, for by continuity this would 
imply that fix) is positive on some interval (c - e, c], contrary to what we have just ob¬ 
served. Also, it cannot be true that/(c) < 0, for by continuity this would imply that fix) 
is negative on some interval [a, c + e), contrary to the definition of c. We conclude that 
/(c) = 0. The argument for the other case is similar. 


Theorem 4 (The Intermediate Value Theorem) Let fix) be a function continuous 
on a closed interval [a, b]. If M and m are the maximum and minimum values of fix) 
on [a, b\ and if C is any number between M and m so that m < C < M, then there ex¬ 
ists a point c in [a, b] such that /(c) = C. 


Proof The function g(x) = fix) — C is also continuous on [a, b\. If x\ and X 2 are points 
in [a, b] at which/(xj) = m and /(x 2 ) = A/, then g(x) is negative at xi and positive at x 2 ; 

gUi) =/C*i) ~ C = m - C < 0 

and 

gix 2 )=f(x 2 )-C = M-C>0. 

By Theorem 3, there exists a point c between X! and x 2 (and therefore in [a, b]) such that 
gic) = 0. But this means /(c) - C = 0 or /(c) = C. 

As another consequence of Theorem 3, we have 


Theorem 5 Let fix) be a function continuous on the closed unit interval [0, 1 ] which 
has the further property that its values also lie in this interval iFig. A.6 ). Then there 
exists at least one point c in [ 0 , 1 ] such that /(c) = c. 


Proof The function g(x) = fix) - x is continuous on [0, 1] and has the property that 
giO) = /(0) - 0 =/(0) > 0 and g(l) =/(l) — 1 ^ 0. By Theorem 3, there exists a point 
in c in [ 0 , 1 ] such that gic) = /(c) - c = 0 , so /(c) = c. 


Figure A.S 
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Figure A.6 
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A function f(x) with the properties assumed here is often called a continuous mapping 
of the interval [ 0 , 1 ] into itself and the point c is called a fixed point of this mapping. 
Theorem 5 is a special case of a famous and far-reaching theorem of modern mathemat¬ 
ics called Brouwer's fixed point theorem, which asserts that continuous mappings of cer¬ 
tain very general spaces into themselves always have fixed points. 



THE MEAN VALUE 
THEOREM 



This theorem is one of the most useful facts in the theoretical part of calculus. In geo¬ 
metric language, it is easy to state and intuitively plausible. It asserts that between any 
two points P and Q on the graph of a differentiable function there must exist at least one 
point where the tangent line is parallel to the chord joining P and Q, as shown in Fig. 
A.7. For the curve in the figure there are two such points. There may be many or there 
may be only one, but the theorem guarantees that there must always be at least one such 
point. By using the notation in the figure, we can express the statement of the theorem 
analytically by saying that there exists at least one number c between a and b (a < c < 
b) with the property that 


f\c) 


m -m 

b - a 


The significance of the Mean Value Theorem lies not in itself but in its consequences, for 
it provides a convenient way of getting a grip on many theoretical facts of practical im¬ 
portance. This will become clear in Theorems 3 and 4, and also in later sections of this 
appendix. 

A rigorous proof of the Mean Value Theorem is usually developed in the following way. 
We begin by establishing the special case of the theorem in which the points P and Q 
both lie on the x-axis: 


Theorem 1 (Rolle’s Theorem*) If a function f(x) is continuous on the closed inter¬ 
val a < x < b and differentiable in the open interval a < x < b, and if f{a) = f(b) = 
0 , then there exists at least one number c between a and b with the property that 
f\c) = 0. 


This theorem says that if a differentiable curve touches the ;c-axis at two points, then 
there must be at least one point on the curve between these points at which the tangent 
is horizontal (Fig. A. 8 ). Equivalently, the zeros of a differentiable function are always sep¬ 
arated by zeros of its derivative. 

Economists have a maxim, “There is no such thing as a free lunch.” For us—in the 
realm of pure mathematics—this means we cannot get something for nothing; or in other 
words, strong conclusions require strong hypotheses. The conclusion of Rolle’s Theorem 
depends heavily on its hypotheses, and the following examples show that these hypothe¬ 
ses cannot be weakened without destroying the conclusion. 


Example 1 The function 


fix) = 


x 

2 -x 


0 < x < 1, 
1 < x < 2 


*Michel Rolle (1652-1719) was an otherwise obscure French mathematician whose name is insepa¬ 
rably linked to one of the principal foundation stones of the theory of calculus. This is deliciously 
ironic, because, having studied the emerging subject and finding it unconvincing, he vigorously at¬ 
tacked it in the French Academy of Sciences as a bundle of ingenious fallacies. When his friends 
convinced him that things were not quite as bad as that, the opposition faction collapsed and the new 
analysis began a century of rapid and luxuriant growth in continental Europe. Rolle stated a poly¬ 
nomial version of his theorem in an almost-forgotten book on the algebra of solving equations (1691). 
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(see Fig. A.9) is zero at x = 0 and x = 2 and is continuous on the closed interval 0 < x < 
2. It is differentiable in the open interval 0 < x < 2, except at the single point x - \, where 
the derivative does not exist. The derivative fix) is clearly not zero at any point in the in¬ 
terval, and this failure of the conclusion of Rolle’s Theorem arises from the fact that the 
function fails to be differentiable at a single crucial point. 


Example 2 The function 


fix) = 


x 

0 


0< jf < 1, 
x- 1 


(see Fig. A. 10) is zero at x = 0 and x = 1 and is differentiable in the open interval 0 < 
x < 1. It is continuous on the closed interval 0 < x < 1, except at the single point x = 1. 
The derivative fix) is not zero at any point in the interval, and in this case the failure of 
the conclusion of Rolle’s Theorem arises from the discontinuity of the function at a sin¬ 
gle point. 


Proof of Theorem 1 By Theorem 2 in Appendix A.3, our continuity hypothesis implies 
that/(;c) assumes a maximum value M and a minimum value m on [a, b\ The fact that 
fix) is zero at the endpoints a and b tells us that m < 0 < M. If fix) is zero at every point 
of [a, b], then clearly /'(c) = 0 for every c in (a, b), and in this trivial case the conclu¬ 
sion is true. We may therefore suppose that the function assumes nonzero values, so ei¬ 
ther M > 0 or m < 0 (or perhaps both). We first consider the case in which M > 0. If c 
is a point at which/(c) = M, then a < c < b because the function is zero at the endpoints 
a and b. Since fix) is differentiable in the open interval a < x < b, the derivative 


f\c) = lim 

x^c 


fix) ~fjc) 
X — c 


( 1 ) 


exists/ It is part of the meaning of (1) that this limit must exist and have the same value 
when x approaches c from the left and from the right. If x approaches c from the left, we 
have 


x - c < 0 and fix) - /(c) < 0, 

where the second inequality follows from the fact that/(c) = M is a maximum value. This 
implies that 


^Equation (I) is clearly an equivalent way of writing 


f(c) = I™ 


/(c + Ax) —/(c) 

Ax 


Figure A.8 

y 



o 1 2 

Figure A.9 



0 1 

Figure A.10 
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/V) = lim MJ£) 20 . 

x-*c- X — C 

Similarly, if x approaches c from the right, we have 

x - c > 0 and fix) — /(c) < 0, 
so 

+ X — C 


( 2 ) 


(3) 


We conclude from (2) and (3) that/'(c) = 0, as asserted. If M = 0, then m < 0, and this 
case can be treated by a similar argument. 

Our main theorem can now be stated as follows (see Fig. A.7). 


Theorem 2 (Mean Value Theorem) If a function f {x) is continuous on the closed in¬ 
terval a < x < b and differentiable in the open interval a < x < b, then there exists at 
least one number c between a and b with the property that 


f\c) = 


m -f{g) 

b - a 


(4) 


Proof It is easy to see that the equation of the chord joining P and Q in Fig. A.7 is 


y = f(a) + 


m zJM 

b - a 


(x - a). 


The function 


FW =/« ~f(a) 


m -m 

b - a 


(x ~ a) 


(5) 


is therefore the vertical distance from the chord up to the graph of y = /(*). It is easy to 
see that the function (5) satisfies the hypotheses of Theorem 1, so there exists a point c 
between a and b with the property that F\c) = 0. But this is equivalent to 

o, 

b - a 


which in turn is equivalent to (4), so the proof is complete. 


We now consider some of the applications of this theorem. 

It is clear that the derivative of a constant function is zero. Is the converse true? 
That is, if the derivative of a function is zero on an interval, is the function necessarily 
constant on that interval? At the beginning of Section 5.3 we encountered an important 
piece of reasoning about indefinite integrals in which this converse was needed, and 
we took it for granted. We are now in a position to prove it by using the Mean Value 
Theorem. 


Theorem 3 If a function f(x) is continuous on a closed interval /, and if f\x) exists 
and is zero in the interior of I, then f{x) is constant on I. 


Proof To say that /(;c) is constant on I means that it has only a single value there. To 
prove that this is the case, suppose it has two different values, say f{a) ^f(b) for a < b 
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in /. Then the Mean Value Theorem implies that for some c between a and b we have 


f\c) = 


m -m ^ Q 

b — a 


But this cannot be true, sine tf\x) = 0 at all points in the interior of /. This contradiction 
shows that/U) cannot have different values in /, and is therefore constant on /, as we 
wished to prove. 


At the beginning of Chapter 4 we based our work on curve sketching on the “intuitively 
obvious” fact that a function is increasing or decreasing according as its derivative is pos¬ 
itive or negative. The Mean Value Theorem makes it possible to give a rigorous proof of 
this. 


Theorem 4 Letf{x) be a function continuous on a closed interval I and differentiable 
in the interior of I. If f\x) > 0 in the interior of I, then f{x) is increasing on I. Simi¬ 
larly, iff'(x) < 0 in the interior of I, thenf(x) is decreasing on /. 


Proof We shall prove only the first statement, in which we assume that f\x) > 0 in the 
interior of /. For any two points a < b in /, the Mean Value Theorem tells us that 


f\c) = 


m-f(a) 

b — a 


for some c between a and b. But f\c) > 0, so the fraction on the right side of this equa¬ 
tion is positive. Since b — a is positive, it follows that f{b) — f(a) is also positive, so 
f{a) <f(b) and consequently f{x) is increasing on /. 


Finally, we use Rolle’s Theorem to prove a technical extension of the Mean Value The¬ 
orem that is needed for establishing L’Hospital’s rule in Chapter 12. 


Theorem 5 (Generalized Mean Value Theorem) Letf(x) and g(x) be continuous on 
the closed interval a < x ^ b and differentiable in the open interval a < x < b, and as¬ 
sume further that g'(x) ± 0 for a < x < b. Then there exists at least one number c be¬ 
tween a and b with the property that 

m = m -m 

g\c) g(b) - g(a )' 


Proof We begin by noticing that if g(a) = g(b), then by Rolle’s Theorem g'Qt) vanishes 
at some point between a and b , contrary to hypothesis. Therefore g(a) =£ g(b), and the 
right side of ( 6 ) makes sense. To prove the theorem, consider the function 

F(x) = [f(b) -f(a)][g(x) ~ g(a)] ~ [f(x) ~ f{a)][g{b) - g(a)]. 

It is easy to see that this function satisfies the hypotheses of Rolle’s Theorem, so 
there exists a point c between a and b with the property that F\c) = 0. But this is 
equivalent to 

[f{b) ~f(a)]g\c) ~f\c)[g(b) - g{a)] = 0, 
which is equivalent to ( 6 ). 

Students should notice that this theorem reduces to Theorem 2 if g(x) = x. 
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A.5 

THE INTEGRABILITY 
OF CONTINUOUS 
FUNCTIONS 


In Section 6.4 the definite integral of a function over an interval was defined by means 
of a complicated passage to the limit, as follows. 

We start with an arbitrary bounded function f(x) defined on a closed interval [a, b]. We 
subdivide this interval into n equal or unequal subintervals by inserting n — 1 points of 
division x\, x 2 , . . ., x n -\, so that 

a = x 0 < x\ < x 2 < • * * < x n -\ <x n = b. (1) 

These points are said to constitute a partition P of [ a , b] into the subintervals 

[*o. *iL [*i. x 2 ],. . . , [x n -u x n ]. 

If Axi< = x k — x k ~\ is the length of the /:th subinterval, then the length of the longest subin¬ 
terval is called the norm of the partition and is denoted by the symbol ||P||, 

||P|| = max {A*!, Ajc 2 , . . ., Ajc„}. 

In each of the subintervals **] we choose an arbitrary point x%- We now multiply 

the value of the function/(x) at the point x k by the length Ax k of the corresponding subin¬ 
terval and form the sum of these products as the subscript k varies from 1 to n, 

J /(4) A**. (2) 

k= 1 

For each positive integer n we consider all possible partitions (1) and all possible choices 
of the points x k , and therefore all possible values of the sum (2). If there exists a number 
I such that the sum (2) approaches / as n —> °o and ||P|| 0, regardless of how the parti¬ 

tions P are formed and the points x k are chosen, then we call this number / the definite 
integral (or briefly the integral) of f(x) on [a, b] and denote it by the symbol 

/ = P/M dx. 

Ja 

Under these circumstances the function f(x) is said to be integrable on [a, b\ It is cus¬ 
tomary to express these ideas by writing 

I f(x) dx = lim jr f(xt) A**, (3) 

]a l|/’ll-»o £ri 

where there is no need to specify that n because this is implied by the stronger con¬ 
dition ||P|| 0. 

As we said at the beginning, the limit operation in (3) is quite complicated and bears 
only a superficial resemblance to such straightforward limits as 

lim (x 2 + 1) = 5 and lim f 2 + —\ = 2. 

x —>2 n—»°o \ yi J 

In each of these cases we consider the behavior of a certain function in terms of the be¬ 
havior of an independent variable, but (3) does not lend itself to this way of thinking. We 
could try to use ||p|| as an independent variable, and describe the limit in terms of the idea 
expressed by the symbol ||p|| —> 0. But this is difficult, because the sum (2) is not a sin¬ 
gle-valued function of the quantity ||P||; to a given value of ||P|| there correspond an infi¬ 
nite number of different partitions P and an infinite number of ways of choosing the points 
xi, and therefore an infinite number of values of the sum (2). 

The complexity of the limit operation in (3) is a considerable inconvenience when it 
comes to giving rigorous proofs of theorems. The cumbersome notation required for such 
proofs forces the reasoning itself to be awkward and clumsy. For this reason, it is cus¬ 
tomary in modem treatments of the theory of integration to define the definite integral in 
a very different way, one which avoids appealing to any kind of passage to the limit. We 
now describe this more convenient approach and use it to prove our main theorem. 
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We therefore ignore our previous definition and begin all over again at the beginning, 
with an arbitrary bounded function/(x) defined on a closed interval [a, b ]. Since/(x) is 
bounded, it has a greatest lower bound m and a least upper bound M. If P is any given 
partition of [a, b], we denote by m * and M * the greatest lower bound and least upper bound 
of f{x) on the klh subinterval [x*-i, x*]. (If/(x) were assumed to be continuous on [a, b], 
then by Theorem 2 in Appendix A.3 the ra’s and M’s would be minimum values and max¬ 
imum values of the function. But we are not assuming continuity at this stage, so we must 
work instead with greatest lower bounds and least upper bounds.) We now form the lower 
sum 


and the upper sum 


sp = X Ax* 

k= 1 


n 


S P - ^ M k Ax*. 

k — 1 


It is obvious that s P < S P . Further, we have the important 


Lemma Every lower sum is less than or equal to every upper sum :; that is, if P \ and 
P 2 are any two partitions of [a, b\ then s P: < Sp 2 . 


Proof It is east to see that if a single point is added to a partition, then the lower sum is 
unchanged or increases and the upper sum is unchanged or decreases; and the same is 
true if any finite number of points are added to produce a refinement of the given parti¬ 
tion. We now apply this fact to the new partition P 3 which is formed from the points of 
Pi and P 2 taken together. Since P 3 is clearly a refinement of both P\ and P 2 , it follows 
that 


Sp\ — sp 3 ^ Sp 2 < Sp 2 , 

which completes the argument. 

Among other things, this lemma tells us that every upper sum is an upper bound for 
the set of all lower sums, and that every lower sum is a lower bound for the set of all up¬ 
per sums. We can therefore form the least upper bound of all possible lower sums, which 
is called the lower integral and denoted by 

/ = f fix) dx. 

Similarly, the greatest lower bound of all upper sums is called the upper integral and de¬ 
noted by 

l = \ b fix) dx. 

J a 

At this point we make a further application of the lemma to conclude that 

/< 7 . 

Accordingly, every bounded function defined on a closed interval has a lower integral and 
an upper integral, and these two integrals are defined without making any appeal to the 
concept of a limit. If the lower and upper integrals coincide, then we call their common 
value the integral of f(x) on [a, b] and denote it by the usual symbol, 

I = f f(x) dx\ 

Ja 
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and in this case the function f(x) is said to be integrable on [a, b ]. On the other hand, it 
is quite possible to have I < I, in which case/(x) is not integrable. The function described 
in Remark 4 of Section 6.4 provides a good example of this recalcitrant behavior. 

We now come to our main theorem, which guarantees that most of the functions we 
meet in practice are integrable. First, a bit of new terminology that will be useful in the 
proof. If/Qc) is a bounded function defined on an interval [a, b], and if m and M are its 
greatest lower bound and least upper bound on this interval, then the difference M — m 
is called the oscillation off(x) on [a, b\. 


Theorem If a function f {x) is continuous on a closed interval [a , b], then it is inte¬ 
grable on [< a , b]. 


Proof Consider a partition P of [a, b] into subintervals [x k -\, x k ], and form the lower 
and upper sums 

n n 

= X an d Sp = y M k A**. 

k =1 k =1 

The difference between these sums is 

n 

Sp-s p = Y ( M k ~ m k ) &x h (4) 

k= 1 

where M k — m k is the oscillation of f{x) on the A:th subinterval [x*_i, x k ]. If we can show 
that the difference (4) can be made as small as we please by choosing a suitable partition 
P, then this will clearly be enough to prove the theorem. We accomplish this in the fol¬ 
lowing way. Let e be a given small positive number. If it can be shown that there exists 
a partition P such that the oscillation of the function is less than e/(b — a ) on every subin¬ 
terval, that is 


Mk ~ mk< T=Z 


for k = 1, 2, .. ., n. 


then it will follow that 

n n 

Sp — sp = Y (M k — m k ) L±x k < — -V /\x k = - - {b — a) = e. 

b ~ a fc i b-a 

Since e can be made as small as we please, this will complete the proof. 

We must therefore prove the existence of a partition P with the required property. If we 
simplify the notation by writing e x = e/(b - a), so that ej is perceived as merely another 
positive number that can be made as small as we please, then this property of the parti¬ 
tion P can be stated as follows: The oscillation of the continuous function f(x) on every 
subinterval of the partition must be less than ei.* 

We give an indirect proof, that is, we assume that for at least one number e\ > 0 no 
partition of the desired type exists, and we show that this assumption leads to a contra¬ 
diction. Let c be the midpoint of [a, b]. Then no partition of the desired type exists for at 
least one of the two subintervals [a, c] and [c, b], for if each of these subintervals has 
such a partition, then the full interval [a, b] also has. Let [a\, b\\ be that half of [a, b) 
with no such partition; and if both halves have no such partition, let [a\, b\] be the left 
half, [a, c]. Now bisect [a\, b i], and in the same way produce one of its halves, say 
[< 22 , bil with no such partition; and continue the process indefinitely. We observe that the 


This fact about a continuous function defined on a closed interval is usually referred to in the liter¬ 
ature as the Theorem on Uniform Continuity. 
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oscillation of f(x) on the nth subinterval [ a n , b n ] is at least €\, and also that the length of 
this subinterval is (b — a)/2 n . Let a 0 be the least upper bound of the set of left endpoints 
a\ y a 2 , « 3 ,. . « of this nested sequence of subintervals. Then a 0 certainly lies in the inter¬ 
val [a, b]\ and by the continuity of f(x) at a 0 , there exists an interval (a 0 - 8, a 0 + 8 ) in 
which the oscillation of f(x) is less than €\. However, if n is large enough, the interval 
[a n , b n ] lies wholly within the interval (a 0 “ 8, a 0 + 6), and therefore the oscillation of 
f(x) on [a n , b n ] must also be less than ei, contradicting our previous inference that the os¬ 
cillation of/U) on [ a n , b n \ is at least ei. This contradiction finally concludes the proof of 
the theorem. 


If students wonder whether a discontinuous function can be integrable, the answer is 
Yes. The function whose graph is shown in Fig. A. 11 provides an example of this asser¬ 
tion. It is defined on the closed interval [0, 1], and its values are 


for 0 < x 
2 2 

3 1 3 

4 for - < x < 4, 

4 2 4 


1 r 3 ^ ^ 7 

- for 


1 for x = 1. 

This function has an infinite number of points of discontinuity, but it also has the prop¬ 
erty of being nondecreasing, in the sense that;ti < *2 implies f(x\) < f(x 2 ), and any such 
function is integrable on any closed interval [a, b]. Students are invited to prove this for 
themselves by noticing that in this case the difference (4) can be written as 

n 

S P ~s P = ^ ( M k ~ m k ) kx k 

k= 1 

< ||P|| X (M k ~ m k ) = \\P\\[f(b) -f(a)l 

k— 1 

The set of all integrable functions can be characterized in a simple and absolutely precise 
way, but we do not pursue this matter any further here. 


y 






1 

2 


X 


Figure A. 11 
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A.7 

CONTINUOUS CURVES 
WITH NO LENGTH 
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The proof given here uses the Mean Value Theorem established in Appendix A.4 and as¬ 
sumes that students understand the concepts developed in Appendix A.5. 

To set the stage for the argument, we consider a function f(x) that is continuous on a 
closed interval [a, b\ If F(x) is any function such that F'(x) = f(x), we must prove that 

[ b f(x)dx= F(b) - F(a). (1) 

Ja 

We accomplish this by showing that the number o n the right of (1) lies between the lower 
sum and the upper sum associated with an arbitrary partition 

a = X0 < X\ < X2 < ' ' ' < *n -1 <x n = b (2) 

of the interval [a, b]. 

The reasoning is as follows. The function F(x) satisfies the hypotheses of the Mean 
Value Theorem on each subinterval of the partition (2). This theorem therefore guaran¬ 
tees the existence of points x*, x \,. . ., x* n in these subintervals such that 

F(x i) - F(a) = F\x\)(x i - a) -/(*!) 

F(x 2 ) ~ F{x i) = F\x\ )(x 2 ~ x t ) =f{x\) Av 2 , 


F(b) - F(x h -i) = F\xn)(b - x„-i) =f(x*„) Ax n . 

If we add these equations and take advantage of the cancellations on the left, we get 

n 

Fib) ~ Fid) = X /(*!) Ajc*. (3) 

k= 1 

The right side of (3) clearly lies between the lower sum and the upper sum associated 
with the partition (2), so the proof is complete. 


In Section 7.5 we introduced the concept of the length of a curve, and we suggested there 
that the theory of arc length is more complicated than it seems. We now offer a few fur¬ 
ther thoughts on this subject, with some examples to illustrate the bizarre situations that 
can arise. 

Let AB be the graph of a function y = f(x) that is continuous on a closed interval 
[a, b]. We connect A and B by a broken line whose vertices lie on the graph and use this 
line to define the length of the curve, as follows. We partition the interval [a, b] into n 
subintervals by inserting n — 1 points *i, * 2 , ■ • • , x n - b with x {) = a and x„ = b, as shown 
in Fig. A. 12. Let P k be the point (x k , y k ), where y k = f(x k ), and let L n be the length of the 
inscribed polygonal line: 

L n = AP\ + P\P 2 + • • • + P n ~\B. 

If for all partitions of [a, b ]—or equivalently, all choices of the points P 1 , Pi ,... , P n -1 
—the set of lengths L n is bounded, then the curve is said to be rectifiable and the least 
upper bound L of the set of L„’s is called the length of the curve. 

On the other hand, if the set of all lengths L n is unbounded, then the curve is said to 
be nonrectifiable and has no length. Most of the continuous curves we encounter are rec¬ 
tifiable, but it is not true that every continuous curve has this property. Since this contra¬ 
dicts the intuitive preconceptions most people have about curves, we shall give an exam¬ 
ple below of a continuous curve that has no length. 

First, however, a bit of theory, which we hope will clarify the ideas. We begin by con¬ 
sidering the case of a curve that rises as we move along it from left to right; that is, we 
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assume that the continuous function f(x) is increasing (Fig. A. 13). If we introduce the no¬ 
tation 


Ax* = x* - x k - u Ay* = y* - y k _ ,, 


then we see that 


P k -\P k < Ax* + Ay*, 

and by adding we obtain 

L n = AP\ + P\P 2 + • ■ • + P n -\B < AC + CB. 

This upper bound is independent of the number and location of the points P*, so the curve 
is rectifiable. A curve that falls as we move from left to right—the graph of a decreasing 
function — is easily seen to be rectifiable in the same way. 

A curve that rises and falls a finite number of times is clearly rectifiable, because the 
sum of the bounds for the parts is a bound for the whole. This is equivalent to saying that 
the graph of y = f(x) is rectifiable if it has a finite number of local maxima and minima. 
Nonrectifiable curves must therefore be highly oscillatory, with an infinite number of wig¬ 
gles. 

It is convenient to express this idea in a more precise way. For a general curve the num¬ 
ber A yk can be positive or negative, but always 

|Ay*| < P k -\Pk s Ax* + |Ay*|, 

and by adding we see that 

|Ay,| + • • • + |Ay„| <L n <b - a + |Ay t | + • • • + |Ay„|. 

If the set of all sums on the left, that is, the numbers 

V n = | Ay, | + • • • + |Ay„|, 

is bounded for all possible partitions of the interval [a, b], then the function y = f(x) is 
said to be of bounded variation on the interval. We see from this discussion that since 
V n < L n , if the set of L n 's is bounded, then the set of VVs is also bounded; and since 
L n ^ b - a + V n , if the set of VVs is bounded, then the set of L„ s is also bounded. We 
therefore have 


Figure A 


J 


A 


Figure A 
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Theorem 1 The graph of y = f(x) is rectifiable on an interval [a, b] if and only if 
f(x) is of bounded variation on this interval . 



If the set of Vfs is bounded, its least upper bound V is called the total variation of the 
function/(x) on the interval [a, b\ V is a measure of the total amount of rising and falling 
of the graph. As examples, for the function y = x 2 on -2 ^ x ^ 3 we have V = 13, since 
the graph falls four units and rises nine units; and for y = cos x on 0 < x < 27t we have 
V = 4. Each curve therefore has a length, but this length may be difficult to calculate. 

Our blanket assumption is that the function y = /(x) is continuous on the closed inter¬ 
val a < x < b. Let us now make the further assumption that/(x) has a derivative on the 
open interval a < x <b. Then the Mean Value Theorem tells us (Fig. A. 14) that there ex¬ 
ists a point x h x k -\ <x k < x with the property that 


f\x k ) = 




SO 


We can now prove 


lAy*| = \f\x k )\ Sx k . 


Theorem 2 Ify = /(x) has a bounded derivative on the open interval a < x < b, then 
its graph is rectifiable. 


The proof is easy: If \f\x)\ ^ M, then |Ay*| < MAjc* and 
V n = |Ajyi | + • • • + |Ay„| 

< M(A*i + A*2 + • • • + Ax n ) 

= M(b - a). 

The function is therefore of bounded variation, and by Theorem 1 its graph is rectifiable. 


Now for some instructive examples. 



Example 1 Define y = /(*) on 0 ^ x < 1 by 

f x cos tt/x if 0 < x < 1, 
f {x) = I 0 if JC = 0. 


The graph of this function is shown in Fig. A. 15; it clearly has an infinite number of wig¬ 
gles as it approaches its left endpoint. We take the points A, Pi, ... , P n -i, B to be the 
points on the graph corresponding to x = 0, 1 In, 1 /(n - 1),..., 1/2, 1. We know that 


/ T =T C0S k7T = 


(-D* 

k ' 


Vn - |4yi| + • • • + |Ay„| 



The right side of this is greater than the sum 



+ !. 


n 


Figure A.15 
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It is known from the elementary theory of infinite series that these sums are unbounded 
as n increases. The function f(x) is therefore not of bounded variation, and by Theorem 
1 its graph is not rectifiable. 


Example 2 We next consider a close relative of the function discussed in Example 1. 
This time we define y = f(x) on the interval 0 < Jt < 1 by 

f x 2 cos ir/x if 0 < jc < 1, 

/W = (o if jc = 0 . 

The graph is shown in Fig. A. 16, and it also has a infinite number of wiggles. The deriv¬ 
ative of this function in the open interval is 

fix) = 77 sin — + 2x cos —. 

XX 

This derivative is easily seen to be bounded, 

|/'Wl S 7T + 2, 

so by Theorem 2 its graph is rectifiable. Even though we know this curve has a length, 
the problem of finding its value is far beyond our capabilities. 



In this discussion we begin by defining e to be the limit 

e = lim f 1 + —] . (1) 

n-><» y n ] 

We then carefully extend this formula step by step until we reach the more general con¬ 
clusion that 


A.8 

THE EXISTENCE OF 
e = lim (1 + K) 1/h 


e = lim (1 + h ) l,h , 


( 2 ) 


where h is allowed to approach 0 in any manner whatever, through rational or irrational, 
positive or negative, values. 

Our first task is to prove the existence of the limit (1), and thereby to legitimize this 
definition of e. By the binomial theorem the quantity 


X n = 



can be express. 1 as the following sum of n + 1 terms, 


i + „. I + «&•: 

n 1 


1) 1 A n(n - 1 )(n ~ 2) 1 


= 1 + 1 + 


o' + 


1 • 2 


1-2-3 


1 -- 
n 


— + 
n 5 


1-2-3 


■4)K)+ 



( 3 ) 


As n increases, the number of terms in this sum increases, and also each term after the 
second increases. This shows that 


X\ < X 2 < *3 < ' ' ' < X n < X n+ \ < 
Also, the expansion (3) tells us that 

*„ <1 + i+T 1 i + t4-3 + -"+ — 


( 4 ) 


1-2-3 • • -72 
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A.9 

FUNCTIONS THAT 
CANNOT BE 
INTEGRATED 


<1 + 1 + (y + 3 r + "' + 2 ^) < 1 +1 +1 - 3 ’ 

since the expression in parentheses is part of the familiar geometric series 



( 5 ) 


By (4) and (5) the x n 's steadily increase but always remain < 3, so they necessarily ap¬ 
proach a limiting value. In the present context this limiting value is e by definition. This 
argument proves (1), and also we clearly have 


e = lim 

n—>°° 



_J_Vi+1 

n + 1 ) 


which will be needed below. 

We next consider the limit (2) for the special case in which h is required to approach 
0 through positive values. When h < 1, there exists a unique positive integer n such that 


n < — < n + 
h 


1. 


This implies that 



—J—V < (i + h) uH < 

n + 1 / 



n 


M+ 1 


which in turn can be written as 


[1 + \/(n + 1)]* +1 
1 + 1 l(n + 1) 


< (1 + h) ]/h < 



( 6 ) 


As h —> 0, n —> °° and the first and third terms of the inequality (6) approach e. Since (1 + 
h) l,h is caught between the two, it must have the same limit, and therefore (2) is proved 
for the case in which /? —> 0 through positive values. 

We conclude our analysis by establishing (2) for the case in which h approaches 0 
through negative values. If we put h = —k, then 

(i + h) m = a - kr uk = (737)^ 




k yi -m( 

1 -k) \ 


1 + 



—> e • 


= e, 


by the result of the previous paragraph. 


In spite of the many successes achieved by the methods of Chapter 10, certain integrals 
have always resisted every attempt to express them in terms of elementary functions: for 
instance, 


e * 2 dx, 

[ ~ dx, 

J x 

J COS X 2 

dx, 

dx 

f ^ , 

f sin x 

dx. 

In x ’ 

v sin x dx. 

I , 


There are also the so-called elliptic integrals, of which 


1 


dx 


J Vl — jc 3 dx and 


Vl -x 4 
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are examples.* * In the nineteenth century it was finally proved, by the great French math¬ 
ematician Liouville and his followers, that the problem of working out these integrals in 
terms of elementary functions is not merely difficult—it is actually impossible. 

The full depth of Liouville’s ideas cannot be plumbed in a calculus course. 1. Neverthe¬ 
less, it is quite possible to gain some impression of how these ideas work without neces¬ 
sarily undertaking a long program of preliminary study. 11 

Among other things, Liouville discovered and proved the following theorem: 

Iff(x) and g(x) are rational functions and g(x) is not a constant, and if f f{x)e 8 ^ dx 
is an elementary function, then this integral must have the form 

I /(jc)e* w dx = RetM 


for some rational function R. 

We illustrate the value of this theorem by using it to prove that the integral 

ffdx (I) 

is not elementary (that is, cannot be expressed in terms of elementary functions). Sup¬ 
pose, on the contrary, that this integral is elementary. Then by Liouville’s theorem we 
know that 


[ — dx = Re x 
J x 

for some rational function R. But this means that 

e x d e x 

— = — ( Re x ) or — = Re x + R'e x , 

x dx x 


so 


- = R + R'. 

X 


( 2 ) 


Since R is rational, it can be written in the form R = P/Q, where P and Q are polynomi¬ 
als with no common factor. We know that 


so (2) becomes 


which is equivalent to 


QP' ~ PQ' 

R ~ Q2 ’ 

1 = P QP'~ PQ' 

x Q Q 2 


Q 2 = PQx + x(QP' - PQ') 


or 


*In general, an elliptic integral is any integral of the form //?(*, y) dx, where R(x, y ) is a rational 
function of the two variables x, y and where y is the square root of a polynomial of the third or fourth 
degree in x. The name elliptic integral is used because an integral of this type arises in the problem 
of finding the circumference of an ellipse. 

^iouville’s theory is expounded in full in the monograph by J. F. Ritt, Integration in Finite Terms 
(Columbia University Press, 1948). 

*See D. G. Mead’s article “Integration,” Amer, Math. Monthly, 68 (1961), pp. 152-156. 
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QiQ ~ Px~ P'x) = - PQ'x . (3) 

Our purpose is to deduce a contradiction from (3), and we proceed as follows. Let x n be 
the highest power of* that can be factored out of the polynomial Q, so that Q(x) = x n Q\(x) 
where Qi(x) is a polynomial such that £?i(0) ^ 0- We first observe that n > 0; for if 
n = 0, so that 2(0) 0, then x = 0 reduces the right side of (3) to zero but not the left 

side, which cannot happen because (3) is an identity in x. This implies two facts that we 
need in order to obtain our final contradiction. First, P{ 0) =£ 0, because P and Q have no 
common factor and therefore x cannot be a factor of P. Second, we have 

Q\x) = x n Q\(x) + nx n ~ l Qfx) 

= 1 [*#(*)+ «fii(*)]; 

and since the polynomial in brackets has a nonzero value when x = 0, we know that*" -1 
is the highest power of x that can be factored out of Q'. These two facts taken together 
tell us that x n is the highest power of x that can be factored out of the polynomial on the 
right side of (3), whereas x n+l can be factored out of the left side. This contradiction 
brings us to the conclusion that (2) is impossible, so the integral (1) is not elementary. 

Remark 1 We know from our work in Section 6.7 that for any continuous integrand the 
definite integral 


F{x) = lo f(t) dt 


(4) 


exists and has the property that 


Since (5) is equivalent to 


dx 


F{x) = fix). 


[ f(x) dx = F{x), 


(5) 


we see that the indefinite integral of every continuous function exists. However, this fact 
has nothing to do with the issue of whether the integral can be expressed in terms of the 
elementary functions. When such an expression is not possible, formula (4) can be thought 
of as providing a legitimate and sometimes useful method for creating new functions. For 
example, the nonelementary function of x defined by 


1 


V27 T 


fV ' 2 ' 2 dt 

Jo 


has important applications in the theory of probability, and for this reason it has been stud¬ 
ied and tabulated and has thereby acquired a certain status as a “known function.” 


Remark 2 It is easy to see that the integral 
f e x 

— dx becomes 
J x 


I— 

J In t 


under the substitution t = e x \ for x = In t, dx = dt/t, and therefore 

(£l dx = j^d L = jjL, 

J x J In t t J In t 

Since we know that the first integral is not elementary, it is clear that the second integral 
is also not elementary. This is worth noticing because the function of x defined by 


f* dt 
h In t 


(6) 
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NOTE ON LIOUVILLE 

JosephLiouville( 1809-1882) was a highly 
respected professor at the College de France in Paris and the 
founder and editor of the Journal des Mathematiques Pures 
et Appliquees, a famous periodical that played an important 
role in French mathematical life through the latter part of 
the nineteenth century. For some reason, however, his own 
remarkable achievements as a creative mathematician have 
not received the appreciation they deserve. 

His theory of integrals of elementary functions as de¬ 
scribed above was perhaps the most original of all his 
achievements, for in it he proved that many familiar inte¬ 
grals are not merely difficult to work out, but actually im¬ 
possible. 

The fascinating and difficult theory of transcendental 
numbers is another important branch of mathematics that 
originated in Liouville’s work. The irrationality of 7r and e 
(that is, the fact that these numbers are not roots of any lin¬ 
ear equation ax + b = 0 whose coefficients are integers) had 
been proved in the eighteenth century by Lambert and Euler. 
In 1844 Liouville showed that e is also not a root of any 
quadratic equation with integral coefficients. This led him 
to conjecture that e is transcendental, which means that it 
does not satisfy any polynomial equation 

a n x n + a n ~\X n ~ 1 + • ■ • + a\x + ao = 0 
with integral coefficients. His efforts to prove this failed, but 


his ideas contributed to Hermite’s success in 1873 and then 
to Lindemann’s 1882 proof that ir is also transcendental. 
Lindemann’s result showed at last that the age-old problem 
of squaring the circle by a ruler-and-compass construction 
is impossible. One of the great mathematical achievements 
of modem times was Gelfond’s 1929 proof that e n is tran¬ 
scendental, but nothing is yet known about the nature of any 
of the numbers ir + e, ire, or if. Liouville also discovered 
a sufficient condition for transcendence and used it in 1851 
to produce the first examples of real numbers that are prov- 
ably transcendental. One of these is* 

i uk = w + w + w + ' ■' = °- 1: 110001 " 30 • • • • 

His methods here have also led to extensive further work in 
the twentieth century. 

The ancient Greek philosopher-scientist Democritus said, 
“I would rather discover one cause than be King of Persia/’ 
What Liouville accomplished was certainly better than be¬ 
ing King of Persia, or being any king or political leader what¬ 
soever. He was a thinker whose work will live as long as 
people care about beautiful ideas. 


*For the details of this, see pp. 288-290 of the present writer’s book, 
Calculus Gems (McGraw-Hill, 1992). 



is of great importance in the theory of prime numbers, and the behavior of this func¬ 
tion for large values of jc has been studied exhaustively for more than a century.* [The 
lower limit of integration in (6) is chosen to be 2 in order to avoid the point t = 1, where 
In / = 0.] 

*Seepp. 2-4 of H. M. Edwards, Riemanns Zeta Function (Academic Press, 1974). 


PROBLEMS 


1 Consider an integral of the form J7?(sin x, cos x) dx, 
where the integrand is a rational function of sin x and 
cos x. Show that the substitution 

z = tan \x 



+ z 2 , 


cos x = cos 2 



1 -Z 2 

1 +z 2 ’ 


sin x = 


2 z 

1 +z 2 ’ 


and 


dx = 


2 dz 

1 + z 2 ' 


converts this integral into the integral of a rational func¬ 
tion of z, which can then be worked out by routine pro¬ 
cedures. Hint: Show that 


Use the method of Problem 1 to find 

dx „ x f sin x dx 


(a) / 


2 + cos x ’ 


(b) I 


2 + sin x ’ 
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3 


4 


Use the method of Problem 1 to find 
(a) / sec x dx; (b) / tan x dx. 

Express your answers in the usual form [i.e., In (sec x 4 
tan x) and —In (cos x)]. 

Use the method of Problem 1 to obtain the following 
formulas 


(a) 

(b) 


i 

i 


dx _ f _ 2 dz 

a + b sin x J az 2 + 2 bz 4 a ’ 

_ dx _ 

a 4 b sin x + c cos x 


-i 


_ 2dz _ 

(a — c)z 2 + 2 bz + (a + c)' 


(c) 

(d) 


/ 

/ 


sin x dx 
1 4 sin x 

cos x dx 
1 4 cos x 


f 4 z dz 

J (i +z ) 2 (i +z 2 y 

f (1 - z 2 ) dz 

J 1 + z 2 • 


A rationalizing substitution is a change of variable that elim¬ 
inates radicals or fractional exponents. Find the following in¬ 
tegrals by using this idea. 


5 f — . Hint: Put u = Vx. 

J 


/ 


I + Vx 
+ 1 
Vx - I 

dx 

Vx + ^/x 

3V^r dx 
40 +x 3/4 )‘ 

Vx 


Jx. 


: . Hint: Put u = Vx. 


dx. 


11 


13 


1 + x 

Vi 

1 4 Vx 


dx. 


Vx + 2 




10 


12 


14 


v-2/3 


4 x 
dx 


dx. 


41 - Vx) 
V7T T 


x 4 3 

15 The special elliptic integral 


dx. 


du 


V(1 - m 2 )( 1 - ^t/ 2 ) 


is called the elliptic integral of the first kind. Show that 
each of the following integrals can be brought into this 
form by means of the indicated substitution: 


(a) / 

u 

(b) / 


dx 


V1 - k 2 sin 2 x 
u = sin x; 

dx [ 


-! 


du 


V(1 - u 2 )(\ - k 2 u 2 ) 


du 


Vcos 2x V(1 — w 2 )(l - 2 u 2 ) 


u = sin x; 



dx 

Vcos x 



_ du _ 

V(1 - w 2 )(l - 2 u 2 ) 


. 1 

u — sin — x; 

M)J . A 

Vcos x sn cos a 

du 

V(1 — m 2 )( 1 - ichV)' 


u = sin — x and k = esc — a. 

2 2 

16 Consider the integral in part (b) of Problem 15, 



f dx 
Vcos 2x 



dx 

- 2 sin 2 x 


Show that the substitution u = tan x transforms this in¬ 
tegral into the special elliptic integral 

f du 

' Vi - 4 

17 If p and ^ are rational numbers, show that the integral 
| x^( 1 - x)^ dx (*) 


is elementary in each of the following cases: 

(a) p is an integer (hint: if q = m/n with n > 0, put 1 — 
x = u n )\ 

(b) q is an integer; 

hint: 


(c) p + q is an integer 

I xP{ 1 - x)0 dx = j 


X P+C! 


dx 


The Russian mathematician Chebyshev proved that these 
are the only cases for which the integral (*) is elemen¬ 
tary. 1 Accordingly, 


| Vx Vl — x dx , J Vx Vl — x dx, 



are not elementary. 

18 Use the theorem of Chebyshev stated in Problem 17 to 
prove that each of the following integrals is not elemen¬ 
tary: 

(a) /VI — x 3 dx; 

(b) /V1 - x 4 dx; 

(c) /V 1 — x n dx, where n is any integer > 2; 


f See Ritt, p. 37. 
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(d) — , where n is any integer > 2. 

1 Vl - JC" 

19 Use Problem 17 to prove that 

(a) /Vsin x dx is not elementary (hint: put u = sin 2 x)\ 


(b) /sin^ x dx, where p is a rational number, is elemen¬ 
tary if and only if p is an integer; 

(c) /sin^ x cos^ x dx, where p and q are rational num¬ 
bers, is elementary if and only if p or q is an odd in¬ 
teger or p + q is an even integer. 


In making the direct substitutions discussed in Section 10.2, our procedure was to put 
u = g(x) where g(x) was part of the integrand. For this method to work, we had to have 
du = g'(jt) dx as another part of the integrand, and this meant that altogether the integrand 
had to have a rather special form. 

A much more natural way to change the variable in an integral / f(x) dx is to introduce 
a new variable u by writing x = h(u) and dx = h\u) du, where h(u) is some function that 
is suggested by the form of the integral. This means that if we translate the given integral 
from the x-notation to the w-notation by writing 


A. 10 

THE VALIDITY OF 
INTEGRATION BY 
INVERSE 
SUBSTITUTION 


j fix) dx = I f[h(u)]h\u) du = I g(u) du. 


(1) 


where g(u) = f[h(u)]h\u), then we hope the integral on the right will be easy to calcu¬ 
late. In fact, if 


j g(u) du = G(u), (2) 

then we expect to have 

j f(x) dx = G[kix)l (3) 

where u = k(x) is the inverse function of x = h(u).* This process is called inverse substi¬ 
tution. It is a very useful method, if we can find G(u ) and if we know the inverse func¬ 
tion u = k(x). These remarks constitute a general description of what is going on in the 
method of trigonometric substitutions. 

We can prove the validity of inverse substitution as follows. The point is this: In direct 
substitution as discussed in Section 10.2 we used the integral transformation (1) in the 
other direction, to calculate Jg(w) du. We showed that if 

j fix) dx = Fix), 


then 


I giu) du = F[h(u)]. 

Thus, in the present context, where we also have (2), it follows that 

F[h(u)] = G(u) + c 
for some constant c. But this says that 

F(x) = G\k{x)] + c, 

and therefore G[k(x)] is an integral of f(x), as claimed in (3). 

Even more, we can use the same method for dealing with definite integrals if the lim¬ 
its of integration are correctly changed; that is, 


*That is, u = k(x) is the result of solving x = h{u) for u in terms of a. The concept of an inverse func¬ 
tion is discussed in Remark 2 in Section 9.5. 
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j a fix) dx = | g(u) du, 

where c = k(a) and d = k(b). This can be established very easily by thinking of it in the 
other direction, as 

I g(u) du = fix ) dx, 

where a = h{c) and b = h(d), because this second version was proved in Section 10.2. 


A. 11 

PROOF OF THE 
PARTIAL FRACTIONS 
THEOREM 


Our purpose here is to establish the validity of the partial fractions decomposition as stated 
in a piecemeal manner in Section 10.6. We are considering a rational function P(x)/Q(x), 
and we assume that Q{x) is a polynomial of degree n that is completely factored into real 
linear and quadratic factors of various multiplicities. In the beginning we do not assume 
that P(x)/Q(x) is proper. This enables us to understand more clearly the significance of 
this assumption when it becomes necessary to make it. Our basic tool is the following 
lemma about the removal of a linear factor from the denominator. 


Lemma Let x — r be a linear factor of Q(x) of multiplicity 1 so that Q{x) = 
(x — r)Q\(x) with Q\(r) =£ 0. Then P(x)IQ(x) can be written in the form 

Pix) _ P(x) _ A PiCx) 

Q(x) ix - r)Q,(x) x-r Q ,W’ 

where A is a constant and P\(x) is a polynomial such that P\(x)!Q\(x) is a proper ra¬ 
tional function whenever P(x)/Q(x) is. The constant A can be calculated from either of 
the formulas 


P(r) _ P(r) 

GiW Q\rY 


( 2 ) 


Proof We must find a suitable A and P\(x), and we do this by letting (1) suggest what 
their definitions ought to be. With these definitions we then show that (1) is valid. 

By combining the fractions on the right side of (1), we see that A and P\(x) must be 
chosen so that the numerators are identical, 


Pix) = AQfx) + (x- r)P\(x). (3) 

Since this is to be an identity, it must hold in particular for x = r. This gives P(r) = 
+ 0> so we put 


A = 


Pjr) 

QxirY 


(4) 


This is a legitimate definition because Q\{r) =£ 0. Since 

Q(x) = (x~ r)Q\{x) and Q\x) = (x - r)Q[{x) + Qfx\ 

we see that Q\r) = Q\(r), and this establishes the second formula for A stated in (2). Us¬ 
ing the formula for A given by (4), we now solve (3) for AM, 


AM = 


PM ~ AQ X M 

x-r 


PM ~ [P(r)/Qi(r)]QiM 

x — r 


1 P{x)Q\{r) - P(r)Q\(x) 

Q\{r) 


x — r 


( 5 ) 
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We adopt this as our definition of Pi(x). It may appear that this function is not a polyno¬ 
mial. However, the numerator of this fraction is clearly a polynomial that has the value 0 
for x = r, so by the factor theorem of algebra it has x — r as a factor. The common fac¬ 
tor x — r can now be canceled from the numerator and denominator, and we conclude that 
Pi(x) is indeed a polynomial. We now show that (1) is valid when A and Pj(x) are de¬ 
fined as they are above: 

A P\(x) = AQ\(x) + (x - r)Pi(x) 
x - r Q x (x) (* ~ r)Qi(x) 

= [P{r)!Qx{r)]Q x {x) + [VQ x (r)][P(x)Qx(r) - P(r)Q x (x)] 

(x - r)Q\(x) 

= P(x) 

(x - r)Q x (x) * 

Finally, the statement that P x (x)IQ x (x) is proper whenever P(x)/Q(x) is proper follows from 
(3) by using the fact that the degree of Q\(x) is n - 1; for if the degree of Pj(x) is >n — 
1, then (3) shows that the degree of P(x) is > n. 

This lemma enables us to do everything we wish with respect to splitting off partial 
fractions generated by linear factors of Q(x). At this stage we specifically assume that 
P(x)/Q(x) is proper, so that each time the lemma is applied the residual rational function 
P\(x)/Q\(x) will also be proper. 

We first observe that if Q(x) can be factored completely into distinct linear factors, so 
that 


Q(x) = {x~ r i)(x - r 2 ) • * * (x - r n \ 


then 


P(x) = A\ + A 2 + ... + A n 
Q(x) x-r x x — r 2 x - r n ' 

for we can remove the factors from the denominator one at a time in accordance with the 
lemma. At the last step the residual denominator is x - r„, and since the numerator is nec¬ 
essarily of lower degree, this numerator must be a constant. 

Suppose next that x — r is a linear factor of Q(x) of multiplicity m, so that Q(x) = 
(x — r) m Q\(x) with Q\(r) # 0. To cope with this situation we apply the lemma repeatedly 
in a slightly different way. First, by (1) we have 

P(x) _ B m | P \ (x) 

C x-r)Q\(x) x-r Q\{x)' 

Dividing through by x — r and applying (1) again yields 

PM _ Bm P\(X) = B m B m - 1 P 2 (x) 

(x ~ r) 2 Q i(x) (x - r) 2 (x - r)Qi(pc) (x - r) 2 x - r QiW ‘ 

By continuing in this way, we find in the end that 

P(x) _ P(x) = B m Bx P m (x) 

Q(x) (x - r) m Q\(x) (x - r) m x - r Q x (x) 

_ Pm(x) B\ _j_ Bm 

2i(x) Jc-r (x — r) m ‘ 

In this manner we strip off all the linear factors from the denominator of our proper ra¬ 
tional function P(x)IQ(x) and generate the corresponding partial fractions as described in 
Section 10.6. 
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The rest of the proof requires an acquaintance with complex numbers, because the imag¬ 
inary zeros of a real polynomial come in conjugate pairs and this fact plays an essential 
role in the argument. Before we begin, it is necessary to observe that our fundamental 
lemma works in just the same way if the number r happens to be imaginary. 

Now suppose that x 2 + bx + c is a quadratic factor of Q(x) of multiplicity 1 which is 
irreducible in the sense that b 2 — 4c< 0, so that the roots r\ and r 2 of the equation x 2 + 
bx + c = 0 are conjugate complex numbers.* Then 

Q(x) = (x 2 + bx + c)Q 2 (x ) = (x - r,)(x - r 2 )Q 2 (x), 

and by two applications of our lemma we can find constants A i and A 2 and a polynomial 
P 2 (x) such that 


Pjx) _ P(x) Aj t A 2 i PjM 

Q(x) (x - r t )(x - r 2 )Q 2 (x) x - r\ x - r 2 Q 2 (x)' 

By using (2) we see that 


A\ = 


Pin) 

Q'in) 


and 


2 Q'(r 2 y 


and these formulas imply that A\ and A 2 are also conjugate complex numbers. By com¬ 
bining the corresponding partial fractions, we can now write 


P(x) _ Q4i +A 2 )x~ (A l r z +A 2 ri) P 2 (x) ^ Ax + B | P 2 {x) 
Qix) (x- r \){x- r 2 ) Q 2 (x) x 2 + bx + c Q 2 (x )’ 


where the numbers A = A\ + A 2 and B = ~{A\r 2 + A 2 r\) are real because n, r 2 and 
A\, A 2 are conjugate pairs of complex numbers. Also, we know from the last ex¬ 
pression that P 2 (x) is a real polynomial. If the factor x 2 + bx + c occurs with multi¬ 
plicity m > 1 , then we simply remove it over and over in the way used above with 
repeated linear factors. This produces exactly the partial fractions decomposition described 
in Section 10.6. 

When these procedures have been applied to each of the real linear and quadratic fac¬ 
tors of Q(x), and all the corresponding partial fractions have been stripped away, then there 
is nothing left of Q[x), the decomposition is complete, and the partial fractions theorem 
is fully proved. 

Up to this point we have said nothing about uniqueness, but it is worth remarking 
that a proper rational function can be decomposed into partial fractions in only one 
way. This will follow at once from our overall discussion if we can show in the lemma 
that the expansion (1) is unique. But this is easy; for if we assume two forms for this 
expansion, 


A , P\{x) = B | P 2 (x) 
x-r Qi(x) x~r Q\(x)' 

then we have 


AQ\(x) + (x~ r)P ,(*) = BQi(x) + (x - r)P 2 (x). 

By letting * —> r, we see from this that B = A, so A in (1) is unique; and this implies that 
Pi (*) in (1) is also unique. 


*If x 2 + bx + c were not irreducible, it would already have been factored into real linear factors in 
the “complete” factorization of Q(x) previously mentioned. 
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The convergence tests we discuss in this appendix are more delicate than the ratio test 
and enable us to arrive at a definite conclusion for many series with the property that 
a n+ \/a n —»1 from below. We begin with the following general theorem of Kummer.* 


Theorem 1 Assume that a n > 0, b n > 0, and 21 !b n diverges. If 
lim (b n -^--b n+ \\ = L, 


A.12 

THE EXTENDED RATIO 
TESTS OF RAABE AND 
GAUSS 


then 2 a n converges if L > 0 and diverges if L < 0. 


Proof If L > 0, then 

b n - . b > h > 0 

a n 

for all n ^ some no , so 

a„b n ~ a n+l b„+ l > ha n > 0 (1) 

for these n' s. This shows that {a n b n } is a decreasing sequence of positive numbers for 
n > no, so K = lim a n b n exists. It is now clear that 2 n=nfa n b n ~ a n +\b n + \) is a conver¬ 
gent telescopic series (with sum a no b nii ~ K), so by (1) and the comparison test we con¬ 
clude that 2 ha n converges, and therefore 2 a n also converges. 

Next, if L < 0 we have 

a„b n ~ a„+\b n +i < 0 

for all n > some no , so { a n b n } is an increasing sequence of positive numbers for these 
n's. It follows that 


G-n^n — ^nj^no Or a n — (^/to^no) ' i 

b n 


for n > no, so 2 a n diverges because 2l/&„ diverges. 


Students will observe that if we take b n = 1 in Rummer’s theorem, we obtain the ratio 
test. As another application we deduce Raabes testf 


Theorem 2 If a n > 0 and 


*The German mathematician Ernst Eduard Kummer (1810-1893) is remembered mainly for his work 
on the arithmetic of algebraic number fields, by means of which he proved Fermat’s last theorem for 
many prime exponents. He also contributed to geometry (the entity known as Kummer’s surface was 
much later found by Eddington to be related to Dirac’s theory of the electron) and extended Gauss’s 
work on hypergeometric series. He was a good-humored and rather easygoing man with a ready (and 
sometimes racy) wit. He taught at Breslau until 1855, when the death of Gauss dislocated the math¬ 
ematical map of Europe. Dirichlet succeeded Gauss at Gottingen, and Kummer replaced Dirichlet at 
Berlin. 

^Joseph Ludwig Raabe (1801-1859) was born of poor parents in Galicia and studied in Vienna. When 
cholera swept that city in 1831 he moved to Zurich. In 1833 the Austrian embassy in Bern demanded 
that the government in Zurich return Raabe to Austria, because he had broken Austrian law by tak¬ 
ing a position with the University of Zurich. This ludicrous demand was very sensibly ignored, and 
Raabe spent the rest of his life in various posts at the University. He was a man of unusual modesty 
and was considered a very gifted teacher. He is now known only for the convergence test discussed 
here, but he also worked on the summation of series, on systems of linear differential equations, and 
on the problem of the motion of the center of gravity of the planets. 
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fln+1 _ j _ A _|_ Al 
a n n n 

where A n —> 0, r/jerc 2a n converges if A > 1 diverges if A < 1. 


( 2 ) 


Proof Take b n = n in Rummer’s theorem. Then 

a n 


lim b n 


a n 


bn+\\ = lim 


= lim 


n~ \ 


- + ^M 

n n ) 


(n + 1) 


-1 + 


A(n+ 1) A n (n + 1) 


= A - 


and by Rummer’s theorem it follows that 2 a n converges if A > 1 and diverges if A < 1. 


For practical purposes, i t i s worth noting that Raabe’s test can b e formulated more con¬ 
veniently as follows: If a n > 0 and 

lim n ^1 — Gn+{ j = A, (3) 

then 2 a n converges if A > 1 and diverges if A < 1. To prove this, it suffices to express 
(3) in the equivalent form 

A “^f 1 ) =/i ~ Aa (4) 

where A n —> 0, since (4) is merely another way of writing (2). 


Example 1 If we apply the ratio test to the series 

1 1 -3 1 -3 -5 1 -3-5- •• (2k - 1) 

2 2-4 2-4-6 2 • 4 • 6 • • • (2 n) 

then we find that 


(5) 


a n +\ = 1-3 - 5 • • • (2ft - 1K2 n + 1) 2 • 4 • • • (2 n) 

a n 2 • 4 • • • (2n)(2n + 2) 1*3*-* (2n — 1) 

= \ 1 from below, 

2n + 2 


so the test fails, however 


n 


| 1 \ 
a n ) 


= n 



2n + ] \ 
2n + 2 J 


so (5) diverges by Raabe’s test. 


In + 2 V 


Example 2 Now consider the related series in which each term is squared, 


V 

2 

1 

1 • 3 " 

2 

1 

"1-3-51 

2 

[ 1 ■ 3 • 5 • ■ ■ (2n - 1) 1 

2 

r 

2-4 

\ 

2-4-6 

+ ■ ■ ■ + 

2 • 4 • 6 • • • (2 n) 


Here we see that 


®n+ 1 

a n 


(2 n + l) 2 
(2 n + 2) 2 


1 from below, 


( 6 ) 


so the ratio test fails again. Furthermore, we also have 
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n 


Qn +1 \ 

a n ) 



An 2 + An + 1 \ 
An 2 + 8n + 4 / 


so even Raabe’s test fails in this case 


4n 2 + 3 n 

4n 2 + 8n + 4 ' 


When A = 1 in Raabe’s test, we turn to Gauss’s test. 


Theorem 3 If a n > 0 and 

a n +1 _ ] _ A n 

a n n n ]+c 

where c > 0 and A n is bounded as n —> °o then Za n converges if A > 1 and diverges 
if A < 1. 


/Vrw/ If A =£ 1, the statement follows from Raabe’s test, since A„/h c —> 0. We therefore 
confine our attention to the case A = 1. Take b n = n In n in Rummer’s theorem. Then 


lim ( b„ - -^- L • b„+ 1 ) = lim 


= lim 


h In n - ( 1 - — + -yjj I (/i 4- 1) In (n + 1) 


(n 2 - 1) (n + 1) A n In (n + 1) 

n In n -In (n -F 1)---- 


= lim 


n In 


n 


K n + 
= -i + o - 0 = 


In (n + 1) (n + 1) A n In {n + 1) 


and the divergence of the series in this case is a consequence of Rummer’s theorem. 

Gauss actually expressed his test in a specialized form adapted to series in which a n +\/a n 
is a quotient of two polynomials having the same term of highest degree. This version is 
also known as Gauss’s test. 


Theorem 4 lfa n >0and 

a n+ 1 _ n k + an k ~ l + • • • 
a n n k + f3n k ~ l + • • • 

then 'Zan converges if (3 — a > 1 and diverges if (3 — a < 1. 


Proof If the quotient on the right of (7) is worked out by long division, we get 

an +1 _ j _ tx _|_ A n 

a n n n 2 ’ 

where A n is a quotient of the form 

yn k ~ 2 + • ■• 
n k ~ 2 + • •• 

and is therefore clearly bounded as n —> °° The statement now follows from Theorem 3 
with c = 1. 

Example 3 For a simple application we consider the series (6), for which Raabe’s test 
failed. Here we have 

a „+1 _ 4w 2 + An + 1 _ ” 2 + ” + 4 
a n An 2 + 8h + 4 n 2 + 2n + 1 ’ 

sojS — a = 2— 1 = 1 and the series diverges by Gauss’s test. 
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Example 4 Gauss’s original purpose in devising his test was to study the important hy¬ 
pergeometric series 

Y a(fl + D -"(<1 +w- \)b(b+\)---{b + n- 1) n 
^ n\c (c + 1) • • • (c + n — 1) X 

when x = 1: 

i + a ' b . + 1 Mb + 1) t 

1 • c 1 • 2 c(c + 1) 

+ + 1)6(6 + 1) — • (b + /i - 1) _ 

/i!c(c + 1) • • ■ (c + n — 1) 

Here we assume that none of the constants a , b, c is zero or a negative integer. This con¬ 
dition on a and b keeps the series from terminating, while that on c avoids division by 
zero. The ratio 

a n +, _ (a + n)(b + n) _ /i 2 + (a + 6) /i + a/? 
a n (n + l)(c + n) n 2 + (c + \)n + c 

is positive for all sufficiently large h, so the terms of (9) ultimately have the same sign. 
Any such series can be treated by Gauss’s test (or the ratio test, or Raabe’s test); and since 
in this case /3 — a = (c + 1) — (a + b), we see that (9) converges if c > a + b and di¬ 
verges if c < a + b. 


The hypergeometric series (8) is extremely interesting and versatile, and is capable of 
representing virtually every function that occurs in elementary analysis.* Here we confine 
ourselves to remarking that when a = 1 and b — c it reduces to the ordinary geometric 
series—hence its name. 

*See Problem 1 on p. 203 of the present writer’s book, Differential Equations, 2nd ed. (McGraw- 
Hill, 1991). 


PROBLEMS 

1 Show that 


I 

n= 1 L 


3 • 5 • • • (2n — 1) 


2 ■ 4 • 6 • 


(2/i) 


converges. More generally, let k be an arbitrary positive 
integer and show that 

3 • 5 * • • (2n — 1) 


I 

n= 1 L 


2 • 4 • 6 ■ 


(2/i) 


converges if k > 2 and diverges if k < 2. 

Determine the convergence behavior of the following series: 


(a) X 
« = 1 

(b) X 
> 1=1 

(c) X 


(d) X 

n = 1 

(e) X 

n = 1 

Find the 
converge 

(a) X 


1*3-5** 

• (In - 1) 


X 

n= 1 

4 * 6 • 8 • • 

• (2 n + 2) ’ 

(b) 

2 • 7 • 12 * 

• • (5 n - 3) 



6-11-16- 

• • (5n + 1) ’ 

(c) 

X 

n — \ 

2 • 4 • 6 • 

• ' (2 n) 



3-5*7** 

• (2/i + 1) ’ 




2 • 4 • 6 • 

• • (2 n) 


5.7.9... 

(In + 3) ’ 


1 • 3 • 5 • • • 

(2 n - 1) 

1 

2 • 4 • 6 • 

• • (2 n) 

n ’ 

positive integers k for which 1 

2-4-6 

• • • (In) 1 

k 

_ 3 • 5 • 7 • • 

• (2 n + 1) _ 


r 2-4-6 

• ' • (2 n) 1 

k 

5.7.9.. 

• (2 n + 3) 


1 •3 -5 • • • 

(2 n ~ 1) 

1 


2 • 4 • 6 • • • (2 n) 
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4 Determine the values of a and b (where neither is zero or 
a negative integer) for which the following series con¬ 
verge: 


<*>f + 


a(a + 1) 
b{b + 1) 


+ ■ • • 


a(a + 1) • • • (a + n — 1) 

+ b(b + 1) • • • (b + n - 1) + ' ' ' ; 


(b) J + 

b 


a(a + 2) 
b(b + 2) 


+ ■ ■ ■ 


+ 2) • * • (a + 2n - 2) 

+ b(b + 2) • • • (b + 2n - 2) + ’ ’ ' • 


We begin with two simple examples: 


S<-n 


n= 1 



= 1 - 




and 


S(-ir +1 


i 


j_ j_i_ 

2 2 + 3 2 4 2 + " ' 


( 1 ) 

( 2 ) 


A.13 

ABSOLUTE VS. 
CONDITIONAL 
CONVERGENCE 


As we saw in Section 13.8, there is an important distinction between these series. By the 
alternating series test, each is convergent as it stands. However, if we change the signs of 
all the negative terms—that is, if we replace each term by its absolute value—then the 
series become 


and 


X 1 = 

oo , 



and the first of these altered series now diverges, while the second still converges. 

It was this phenomenon that led us in Section 13.8 to make the following definition: A 
series 2 a n is said to be absolutely convergent if ^\a n \ converges.Thus, (2) is absolutely 
convergent but (1) is not. The careful reader will notice that this definition in itself says 
nothing about the convergence of Ha n . However, we proved in Section 13.8 that absolute 
convergence does indeed imply ordinary convergence. 

The series (1) shows that the converse of this theorem is false, that is, convergence does 
not imply absolute convergence. Absolute convergence is therefore a stronger property 
than ordinary convergence, and we shall see that absolutely convergent series have sev¬ 
eral important properties which they do not share with convergent series in general. This 
brings us to another definition that was given but not pursued very far in Section 13.8: A 
series that is convergent but not absolutely convergent is said to be conditionally conver¬ 
gent. Our present purpose is to establish some of the general properties of absolutely con¬ 
vergent series, and also to emphasize the sharp contrast between these series and those 
which are only conditionally convergent. For instance, we shall see that rearranging the 
terms of an absolutely convergent series has no effect on its behavior or its sum, but that 
rearranging a conditionally convergent series can have a drastic effect. 

It is convenient to begin this program by considering an arbitrary series ^Za n and defin¬ 
ing p n and q n by 


Pn = 



and 


Qn 


2 


a n 


(3) 


It is clear that p n = a n and q n = 0 if a n > 0, and that p n = 0 and q n = —a n if a n < 0. Ac¬ 
cordingly, if the given series is a mixture of positive and negative terms, then we can think 
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of 'Zpn as consisting of the positive terms of 2 a n , and of ^Zq n as consisting of the nega¬ 
tives of its negative terms. This is not quite correct because many pf s and qfs can be 
zero, but it does provide a point of view which is useful for understanding the following 
theorems. 


Theorem 1 Consider a series ^Za n and define p n and q n by (3). //XXz„ converges con¬ 
ditionally, then 'Zpn and ^Zq n both diverge ; and if 2,a n converges absolutely , then 'Zpn 
and 'Lqn both converge and the sums of these series are related by the equation ^Za n = 
Zpn - Yq n . 


Proof It is clear from (3) that a n —p n — q n and \a n \ = p n + q n . Our basic tools are these 
equations and the fact that convergent series can be added or subtracted term by term. 

To prove the first statement, we assume that 2 a n converges and 2|a„| diverges. If 2 q n 
converges, then the equation p n = a n + q n tells us that 2 p n must also converge. Similarly, 
if 'Zpn converges, the equation q n ~Pn~ a n tells us that 2 q n also converges. Thus, if ei¬ 
ther 'Zpn or *Zq n converges, both must converge; and in this case the equation \a n \ = p n + 
q n implies that ^\a n \ converges — contrary to the assumption. This proves that the condi¬ 
tional convergence of implies that 'Zpn and ^Zq n both diverge. To establish the second 
statement, we assume that 2|a„| converges. We know that 2 a n also converges, so equa¬ 
tions (3) imply that 2 p n and 2 q n both converge. It follows from this that 

X Pn ~ X <?« = X iPn ~ <?«) = X a n< 

and the proof is complete. 

The first part of this theorem is illustrated by the conditionally convergent series (1), 
in which ^Zp n and 2 q n are the divergent series 

1+0 + J + 0 + J+ ••• and 0 + y + 0 + ^- + -- -. 

In the case of the absolutely convergent series (2), 2 p n and are 

l+0 + 32 +0 + ^2 _ + ''' and 0+ ^2 +0 + ^ + '‘'' 

both of which are convergent. Briefly, Theorem 1 tells us that the convergence of an ab¬ 
solutely convergent series is due to the smallness of its terms, while that of a condition¬ 
ally convergent series is due not only to the smallness of its terms but also to cancella¬ 
tions between its positive and negative terms. 

In the last paragraph of Section 13.5 we proved a theorem about rearranging a conver¬ 
gent series of nonnegative terms. We now extend this theorem to the case of absolutely 
convergent series. 


Theorem 2 If^Za n is an absolutely convergent series with sum s, and if the af s are 
rearranged in any way to form a new series 2 b n , then this new series is also absolutely 
convergent with sum s. 


Proof The series 2|a n | is convergent and consists of nonnegative terms. Since the b„ s 
are just the af s in a different order, it follows from the theorem just mentioned that 2 \b n \ 
also converges, and therefore 2 b n is absolutely convergent. If ^Zb n = t, then Theorem 1 
allows us to write 


and 


s ^ a n ^ p n ^ q n 


(4) 
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t = Xb n = XPn-lQn, ( 5 ) 

where each of the constituent series on the right is convergent and consists of nonnega¬ 
tive terms. But the P n ' s and Q n 's are simply the p n ' s and q n 's in a different order, so by 
another application of the theorem in Section 13.5 we have = 2 p n and 2(7 n = 2 q n . 
The fact that t = s now follows at once from (4) and (5). 

This theorem was proved in 1837 by Dirichlet, who discovered the phenomenon dis¬ 
cussed in Problem 10 of Section 13.4 and was the first to understand the significance of 
absolutely convergent series. 

In striking contrast to the behavior of absolutely convergent series as stated in Theo¬ 
rem 2, we find that the sum of a conditionally convergent series depends in an essential 
way on the order of its terms, and that the value of this sum can be changed at will by a 
suitable rearrangement of these terms. This fact was discovered and proved in 1854 by 
the great German mathematician Riemann, and is known as Riemann’s rearrangement 
theorem. It can be formulated as follows. 


Theorem 3 Let 2 a n be a conditionally convergent series. Then its terms can be re¬ 
arranged to yield a convergent series whose sum is an arbitrary preassigned number , 
or a series that diverges to or a series that diverges to 


Proof The idea of the proof is surprisingly simple. We begin by using Theorem 1 to 
form the two divergent series of nonnegative terms 2 p„ and 2 q n . 

To establish the first statement, let s be any number and construct a rearrangement of 
the given series as follows. Start by writing down p's in order until the partial sum 

P\ + P2 + • • • + Pm 

is first > s\ next, continue with —q's until the total partial sum 

P\ + Pi + * * • + Pm - q\ ~ Vi ~ ’ ’ * - q mx 
is first < s\ then continue with p's until the total partial sum 

P\ + * * • + Pm - q\ - ' • * “ 4mi + Pm + l + ' ' ' + Pm 

is first ^ s; and so on. The possibility of each of these steps is guaranteed by the diver¬ 
gence of 2 p n and 2 <? n ; and the resulting rearrangement of 2 a n converges to 5 because 
p n —> 0 and q n —> 0 . 

In order to make the rearrangement diverge to «>, it suffices to write down enough p's 
to yield 

P\ + P2 + • • • + Pm S 1. 
then to insert -q\, then to continue with p's until 

Pi + • • • + Pm ~ + Pm + l + ■ ■ • + Pm a 2, 

then to insert -< 72 , and so on. We can produce divergence to —00 by a similar construc¬ 
tion. 

One of the principal applications of Theorem 2 relates to the multiplication of series. 
In this connection it is notationally convenient to index the terms of the series we con¬ 
sider by n = 0, 1, 2,.... If we multiply two series 

^0 + a \ T • ’ * T a n + • • • and ^0 ^1 T ••■ + &„+••• ( 6 ) 

by forming all possible products af>j (as in the case of finite sums), then we obtain the 
following doubly infinite array: 
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(7) 


There are many ways of arranging these products into a single infinite series, of which 
two are important. The first is to group them by diagonals, as indicated by the arrows: 

aobo + (a 0 bi + a\b 0 ) + (a 0 b 2 + a\b\ + a 2 b 0 ) + ■ ■ ■ . (8) 

This series can be defined as 2~=o c n , where 

c n = a 0 b n + a x b n -\ + • • • + a n b 0 . 

It is called the product (or sometimes the Cauchy product) of the two series (6), and is 
particularly useful in working with power series. 

A second method of arranging (7) into a series is by squares, as suggested by the bro¬ 
ken lines: 

a 0 b 0 + (aob\ + a\b\ + a\bo) + ( aob 2 + a\b 2 + a 2 b 2 + a 2 b\ + a 2 bo) + • • • , (9) 

The advantage of this arrangement is that the nth partial sum s n of (9) is given by 

s n = (A) + a i + • • * + a n )(b 0 + /?! + ••• + b n ), (10) 

and this is useful in proving a preliminary fact about the multiplication of series. 


Theorem 4 If the two series (6) have nonnegative terms and converge to s and t, then 
their product (8) converges to st. 


Ptroof it is clear from (10) that (9) converges to st. Now denote the series (8) and (9) 
without parentheses by (8') and (9'). The series (90 of nonnegative terms still converges 
to st; for if m is an integer such that n 2 < m < (n + l) 2 , then the rath partial sum of (90 
lies between and s„, and both of these converge to st. By Theorem 2, the terms of 
(90 can be rearranged to yield (80 without changing the sum st; and when parentheses 
are suitably inserted, we see that (8) converges to st. 

The force of this result can best be appreciated by observing that the product of the 
convergent series 

y (-D" = -L _ J_ + _L — a,) 

Vn + 1 VT V2 V3 

with itself does not converge at all. We ask students to convince themselves of this in 
Problem 9. 

We now extend Theorem 4 to the case of absolutely convergent series. 


Theorem 5 If the series 2~=o a n and 2“= o b n are absolutely convergent , with sums s 
and t , then their product 

X + a \bn- \ + • • • a n bo) = a 0 b 0 + (a 0 b\ + aM + (a 0 b 2 + a\b\ + a 2 b 0 ) + * • • 

n =0 

+ (a 0 b n + a\b n ~\ + • • • + a n b 0 ) + ■ • ■ (12) 

is also absolutely convergent , with sum st. 
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Proof The series 2~ =0 \a n \ and 2~= 0 \b n \ are convergent and have nonnegative terms, so 
by the proof of Theorem 4 we see that the series 

MM + M\bi\ + |<2i||fc 0 | 

+ IMM + klll^ll + MM + * ’ ' 

+ kol \b n \ + |«il|^-i| + *'■ + ! a n ||^ol + ' * ' 

= Wobo\ + |a 0 ^i| + \aib 0 \ 

+ |<20^21 + l^l^ll + Mol + ‘ ' * 

+ \aob n \ + | a\b n -i\ + ■ * * + \a n bo\ + * ’ * (13) 

converges, and therefore 

ao^o + + ti\b 0 + ' ’ ' + a ob n 4- • • • + a n bo + • • • (14) 

is absolutely convergent. It follows from Theorem 2 that the sum of (14) will not change 
if we rearrange its terms and write it as 

+ <2o^i + ci\b\ -+- d\bo 

+ <2(^2 + a\b2 + + ^2^0 + * * * . (15) 

We now observe that the sum of the first (n + l) 2 terms of (15) is (ao + a,\ + • • • + 
a n )(b 0 + b\ + ■ • ■ 4- b n ), so it is clear that (15), and with it (14), converges to st. Since 
(12) is obtained by suitably inserting parentheses in (14), we see that (12) also converges 
to st. All that remains is to show that (12) converges absolutely; but this follows by the 
comparison test from the inequality 

\a 0 b n + ci\b n —\ + • • • + a n bo\ < \a 0 b n \ + \a\b n -i\ + ■ • • + \a n bo\ 
and the fact that the series 

ko^ol + (ko^ll + \a\b 0 \) + * * ‘ 

+ (\aob n \ + \a\b n - X \ + ' ‘ * + \a»b 0 \) + * ■ ■ , 
obtained from (13) by inserting parentheses, is convergent. 

This theorem shows that the absolute convergence of both of the given series is a suf¬ 
ficient condition for the convergence of their product to st. It is an interesting fact that 
this conclusion can also be obtained from the weaker hypothesis that only one of the two 
series is absolutely convergent (the example following Theorem 4 shows that the product 
of two conditionally convergent series need not converge at all!). A proof of this result is 
outlined in Problem 10. 


PROBLEMS 


1 Use the formula 1 + { + ■•• + 1/^ = In H + y+o(l) 
to establish the validity of the stated sums of the fol¬ 
lowing rearrangements of the series 1 - { + j - j + 
• • • = In 2: 

(a) l+i + y- -f-{-*i+--- = In2; 

(b) l+T + i- {“i+*--={ln6; 


(c) 1 - £ 

(d) 1 — 1 
1 . 




• = y In 2; 

_ 1 _ 1 _ _ 

12 14 


2 A series is sometimes said to be unconditionally 
convergent if it converges and every rearrangement con- 



830 


THE THEORY OF CALCULUS 


verges to the same sum. Show that 2 a n converges un¬ 
conditionally if and only if it converges absolutely. 

3 If 'Zcin is absolutely convergent, prove that 2 a n 2 con¬ 
verges. 

4 If a i + a 2 + 03 + 04 + • ■ • is absolutely converg¬ 
ent, prove that a { + a 2 + 03 + 04 + * * * = (01 + 
03 +■'■) + (02 + 04 +•■■)■ Is this necessarily true 
for any convergent series? 

5 Let 'Zdn be a convergent series with sum s. If ^Lb n is a 
rearrangement in which no term of the first series is 
moved more than no places from its original position, 
prove that ^Lb n still converges to s. 

6 What is the sum of 

1 + 1 + 1 — 1+...9 

*7 Prove that the series 


signs, etc., show that the resulting series diverges if 
p q and converges if p = q. 

9 Show that the product of the series (11) with itself di¬ 
verges. 

*10 If 2"= o a n and 2~=o b n converge to s’ and t, and if 
2~=o cin converges absolutely, prove that their product 

X b n + a\bn-\ + ■ * ■ + a n bo) 

n =0 

converges to st (this result is known as Mertens' theo¬ 
rem 1 '). Hint: Put s n = a 0 + • ■ • + a n , t n = bo + ■ • ■ + 
b n , and a n = t n ~ t\ show that the wth partial sum of the 
product can be written as 

s n t + a 0 a n + + • • • + a n a 0 \ 

and prove that if jS n is defined by 

= a 0 a n + a x a n -\ + • • • + a n a 0 


1 —7 + y — ^■ + ,,, = ln2 

converges to In 2\fp/q if its terms are rearranged by 
writing the first p positive terms, then the first q nega¬ 
tive terms, then the next p positive terms, then the next 
q negative terms, etc. 

*8 We know that the harmonic series 1 +{ + { + ? + 

• • • diverges but the alternating harmonic series 1 - 
\ + y — { + • ■ ■ converges. 

(a) Show that 1+y — { + i + — •••di¬ 

verges. 

(b) If the signs are changed in the harmonic series in 
such a way that p positive signs are followed by q 
negative signs, then p positive signs by q negative 


= a 0 a n + a x a n -i + • • • + a n a 0 , 
then (3 n —> 0. 


’’Franz Mertens (1840-1927) was born in Poland, studied at Berlin 
under Kummer and Kronecker, and taught at Cracow, Graz, and Vi¬ 
enna. He retained extraordinary vigor of mind and body to an ad¬ 
vanced age, and wrote the last of his more than 100 research papers 
at the age of 86. His main interest was analytic number theory, where 
he was a master in the use of elementary methods to simplify diffi¬ 
cult proofs. He discovered the theorem given here in connection with 
his success (after Euler had failed) in proving the convergence of the 
series 2 (± 1 tp n ), where the p „*s are the primes and the signs are + 
or - according as p n is of the form 4n + 1 or 4n + 3. 


A.14 

DIRICHLET’S TEST 


With one exception, all of our convergence tests in Chapter 13 are tests that apply only 
to series of positive (or nonnegative) terms; that is, they are tests for absolute convergence. 
This exception is the alternating series test, which says that the series 

b\ - b 2 + &3 “ ^4 + • * • (1) 


converges if the bn s form a decreasing sequence of positive numbers and b n —» 0. We can 
think of (1) as generated by multiplying the terms of the series 1 - 1 + 1 - 1 + • • • by 
the terms of the sequence b\, £ 2 , ^ 3 , b*, ... . From this point of view it is natural (and 
profitable) to generalize by considering a series 

0i + 02 + * ’ ’ + a n + • • • (2) 


and sequence 


bu b 2 , . ■ • ,b m ..., 


(3) 


and the problem is to find conditions under which the series 
a\b\ + a 2 b 2 + • • • + a n b n + * ■ * 


( 4 ) 
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converges. It is obvious that if (2) is absolutely convergent and (3) is bounded, then (4) 
is also absolutely convergent. Our purpose here, however, is to obtain criteria for the con¬ 
vergence of (4) that are not merely tests for absolute convergence. 

To accomplish this, we need Abels partial summation formula : If s n = a\ + a 2 + 

• • • + then 

a\b\ + a 2 b 2 + • • • + a n b n = si(b] - b 2 ) + s 2 (b 2 ~ b 3 ) + • • • 

T s n— \{bn— 1 — bn) + S n b n . (5) 

The proof is easy. Since a\ = s\ and a n = s n — s n - 1 for n > 1, we have 

a\b\ = s\b\ 
a 2 b 2 — s 2 b 2 — s\b 2 
a 3 b 3 = s 3 b 3 - s 2 b 3 

&n—\bn—\ — s n—lb n —\ S n - 2 b n -\ 

Qnbn ~ s nbn ~ s n— \bn- 

On adding these equations and suitably grouping the terms on the right, we obtain (5).This 
result enables us to establish Dirichlels test. 


Theorem 1 If the series (2) has bounded partial sums , and if( 3) is a decreasing se¬ 
quence of positive numbers such that b n —> 0, then the series (4) converges. 


Proof If we put S n = a\b\ + a 2 b 2 + • • • + a n b n , then (5) tells us that 

$n ~ Tn "h Snbn, (6) 

where 


Tn = S\(bi - b 2 ) + s 2 (b 2 - b 3 ) + • • • + s n -fb n -\ - b n ). 

We must prove that lim S n exists, and we do this by showing that lim T n and lim s n b n both 
exist. Our first assumption says that there is a constant M such that \s n \ < M for every n , 
so \s n b n \ ^ Mb n \ and since b n —> 0, we conclude that s n b n —> 0. Next, T n is the (n - 1 )st 
partial sum of the series 

s\(bi ~ b 2 ) + s 2 ( b 2 - b 3 ) + • • •, (7) 

so lim T n will certainly exist if (7) converges. To establish this, it suffices to show that (7) 
is absolutely convergent. We now use the assumption that the bf s are positive and de¬ 
creasing, which yields 

\s\(bi - b 2 )\ + \s 2 (b 2 ~ b 3 )| + • • • + \s n -\(b n -i ~ b n )\ 

^ M{b\ — b 2 ) + M(b 2 — b 3 ) + • • • + M{b n ~\ — b n ) 
= M{b\ - b n ) < Mb\. 

This implies that (7) is absolutely convergent, and the proof is complete. 

In order to make effective use of Dirichlet’s test, we must be acquainted with a few se¬ 
ries having bounded partial sums. Naturally, all convergent series have this property, but 
many that are divergent also have it. Perhaps the simplest is 1 - 1 + 1 - 1 + • • • ; and 
from this we see at once that the alternating series test is an immediate consequence of 
Theorem 1. 
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Dirichlet’s test is particularly useful in demonstrating the convergence of certain trigono¬ 
metric series, and here we encounter some further series that do not converge but never¬ 
theless have bounded partial sums. 

As an example, we show that if x is any number =£ kir, where k = 0, ± 1, it 2,... , 
then the series 


cos x + 


cos 3x . cos 5x 


+ COS (2/2 - \)x + 
2n ~ 1 


( 8 ) 


converges. Here the b n 's can be taken as 1, y,},..., so it suffices to prove that the par¬ 
tial sums of 


cos x + cos 3x + cos 5x + ■ • • + cos (2 n - l)x + • • • (9) 

are bounded. To do this, we use the trigonometric identity 

2 cos a sin b = sin {a + b) — sin (a — b). 

By putting b = x and a = x, 3x, 5x ,. . ., (2 n - 1 )x, we obtain 
2 cos x sin x = sin 2 x — 0 
2 cos 3x sin x = sin 4x — sin 2x 
2 cos 5x sin x = sin 6x — sin 4x 


2 cos (2 n - 1 )x sin x = sin 2 nx — sin(2rc — 2)x; 

and adding yields 

2 sin x[cos x + cos 3x + • • • + cos (2 n — l)x] = sin 2 nx 
or 

sin 2 nx 


cos x + cos 3x + • • • + cos (2 n — 1 )x : 


2 sin x ’ 


But |sin 2wx| < 1, so 


cos x + cos 3x + • • • + cos {In - l)x| < — t. -; 

1 2|sinx| 


( 10 ) 


This proves that the partial sums of (9) are bounded, so (8) converges. It should now be 
apparent why we assumed that x A kir\ We musthave sin x ^ 0 in (10). Actually, of course, 
it is obvious that (8) diverges if x = k7T. 


PROBLEMS 


Let p be a positive constant. It is clear from the discus¬ 
sion in the text that 

, cos 3x , cos 5x , 
cos x ^ y y ^ ’ ' 

converges for every x ^ kir\ and when x = kir, the series 
converges if p > 1 and diverges if p < 1. Use the identity 


2 sin a sin b = cos {a — b) — cos {a + b) 
to show that 

sin 3x sin 5x 

sin * + -y- + +■■■ 

converges for all x. 
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2 Prove the identities 


2 sin — (sin * + sin 2 jc + 


and 


+ sin nx ) 


x (In + \)x 

= COS — - COS --- 

2 2 


2 sin — (cos a: + cos 2x + 


■ + cos nx) 

. (In + 1)* 
= sin--- 


JC 

sm — 


Let b\, b 2 , , b n , ... be a decreasing sequence of posi¬ 

tive numbers such that b n 0, and use these identities to 
show that 

(a) 2“=i b n sin nx converges for all jc; 

(b) 2~=i b n cos nx converges for all x±2kir. 

3 Investigate the convergence behavior of 

cos 3x + cos 6x + cos 9x + 

2 3 4 

4 Show that the following series converge: 



ln 2 In 3 


1 

In 4 


1 

In 5 


+ In 6 + In 7 


and 


5 

6 


7 


8 



A series of the form 2~ =] a n ln x is called a Dirichlet se¬ 
ries. If it converges for x = x 0 , show that it also converges 
for every x > xq. 

Prove Abel's test : If the series (2) converges and (3) is a 
bounded increasing or decreasing sequence, then the se¬ 
ries (4) also converges. (Note the relation of this statement 
to Dirichlet’s test—more is assumed about the series, but 
less about the sequence. It is almost certain that Abel knew 
Dirichlet’s test.) 

Use Abel’s test to show that the following series con¬ 
verge: 

/X 1.1 1 1 III 1 , 1 1 

(a) COS 1 + 7 COS 2 ~~ 2 C0S 3 + 5 C0S 4 + 7 COS 5 ~ 

I 1 I 

4 cos 6 + • • • ; 


(b) 2 + 


2 2 


A_A + A + 

3 2 4 2 5 2 


_7_ 

6 2 


If Za n converges, so do Za n /n, ZaJ In n, Za n cos 1 In, 
Za n sin 1 In, 2(1 -f 1 /n)a n , 2(1 + 1 /n)"a tl , and 2Vra a n . 
Why? 


NOTE ON DIRICHLET 

P. G. L. Dirichlet (1805-1859) was a Ger¬ 
man mathematician who made many contributions of last¬ 
ing value to analysis and number theory. One of his earliest 
achievements was a milestone in analysis: In 1829 he gave 
the first satisfactory proof that certain specific types of func¬ 
tions are actually the sums of their Fourier series. Previous 
work in this field had consisted wholly of the uncritical ma¬ 
nipulation of formulas; Dirichlet transformed the subject 
into genuine mathematics in the modem sense. As a by-prod¬ 
uct of this research, he also contributed greatly to the cor¬ 
rect understanding of the nature of a function, and gave the 
definition which is now most often used, namely, that y is a 
function of x when to each value of x in a given interval 
there corresponds a unique value of y. He added that it does 
not matter whether y depends on x according to some “for¬ 
mula” or “law” or “mathematical operation,” and he em¬ 
phasized this by giving the example of the f unction of x 
which has the value 1 for all rational x's and the value 0 for 
all irrational x's. 


Ill 


Perhaps his greatest works were two long memoirs of 
1837 and 1839 in which he made very remarkable applica¬ 
tions of analysis to the theory of numbers. It was in the first 
of these that he proved his wonderful theorem that there are 
an infinite number of primes in any arithmetic progression 
of the form a + nb, where a and b are positive integers with 
no common factor. His discoveries about absolutely con¬ 
vergent series also appeared in 1837. His convergence test, 
discussed above, was published posthumously in his Lec¬ 
tures on Number Theory ( 1863). These lectures went through 
many editions and had a very wide influence. 

In later life Dirichlet became a friend and disciple of 
Gauss, and also a friend and advisor of Riemann, whom he 
helped in a small way with his doctoral dissertation. In 1855, 
after lecturing at Berlin for many years, he succeeded Gauss 
in the professorship at Gottingen. 
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A.15 

UNIFORM 
CONVERGENCE FOR 
POWER SERIES 


Consider a power series 2 a n x n with positive radius of convergence R, and let fix) be its 
sum. Our purpose here is to prove that fix) is continuous and differentiable on (-R, R), 
and also that its derivative and integral can legitimately be calculated by differentiating 
and integrating the series term by term. 

Let S n (x) be the nlh partial sum of the series, so that 

S n (x) = a 0 + a x x + a 2 x 2 + * • • + a rt x n . 


We write 


fix) = S n (x) + R n (x) 
and call R n (x) the remainder Evidently, 

Rn(x) = a n +\x n+l + a n+ 2 x n+2 + ■ ■ ■. 

For each x in the interval of convergence, we know that R n (x) —»0 as n —> ©o; that is, for 
any given e > 0 we have 

\R n (x)\ < e for sufficiently large n, n> n 0 . (1) 

We emphasize here that this is true for each x individually, and is merely an equivalent 
way of expressing the fact that £ a n x n converges to fix). However, much more can be said: 
Throughout any given closed interval inside the interval of convergence, say \x\ < |jci| < 
R, (1) holds for all x simultaneously. Since R n (x) = fix) - S n (jc), we can express this in 
another way by saying that throughout the given closed interval, S n ix) can be made to ap¬ 
proximate fix) as closely as we please by taking n large enough. 

To prove this, we observe that for every jc in the given closed interval \x\ < \x\ \ < R, 
we have 

= |a„+i *" +1 + a n+ 2 x n+2 + • • • | 

- \dn + \x n+x \ + \a n+2 x n+2 \ + ■ ■ ■ 

^ K+i*i n+1 | + |a n +2^i n+2 | + • • • . 

The argument is completed by using the fact that the last-written sum can be made < e 
by taking n large enough, n > no, because of the absolute convergence of ^Za n x\ n . The 
point is, that the same no works for all ;t’s in the given closed interval. The conclusion 
proved here, that R n (x) can be made small independently of x in the given closed inter¬ 
val, is expressed by saying that the series is uniformly convergent in this interval. 


CONTINUITY OF THE SUM 

We will prove that fix) is continuous at each interior point xq of the interval of conver¬ 
gence. Consider a closed subinterval \x\ < |jci| < R containing *o in its interior. If e > 0 
is given, then by uniform convergence we can find an n with the property that \R n ix)\ < 
efor all ;t’s in the subinterval. Since the polynomial S„(jc) is continuous at x 0 , we can find 
8 > 0 so small that |jc — jcq| < 8 implies x lies in the subinterval and \S n ix) — S„(.xo)| < £■ 
By putting these conditions together we find that |x — xq\ < 8 implies 

l/to -f(xo )| = | [S„(x ) + /?„(*)] - [S„0to) + tf„(*o)]| 

= |[S„to - S n Uo)] + Rn(x) - Rnixo )I 

< |5„W - S n Cr 0 )| + l^ntol + |/?n(^o)| 

< e + e + e = 3e. 

Since 3e can be taken as small as we please, this proves that fix) is continuous at jcq. 
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INTEGRATING TERM BY TERM 

We have just proved that 

fix) = (2) 

is continuous on (-R, R). We can therefore integrate this function between limits a and 
b that lie inside the interval, 

l fix) dx = l dx. (3) 

Our purpose here is to show that the right side of this can legitimately be integrated term 
by term, 

[ * = X a nX" fa- 

In words, the integral of the sum equals the sum of the integrals. An equivalent statement 
is that (3) can be written as 


[ fix) dx = E f a nX n dx. (4) 

Ja Ja 

To prove this, we begin by observing that since 5„(x) is a polynomial, and therefore 
continuous everywhere, all three of the functions in 

fix) = S n (x) + Rnix) 

are continuous on ( —R, R). This allows us to write 

[ f (*) dx = f S n (x ) dx + [ R n (x) dx. (5) 

Ja Ja Ja 

Since any polynomial can be integrated term by term, the first integral on the right of 
(5) is 

f S„(jc) dx = f (<3o + a\x + a^x 1 + • • • + a n x n ) dx 

Ja Ja 

= [ ao dx + f a\x dx + f ci 2 X 2 dx + • • • + f a n x n dx. 

Ja Ja Ja Ja 


To prove (4), it therefore suffices to show that 

f R n (x) dx —> 0 as rc —» 


For this we use uniform convergence, which tells us that if e > 0 is given and |xl ^ \x\\ < 
R is a closed subinterval of ( —R , R) that contains both a and b, then |/?„(jc)| < efor all x 
in the subinterval if n is large enough. All that remains is to write 




|/?„(x)| dx < e\b - a\. 


and to notice that this can be made as small as we wish. 

As a special case of (4) we take the limits 0 and x instead of a and b, and obtain 


£ fu) dt = Y 


1 


n + 1 


v h+i 


= aox + — a x x 2 + — a 2 x 2 


n + 1 


an x n+l + • • •, 


( 6 ) 
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where the “dummy variable” t is used in the integral for the reason explained in Sec¬ 
tion 6.7. 


DIFFERENTIATING TERM BY TERM 

We now prove that the function f(x) in (2) is not only continuous but also differentiable, 
and that its derivative can be calculated by differentiating (2) term by term, 

= (7) 

To do this, we begin by recalling from Section 14.3 that the series on the right of (7) con¬ 
verges on ( —R, R). If we denote its sum by g(jt), so that 

<?(■*) — ^^na n x n 1 = a\ T 2a 2 x 4- 3a-$x 2 4- ■ ■ ■ 4- na n x n ~\ + • ■ ■ , 

then (6) tells us that 


g(t) dt = a\x 4- a 2 x 2 4- a$x 3 + • • • 

= f(x) - a 0 . 

Since the left side of this has a derivative, so does the right side, and by differentiating 
we obtain 

fix) = g(x) = Y^na n x n ~\ 

as required. 


A.l6 

DIVISION OF POWER 
SERIES 


In order to justify the division of power series as described in Section 14.7, it suffices to 
justify dividing a power series into 1. To see this, we have only to notice that 


Sa„-r n 

ZbnX" 


(2a n x n ) ■ 


for this tells us that if we can expand 1 /(Zb n x n ) in a power series with positive radius of 
convergence, then we can achieve our purpose by multiplying this series by Za n x n . It is 
clearly necessary to assume that bo =f= 0 (why?). We may assume that bo = 1 without any 
loss of generality, because if b 0 has any other nonzero value, we simply factor it out, leav¬ 
ing the leading coefficient equal to 1: 


_1_ _ J_ _ 1 _ 

bo 4- b\x 4- b 2 X 2 4- • • • bo 14- (b\/bo)x 4 {b 2 lbo)x 2 4* • • • ’ 


In view of these remarks, we direct our efforts at proving the following statement: 


If'Zb,pc n has bo = 1 and positive radius of comergence R, then 1 /(Zb n x n ) can be ex¬ 
panded in a power series Zc n x n which also has positive radius of convergence. 

We begin by determining the cf s. The condition U(Zb,p: n ) = Zc, X x n means that 
( Zb,pc n )(Zc n xI l ) = 1, so 

boco + (b 0 c\ 4- b\Co)x + ( b Q c 2 4- b\c\ 4- b 2 cd)x 2 

4- • • • 4- {boc n 4- + * * • 4- b n co)x n + ■ ■ • = 1 , 


and therefore 
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fr 0 Q) = 4 boC\ + b\CQ = 0, boC2 + b\C\ + b 2 co = 0, 

.. ., b 0 c n + b\C n —\ + • • ■ + b n co = 0,. . . . 
Since bo = 1, these equations determine the c„’s recursively: 
c 0 =1, Ci = -bic 0 , c 2 = ~b\Ci ~ b 2 c 0 , 

• • •» — b\C n —\ b 2 c n — 2 — • • • — b n CQ ,. . . . 

All that remains is to prove that the power series 2 c n .x n with these coefficients has pos¬ 
itive radius of convergence, and for this it suffices to show that the series converges for 
at least one nonzero x. This we now do. 

Let r be any number such that 0 < r < R, so that 2 b n r n converges. Then there exists a 
constant K > 1 with the property that \b n r n \ ^ K or \b n \ < KJr n for all n. It now follows 
that 


|co| = 1 s K, 

hi = |ft,c 0 | = 
|c 2 | s \b l c x \ + 

k 3 | ^ M + 



„ , K K K „ IK 2 

\bic 0 \ -+ -j • K - — j~< 

. K 2K 2 K K 

hci| + \b 3 c 0 \ ^ — • — + -2 • — 


K 3 

( 2+1 + 1 )— = 
r 3 



4K 3 2 2 K 3 

r 3 - r 3 ' 


since K 2 < K 3 . In general, 

|cj ^ \b\c n -\\ + \bic„- 2 \ + • • • + \b n c 0 \ 


K_ 2"“ 2 A:" _1 
r 


K 2 n - 3 K n ~ 2 


r n— 1 


- 2 ' 

n 


r n—2 


+ *•* 
r n 


< (2" _2 + 2"~ 3 + 


Ifn 

+1 +1)^- = 

r n 


2 n ~\ K n 2 n K n 

-<- 

y.n j.n ' 


We therefore have \c n x n \ < |(2 K/r)x \ ", so the series 2c„x" is absolutely convergent—and 
therefore convergent—for any * that satisfies the condition \x\ < r!2K. This shows that 
2 c n x n has nonzero radius of convergence, and the argument is complete. 


We shall prove that 

fxyteo* yo) =fyx(x 0. ?o). (D 

under the assumption that both mixed partials fxy and f yx exist at all points near (xo, yo) 
and are continuous at (xo, yo)- 

Let Ax and Ay be so small that/^y, and exist throughout the rectangle with vertices 
(*o, yo)> C*o + Ax, yo), (*o, yo + Ay), (*o + Ax, y 0 + Ay)(see Fig. A. 17). We carry out the 
proof by applying the Mean Value Theorem (Appendix A.4) in various ways to the ex¬ 
pression 

D = /(xo + Ax, y 0 + Ay) -/(x 0 + Ax, y 0 ) - /(x 0 , y 0 + Ay) + /(x 0 , yo)- (2) 
We begin by considering the function 

Fix) = fix , y 0 + Ay) - /(x, y 0 ). 


A.17 

THE EQUALITY OF 
MIXED PARTIAL 
DERIVATIVES 
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Figure A.17 


A. 18 

DIFFERENTIATION 
UNDER THE 
INTEGRAL SIGN 


(XO ’^0 + A ^) (*o + +A ^) 



where Ay is held fixed. The expression (2) can be written in the form 

D = [/(x 0 + Ax, y 0 + Ay) -fix o + Ax, y 0 ) ] - [/(x 0 , y 0 + A y) -/(x 0 , y 0 ) ] 

= F(x 0 + Ax) - F(x 0 ), 

so by applying the Mean Value Theorem we obtain 
D = Ax F'(xi) 

= Ax[/*(xj, y 0 + Ay) - A(x h y 0 )], 

where xj is some number between x 0 and x 0 + Ax. Since f x (x\, y) is differentiable as a 
function of y, we can apply the Mean Value Theorem again to obtain 

D = Ax Ay fxy(x i, yi), (3) 

where yi lies between y 0 and y 0 + Ay. 

We now start all over again with the function 

Giy) = fix o + Ax, y) -/(x 0 , y), 

where Ax is held fixed. The expression (2) can be written in the form 

D = [fix o + Ax, y 0 + Ay) -/(x 0 , y 0 + Ay)] - [/(x 0 + Ax, y 0 ) - fix 0 , y 0 )] 

= G(y 0 + Ay) - G(y 0 ). 

Just as before, by two applications of the Mean Value Theorem we find that 

D = Ay G'( y 2 ) = Ayt/^Cxo + Ax, y 2 ) - / v (-^o ? >’ 2 )] 

= Ay Ax f yx ix 2 , y 2 ), (4) 

where y 2 lies between y 0 and yo + Ay and x 2 lies between xq andxo + Ax. 

Finally, by equating (3) and (4) we see that 

fxyix 1 , y 1 ) =f y xix 2 , yi)- (5) 

Now let Ax and Ay approach zero, so that the rectangle shrinks toward the point (xq, yo)- 
As this happens, it is clear that the points (xj, yi) and (x 2 , y 2 ), which lie inside the rec¬ 
tangle, approach (xq, yo), and we obtain our conclusion (1) from (5) and the continuity of 
fxy and f yx at (xq, yo)- 


In Problems 32 in Section 19.2 and 19 in Section 19.6, we used the formula 

d [k (b 

~dx 1 ^ x ' dy = L ^ X ' ^ dy ' 

Our purpose here is to prove this under the assumption that fix, y) and its partial deriva¬ 
tive f x ix, y) are both continuous functions on the closed rectangle xo^x<xi,a<y<&. 
It is convenient to write 
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Fix) = f fix, y) dy. 

Ja 

If x and x + Ax both lie in the interval x 0 < x < x\, then 

F(x + Ax) - Fix) = f b f(x + Ax, y)dy~ f fix, y) dy 

Ja Ja 

= [ [f(x + Ax, y) - f(x, y)] dy. 

Ja 

Next, the Mean Value Theorem enables us to write this integrand in the form 
f{x + Ax, y) ~f{x, y) = A x f x (x, y), 

where x lies between x and x + Ax. Further, since f x {x y y) is assumed to be continuous on 
the closed rectangle, it can be shown that the absolute value of the difference 

fx(x, y) -f x {x, y) 

is less than a positive number e which is independent of x and y and approaches zero with 
Ax/ By putting these ingredients together, we obtain 

= I [ [ fx(x , y) ~ f x {x, y)] dy 

^ f \fx(x, y) — fx(x, >’)| dy 
Ja 


Fix + Ax) - Fix) 


Ax 


I* fxix, y) dy 

Ja 


If we now let Ax approach zero, then e also approaches zero, and we have 
Fix + Ax) - Fix) 


lim 


Ax 


: f 6 dy = eib - a). 

Ja 

caches zero, anc 
' = jfAx, y) dy, 

Ja 


which is (1). 


This lemma is stated and discussed in Section 19.4, and is the linchpin that holds together 
the main tools of Chapter 19. It has to do with a function z = fix, y) whose partial de¬ 
rivatives f x and f y exist at (xo, To) an d at all nearby points, and are continuous at the point 
(xq, To) itself. The assertion of the lemma is that under these conditions the increment 

Az = /(x 0 + Ax, y 0 + Ay) - /(*o, To) 


A.19 

A PROOF OF THE 
FUNDAMENTAL LEMMA 


can be expressed in the form 

Az = fix o, y 0 ) Ax + fyix o, y 0 ) A y + 6, Ax + e 2 Ay, (1) 

where and e 2 are quantities that -A 0 as Ax and Ay —> 0. 

To prove this statement, we analyze the change Az in two steps, as shown in Fig. A. 18, 
first changing x alone and moving from (xq, yo) to (xq + Ax, yo), and then changing y 
alone and moving from (xq + Ax, yo) to (xq + Ax, yo + Ay). We denote the first change 
in z by Aiz, so that 

Ajz =/(xo + Ax, yo) — fix o, yo)- 
By the Mean Value Theorem we can write this as 


y 


(x 0 + Ax, To + &y) 


Ay 


Ax 

ix 0 ,y 0 ) (* 0 +A X,y 0 ) 


*This is the two-dimensional analog of the property of uniform continuity that we discussed and 
proved in Appendix A.5. 


Figure A. 18 


x 
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A.20 

A PROOF OF THE 
IMPLICIT FUNCTION 
THEOREM 


Aiz = Ax/*(xi, To). 


( 2 ) 


where x\ is between xq and xq + Ax. Similarly, if we denote the second part of the change 
in z by A 2 z, so that 

A 2 z = fix 0 + Ax, To + by) - fix 0 + Ax, To). 


then 


A 2Z = At //x 0 + Ax, Ti), (3) 

where Ti is between To and To + A y. As Ax and At -» 0, x, —> xo and Ti —> To- By the as¬ 
sumed continuity of /, and / y at (xo, To). we can therefore write 

M*u To) = fx(x o, To) + ei 


and 


f y (x o + Ax, yi) = / y (x 0 , y 0 ) + * 2 , 

where and e 2 —3► 0 as Ax and Ay —> 0. This permits us to write (2) and (3) as 


Aiz = fxix 0 , To)A* + 6] Ax 

(4) 

A 2 z =fy(x 0, To) Ay + e 2 At- 

(5) 


Since A z = Aiz + A 2 z, we now complete the proof by adding (4) and (5) to obtain (1). 


We stated and discussed the Implicit Function Theorem in Section 19.10, and our purpose 
here is to provide a proof. 

To restate the situation, we assume that F(x, y) has continuous partial derivatives 
throughout some neighborhood of a point P 0 = Uo ^oX and also that F(x 0 , yo) = c and 
F y (x 0 , yo) ^ 0. We shall prove that there exists a rectangle centered on Po within which 
the graph of F(x, y) = c is the graph of a single differentiable function y =/(x), and also 
that the derivative of this function is given by the formula 


dy = _£± 

dx Fy' 


(1) 


First, we know from the Fundamental Lemma (Section 19.4) that since Phas continu¬ 
ous partial derivatives in the neighborhood mentioned above, F itself is continuous in this 
neighborhood. 

Let us suppose, for definiteness, that F y > 0 at P 0 . (A similar proof can be constructed 
if F y < 0 at Po.) We observe that if F y > 0 along a vertical segment, then F is an in¬ 
creasing function of y along that segment. It follows that no value of F (such as F = c, 
in which we are interested) can be taken on more than once on such a segment. 

We begin by constructing a rectangle P 0 centered on P 0 within and on the boundary of 
which F and F y are continuous and F y > 0. This is possible because of the continuity of 
the functions. Along any vertical segment across Po the function F is an increasing func¬ 
tion of y. By the Intermediate Value Theorem (Appendix A.3) we have F = c on this seg¬ 
ment if and only if/< c at the lower end and F > c at the upper end. Consider, for ex¬ 
ample, the vertical segment PjP 2 through Pq. Since F = c at Po, we have F < c at Pi, 
and by continuity we have F < c in some neighborhood of Pi on the lower edge of Po- 
Similarly, we have F > c in some neighborhood of P 2 on the upper edge of Po. A verti¬ 
cal segment close enough to PiP 2 will have its ends in both neighborhoods and will there¬ 
fore intersect the graph of F = c in exactly one point. 
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It is evident from this that we can shrink the base of Rq , if necessary, to form a new 
rectangle R centered on Po such that F < c on its lower edge and F > c on its upper edge. 
See Fig. A. 19. Inside this rectangle there is one y that corresponds to each x in such a 
way that F(x, y ) = c, and this defines our function y = /(x). It is clear that this function 
is continuous, because the height of Rq can be taken as small as we please to begin with. 

To establish differentiability and formula (1), we consider a change Ax and the corre¬ 
sponding change Ay = /(xo + Ax) - /(xq). Since the new point (xo + A*, yo + Ay) is still 
on the graph of F(x, y ) = c, we have 

F(x o, yo) — c and F(x o + Ax, yo + Ay) = c, so A F = 0. 

But the Fundamental Lemma gives 

A F = F x Ax + F y Ay + €\ Ax + 62 Ay, 
where e\, €2 —> 0 as Ax and Ay —» 0, and therefore 

0 = F x + F y ^- + ei + e 2 -^ 
y Ax Ax 


or 


Ay = F x + 61 
Ax F y 4- 62 


( 2 ) 


We know that y =/(x) is continuous. If Ax —» 0, it therefore follows that Ay —> 0, so 61 , 
6 2 —»0 and (2) implies that y = /(x) is differentiable with derivative given by (1). Finally, 
this proof of differentiability applies in just the same way to any other point on the graph 
inside the rectangle R. 


Our basic tools for integrating in polar, cylindrical, and spherical coordinates are the for¬ 
mulas 


dA = r dr d6, dV=rdrdddz , and dV= p 2 sin </> dp dcf) d6, (1) 

for the elements of area and volume in these three coordinate systems. However, the jus¬ 
tifications we gave in Chapter 20 were purely intuitive and geometric. Our purpose in this 
brief final appendix is to describe a broader theoretical setting within which these for¬ 
mulas can be understood as merely different aspects of a single idea. 

The problem that we now consider is the following: What happens to a multiple inte¬ 
gral 

II ■■■ I f(x, )', ■■■) dx dy ■■ ■ 

R 

if we change the variables from x, y, ... to u, v, . . . ? 

We know the answer to this question in the case of a single variable: If /(x) is contin¬ 
uous and the function x = x(w) has a continuous derivative, then 

fb Cd dx 

J a f(x) dx = | f[x(u)\ — du , (2) 

where a = x(c) and b = x(d). As an example of the use of this formula, we point out that 
the trigonometric substitution x = sin 0, dx = cos 6 dd enables us to write 

fl /-- Ctt /2 Ctt/I 1 77 

V 1 - x 2 dx— cos 6 • cos 6 dd = — (1 + cos 26) dd = — 

Jo Jo Jo 2 4 


y 





Figure A. 19 
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Figure A.20 



Students should observe particularly that the change of variable in this calculation is ac¬ 
companied by a corresponding change of the interval of integration. 

Our only similar experience in the two-variable case is with changing double integrals 
from rectangular to polar coordinates by using the transformation equations 

x = r cos 6 , y = r sin 6. (3) 

Up to this stage we have interpreted these equations as expressing the rectangular coor¬ 
dinates of a given point in terms of its polar coordinates. However, they can also be in¬ 
terpreted as defining a transformation or mapping that carries points (r, 6) in the r0-plane 
over to points (x, y) in the xy-plane. That is, if a point (r, 6) is given, then equations (3) 
determine the corresponding point (x, y), as suggested in Fig. A.20. Further, in order to 
make this correspondence one-to-one, it is customary to restrict the point {r, 6) to lie in 
the part of the r0-plane specified by the inequalities 0 < r, 0 < 6 <2tt. 

From this point of view, the formula for changing a double integral into polar coordi¬ 
nates [formula (3) in Section 20.4] can be written as 

jj f(x* y) dx dy = jjf(r cos 6 , r sin 6) r dr dd. (4) 

R S 


Thus, we are allowed to substitute x = r cos 6 and y = r sin 6 in the integral on the left, 
but we must then replace dx dy by r dr dd and R by the corresponding region S in the 
r0-plane. In our previous work we made no mention of the region S, but instead — and 
equivalently—changed the limits of integration on iterated integrals to describe the same 
region R in terms of polar coordinates. 

Formula (4) is a special case of a very general formula for changing variables in dou¬ 
ble integrals. The detailed proof is beyond the scope of this book, but at least we can state 
the result. First we need a definition. Consider a pair of functions of two variables, 

x = x(u, v), y = y(w, v), (5) 


and assume that they have continuous partial derivatives. The Jacobian of these functions 
is the determinant defined by 


d(x, y) 
3(m, v) 


dx 

dx 





a u 

dv 

_ dx 

dy 

dx 

dy 

dy 

dy 

du 

dv 

dv 

du 

du 

dv 






( 6 ) 


^Determinants of this form were first discussed by the German mathematician C. G. J. Jacobi 
(1804-1851). He did important work in the theory of elliptic functions, and applied his discoveries 
in astonishing ways to the theory of numbers. He also created a new and fruitful approach to theo¬ 
retical dynamics. The Hamilton-Jacobi equations are part of the standard equipment of every student 
of mathematical physics. Also, Jacobi uttered the following magnificent and unforgettable defense 
of science for its own sake: “The sole aim of science is the honor of the human mind, and from this 
point of view a question about numbers is as important as a question about the system of the world.” 
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This is often called a functional determinant, because it is a function of the variables it 
and v. As an example, we see that the Jacobian of the polar coordinates transformation 
(3) is 


djx, y) 
d(r, d) 


dx 

dr 

dx 

de 


cos 6 

-r sin 6 

dy 

dy 


sin 6 

r cos 6 

dr 

de 





The general change of variables formula for double integrals can now be stated as fol¬ 
lows: If (5) is a one-to-one transformation of a region S in the uv-plane onto a region R 
in the xy-plane, and if the Jacobian (6) is positive , then 


IIfix, y) dxdy = Hf[x(u, v), y(u, ^)] du dv. (7) 

R S 

Since r is the Jacobian of the polar coordinates transformation (3), it is clear that (4) is a 
special case of (7). Further, we can think of (7) as a two-dimensional extension of (2), 
with the derivative dx/du being replaced by the Jacobian d(x, y)ld(u, v). 

Formula (7) in turn can be extended to triple integrals. First, we define the Jacobian of 
the transformation 





dx 

dx_ 



du 

dv 

dw 

r x = X(u, V, w), 
y = y(u, v, w), by 

y, z) _ 

d(u, v ; w) 

dy 

du 

dy 

dv 

dy 

dw 

L z = z{u, v, w), 

A 

dz_ 

dz_ 




du 

dv 

dw 


Then, under the similar assumptions, we have the following extension of (7): 

/If y ' Z ) ^ dy dz = jjj F{u, v ; w) d/** y' du dv dw, (8) 

R S Vi W 

where F(u, v, w) = f[x(u, v, w), y(u, v, w), z(u, v, w)]. The main thing to notice here is 
that 

dx dy dz is replaced by d(.x, y, z) ^ ^ 

d(w, v, w) 

Two important special cases of (8) are those of cylindrical coordinates , 
f/<« z) dx dy dz = JJj F(r, 0, z)r dr d6 dz, 

R 5 

where F(r ; 0, z) = f(r cos 6, r sin 6, z); and spherical coordinates, 

JIJ/U, y> z) dxdy dz = jjj F(p, </>, 6)p 1 sin (/> dp d(j) d6, 

R S 

where F(p, </>, 6) = f(p sin 0 cos 6, p sin <f> sin 0, p cos <£). We leave it to the student to 
verify the spherical coordinates formula by using the transformation equations 

(x = p sin </> cos 0 , 
y = p sin </> sin 6, 

[z — p cos </>, 

to calculate the Jacobian 
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Figure A.21 


A v 


( u, v ) A u 


u 



d(x, y, z) 

3(p, c/>, 6) 


sin 4> cos 6 
sin 4> sin 6 
cos 4> 


p cos 4> cos 6 
p cos c/> sin 6 
— p sin 4> 


— p sin 4) sin 6 
p sin 4> cos 6 


0 


= p 2 sin 4>- 


It is in this way that we can understand a little more fully what lies behind formu¬ 
las (1). 

One question remains, and for the sake of simplicity we state it only for the two-vari¬ 
able case: What is the underlying reason for the presence of the Jacobian on the right side 
of formula (7)? We now give a very brief intuitive explanation of this by means of vec¬ 
tors. In the wz>plane the equations w = a constant and v— a constant determine a network 
of straight lines parallel to the axes, whereas in the xy-plane these equations determine a 
network of intersecting curves. A small rectangle in the w^plane with sides Aw and Av 
corresponds to a small parallelogram in the jcy-plane (see Fig. A.21) with sides that can 
be written in vector form as 


and 





Aw 





Az;, 


approximately. In calculating the integral on the left side of (7) as a limit of sums, it is 
natural to abandon the usual rectangular cells and instead use these small parallelograms. 
If we denote by A A the area of the parallelogram in the figure, then A A equals the mag¬ 
nitude of the cross product of the two vectors given above. Since this cross product is 


we have 


i 

j 

dx 

dy_ 

du 

du 

3* 

dy_ 

dv 

dv 


k 


0 


Aw Au = 


dfe y) 

d(u, v) 


Aw Av 


k. 


0 


A A 


d(s, y) 

3(w, v) 


Aw Av. 


(9) 


This shows that the Jacobian plays the role of a local magnification factor for areas. Fur¬ 
ther, these remarks constitute a sketch of a proof of (7), because all that remains to es¬ 
tablish (7) is to form the integral on the left side as a limit of sums and make use of (9). 
















A FEW REVIEW 

TOPICS 


The binomial theorem is a general formula for the expanded ^-factor product 

(<a + b) n = (a + b)(a + b) • • • {a + b). (1) 

If we compute a few cases by repeated laborious multiplication, we find that 
(a + b) x = a + &, 

(a + 6) 2 = a 2 + + b 2 , 

(a + 6) 3 = a 3 + + 3 ab 2 + fr 3 , 

(a + &) 4 = a 4 + 4a 3 & + 6 a 2 b 2 + 4 ab 3 + b 4 , 

(a + b) 5 = a 5 + 5 a A b + 10 a 3 b 2 + 10 a 2 b 3 + 5ab 4 + 

It is clear that the expansion of (1) begins with a" and ends with b n , and also that the in¬ 

termediate terms involve steadily decreasing powers of a and steadily increasing powers 
of b so that the sum of the two exponents is exactly n in each term. What is not so clear 
is the way the coefficients are calculated. To anticipate our final result, the expansion in 
question (the binomial theorem) is 

(a + b) n = a n + na n ~ x b + -^ ^^ a n ~ 2 b 2 

+ n(n ~ 2 D ( 3 ~ 2) -a"— 3 fc3 + -.- 

+ ^- 1x :: 2 2) 3 ::. ( ;-^ +12 ^ 

+ • • • + b n . ( 2 ) 

Our purpose is to understand the reasons behind the form of these coefficients. The best 
way to do this is to take a short detour through the closely related topics of permutations 
and combinations. 

Before starting this detour, we remind students that if n is a positive integer, then the 
product of all the positive integers up to n is denoted by n\, called n factorial: 

n\ = 1 • 2 • 3 • ■ ■ n. 

Thus, 1! = 1, 2! = 1 • 2 = 2, 3! = 1 ■ 2 ■ 3 = 6, 4! = 1 • 2 ■ 3 • 4 = 24, etc. For reasons 
that will appear below, we define 0! to be 1. These numbers increase very rapidly, as we 
see by doing a little arithmetic: 
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5! = 120, 6! = 720, 7! = 5040, 8! = 40,320, 

9! = 362,880, 10! = 3,628,800. 

Further, with the aid of a calculator we learn that 

20! = 2.433 X 10 18 and 40! = 8.159 X 10 47 . 

Any product of consecutive positive integers can easily be written in terms of factorials. 
For example, 


6 • 7 • 8 • 9 • 10 = 


1 - 2- 3- 4- 5- 6- 7‘8-9-lQ 
1 • 2 • 3 • 4 • 5 


10 ! 
5! ‘ 


In general, if k < n, then 

(*+ m + i = 


PERMUTATIONS 

We now discuss certain methods of counting that are useful in many applications of math¬ 
ematics. 

The reasoning on which our work is based can be illustrated by a simple example. Con¬ 
sider a journey from a city A through a city B to a city C. Suppose it is possible to go 
from A to B by 3 different routes and from B to C by 5 different routes. Then the total 
number of different routes from A through B to C is 3 • 5 = 15; for we can go from A to 
B in any one of 3 ways, and for each of these ways there are 5 ways of going on from B 
to C. 

The basic principle here is this: If two successive independent decisions are to be made, 
and if there are c\ choices for the first and c 2 choices for the second, then the total num¬ 
ber of ways of making these two decisions is the product cic 2 . It is clear that the same 
principle is valid for any number of successive independent decisions. 

The following is our main application of this idea. Given n distinct objects, in how 
many ways can we arrange them in order, that is, with a first, a second, a third, and so 
on? The answer is easy. There are n choices for the first object. After the first object is 
chosen, there are n - 1 choices for the second, then n — 2 choices for the third, etc. By 
the basic principle stated above, the total number of orderings is therefore 

n(n - 1 )(n — 2) - * • 2 • 1 = n\. 

Each ordering of a set of objects is called a permutation of those objects. We have reached 
the following conclusion: 

The number of permutations of n objects is n\. 

Example 1 (a) There are 5! = 120 ways of arranging 5 books on a shelf, (b) There are 

9! = 362,880 possible batting orders for the 9 players on a baseball team, (c) There are 
52! = 8.066 X 10 67 ways of shuffling a deck of 52 cards. 


We next consider a slight generalization. Suppose again that we have n distinct objects. 
This time we ask how many ways k of them can be chosen in order. Each such ordering 
is called a permutation of n objects taken k at a time, and the total number of these per¬ 
mutations is denoted by P(n, k). There are evidently n choices for the first, n — 1 choices 
for the second, n — 2 choices for the third, and n — (k — 1 ) = n — k + 1 choices for the 
/:th. The total number of these permutations is therefore 


P(n, k) = n(n - 1)(« - 2) • • • (n - k + 1). 




B. 1 THE BINOMIAL THEOREM 


847 


If we write this number in terms of factorials, then our conclusion can be formulated as 
follows: 


The number of permutations of n objects taken k at a time is 


Pin, k) = n(n - 1 )(n - 2) • • • (n - k + 1) = 


n\ 

0 n-k )\' 


Example 2 (a) If we have 7 books and only 3 spaces on a bookshelf, then the number 

of ways of filling these spaces with the available books (counting the order of the books) is 


P(7, 3) = 


7! 7! 

(7-3)! = T = ? 


6 • 5 = 210. 


(b) The number of ways (counting the order of the cards) in which a 5-card poker hand 
can be dealt from a deck of 52 cards is 


52' 52* 

P{ 52, 5) = - - = — = 52 • 51 • 50 • 49 • 48 = 311,875,200. 

Of course, the order of the cards in a poker hand is immaterial to the value of the hand, 
so the number of distinct poker hands is a considerably smaller number. We take account 
of this below, in our discussion of combinations. 


COMBINATIONS 


A set of k objects chosen from a given set of n objects, without regard to the order in 
which they are arranged, is called a combination of n objects taken k at a time. The total 
number of such combinations is sometimes denoted by C(n, k), but more frequently by 

^ j. For reasons to be explained, the numbers ^ are called binomial coefficients. 

Permutations and combinations are related in a simple way. Each permutation of n ob¬ 
jects taken k at a time consists of a choice of k objects (a combination) followed by an 


ordering of these k objects. But there are ^ j ways to choose k objects, and then k\ ways 
to arrange them in order, so 

Pin, k) 


Pin, k) = 


k\ 


k\ 


Our formula for Pin, k) now yields the following conclusion: 

The number of combinations of n objects taken k at a time is 


K kJ k\in-k)\ m 

The binomial coefficients have many properties, of which we mention only a few: 


0!w! 


= 1 , 


n 

n - 1 


l!(w - 1)! 


— n, 


and 


n-k 

The last fact can be established easily from the formula, or, more directly, by simply ob¬ 
serving that a choice of k objects from a set of n objects is equivalent to a choice of the 
n~k objects that are left behind. 
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Example 3 (a) The number of committees of 3 people that can be chosen from a group 

of 8 people is 

8\ _ 8! = 8-7*6 
3 ) 3!5! 2-3 

(b) A certain governmental commission is to consist of 2 economists and 3 engi¬ 
neers. If 6 economists and 5 engineers are candidates for the appointments, how 
many different commissions are possible? From the 6 economists, 2 can be chosen in 

6\ / 5 \ 

^ I ways; and from the 5 engineers, 3 can be chosen in I ^ I ways. The number of pos¬ 
sible commissions is therefore 


6 Y 5 UAT.^L = 6i5.5^. 

2f[3) 2!4! 3!2! 2 2 


150. 


(c) The number of different 5-card poker hands that can be dealt from a deck of 52 
cards is 


52! _ 52-51 -5Q-49- 48 
5!47! “ 2-3-4-5 


= 2,598,960. 


THE BINOMIAL THEOREM 

To establish the binomial theorem (2), all that is necessary is to look at (1) and observe 
that each term of the expansion can be thought of as the product of n letters, one taken 
from each factor of the product 

0 a + b)(a + b) • • • (a + b), n factors. 

Thus, a product a n ~ k b k is obtained by choosing k b's and the rest a s. The number of ways 
this can be done is ^ j. The coefficient of on the right side of (2) is therefore 

and the proof is complete. 



PROBLEMS 


1 Write in terms of factorials 

(a) 5 • 6 • 7 • 8 • 9; (b) 22 • 21 ■ 20 • 19 ■ 18 ■ 17; 

52- 51 • 50-49-48 
(C) 1 • 2 • 3 • 4 • 5 ' 

2 Compute 

8 ! 11' 15! 

( a) 5 T, (b) -gp (0 3 ^. 

(d) 4 tIf ; (e) F(22,2); (f) P(7 ’ 5) - 

3 If 6 horses run in a race, how many different orders of 
finishing are there? How many possibilities are there for 
the first 3 places (win, place, and show)? 

4 A club has 10 members. In how many ways can a pres¬ 
ident, a vice president, and a secretary be chosen? 

5 How many batting orders are possible for a baseball team 
if the 4 best hitters are the first 4 to bat? 


6 How many batting orders are possible for a baseball team 
if the fielders are the first 3 to bat and the pitcher bats 
last? 

7 How many 10-digit numbers can be formed from all 10 
digits 0, 1,2, 3,4, 5, 6, 7, 8, 9 if 0 is not allowed as the 
first digit? 

8 How many 5-digit numbers can be formed from the 10 
digits if the first digit cannot be 0 and no repetitions are 
allowed? If repetitions are allowed? 

9 How many license plates can be made using 7 symbols, 
of which the first 3 are different letters of the alphabet 
and the last 4 are digits of which the first cannot be 0? 

10 How many ways can 3 history books and 4 physics books 
be put on a shelf if books on the same subject must be 
kept together? If the history books must be kept together 
but the physics books need not be? 
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11 How many different signals can be made from 5 differ¬ 
ent flags if each signal consists of 5 flags placed one 
above the other on a flagpole? If each signal consists of 
3 flags? If each signal consists of one or more flags? 

12 In how many ways can 6 people be seated in a row of 6 
chairs? 

13 In how many ways can 6 people be seated in a row of 8 
chairs? 

14 In how many ways can 6 people be seated in a row of 6 
chairs 

(a) if a certain 2 insist on sitting next to each other? 

(b) if a certain 2 refuse to sit next to each other? 

15 In how many ways can 3 men and 3 women be seated 
in a row of 6 chairs if the men and women alternate? 

16 Compute 



17 A contractor employs 10 workers. In how many ways 
can he choose 4 of them to do a certain job? 

18 In how many ways can 12 jurors be selected from a panel 
of 20 eligible citizens? 

19 At a meeting, 28 people all shake hands with one an¬ 
other. How many handshakes are there? 

20 in an examination a student has a choice of any 10 out 
of 12 questions. In how many ways can she choose her 
questions? 

21 From a committee of 10, in how many ways can one 
choose a subcommittee of 4 and another subcommittee 
of 3, with no overlap? 

22 How many committees of 4 can be chosen from a group 
of 12 men? How many of these will include a specific 
man? How many will exclude this man? 

23 in how many ways can a committee of 5 men and 4 
women be chosen from a club of 10 men and 7 women? 

24 in how many ways can we select a committee of 5 from 
a group of 11 


(a) if 2 of the group insist on serving together or not at 
all? 

(b) if 2 of the group refuse to serve together? 

25 in how many different orders can we shelve sets of 5 
books, each set consisting of 3 history books and 2 chem¬ 
istry books, if the books are to be chosen from a set of 
9 history books and 7 chemistry books? 

26 Three bags contain 8 black, 6 white, and 10 red marbles, 
respectively. In how many ways can we choose 6 black, 
4 white, and 7 red marbles? 

27 From a deck of 52 cards, how many 5-card poker hands 
are flushes (all cards of the same suit)? How many are 
full houses (3 cards of one kind together with a pair of 
another kind)? 

28 How many lines are determined by 12 points in a plane 
if no 3 of the points are collinear? 

29 How many triangles are determined by 13 points in a 
plane if no 3 of the points are collinear? 

30 How many lines are determined by m points in a plane 
if k of them (where k < m) lie on the same line and, ex¬ 
cept for these, no 3 of the points are collinear? 

31 How many planes are determined by 9 points in space if 
no 4 of the points are coplanar? 

32 How many rectangles are formed by 5 vertical lines in¬ 
tersecting 8 horizontal lines? 

33 How many diagonals can be drawn in a regular polygon 
with n sides? (A diagonal is a segment joining 2 nonad- 
jacent vertices.) 

34 Use the binomial theorem to expand each of the follow¬ 
ing: 

(a) (2 jc - y) 7 ; (b) (3a + 2 b] 6 \ 

(c) (x 2 y - 3 z 3 ) 5 . 

35 Find the term of 

(a) (x-4) 15 that involves x u \ 

(b) (2 jc - 3) 9 that involves x 4 ; 

(c) (a 2 - b 3 ) 14 that involves a 10 . 


Discoveries in mathematics are sometimes made by carefully examining empirical evi¬ 
dence. As an illustration, let us try to find a formula for the sum of the first n odd num¬ 
bers, where n is any positive integer. We compute: 


for n - 

= 1, 

1 = 

1 






= i 2 . 

for n - 

= 2, 

1 + 

3 

= 

4 




= 2 2 , 

for n - 

= 3, 

1 + 

3 

+ 

5 

= 

9 


= 3 2 , 

for n - 

= 4, 

1 + 

3 

+ 

5 

+ 

7 = 

16 

= 4 2 , 

for n - 

= 5, 

1 + 

3 

+ 

5 

+ 

7 + 

9 = 25 

= 5 2 . 
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The pattern emerging from this evidence seems to suggest that the value of the sum al¬ 
ways equals the square of the number in terms in the sum. Since 2a — 1 is the ath odd 
number, we can formulate this conjecture as follows: 
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1 + 3 + 5 + • • • + (2n - 1) =n 2 (1) 

for every positive integer n. 

The evidence for (1) is suggestive but far from conclusive. If we continue to test our 
conjecture for n = 6, 7, 8, and so on, and if it continues to hold up for these additional 
values of n, then this will certainly increase our confidence that (1) is probably true for 
every positive integer n. However, verifications of this kind can never constitute a proof, 
no matter how far they may be carried. If we verify (1) for all values of n up to n = 1000, 
then the logical possibility still remains that (1) might fail to be true for n = 1001.* There 
is an infinite chasm between “probably true” and “absolutely certain.” What is needed is 
a logical argument proving that (1) is always true, for all values of n, beyond any doubt 
whatsoever. This is what the method of proof by mathematical induction accomplishes. 
We explain this method of reasoning by showing how it works in the case of formula (1), 
and then we state it as a formal principle. 

Example 1 To prove (I) by mathematical induction, we begin by observing that this 
formula is true for n = 1, because it reduces to I = l 2 . (We already knew this.) We next 
prove that if k is a value of n for which (1) is true, then (1) is necessarily also true for the 
next integer, n = k + 1. Thus, we assume that (I) is true for n = k, 

1 + 3 + 5 + ■ • • + (2k - 1) = k 2 . (2) 

With the aid of this hypothesis we try to prove that (1) is also true for n = k + 1, 

1 + 3 + 5 + ■ • • + (2k - 1) + (2k + 1) = (k + l) 2 . (3) 

(The next-to-the-last term on the left here is displayed for the sake of clarity in our next 
step.) By using (2) we see that the left side of (3) can be written as 

1 + 3 + 5 + • • • + (2fc - 1) + (2k + 1) = k 2 + (2k + 1) 

= (*+ l) 2 , 

so (3) is true if (2) is true. But this is enough to guarantee that (1) is actually true for all 
n. To see this, suppose we wish to assure ourselves that (I) is true for some specific value 
of n, say n = 37. The reasoning is as follows: We know by actual computation that (1) is 
true for n = 1; since it is true for n— 1, the argument just given tells us that it is also true 
for n = 2; since it is true for n = 2, it must be true for n = 3; and so on, up to n = 37 (or 
any other value of ri). 


Our main principle is little more than a distillation of the essence of this example. 


Principle of Mathematical Induction Let S(n) be a proposition depending on a pos¬ 
itive integer n . f Suppose that each of the following conditions is known to be satisfied. 

I 5(1) is true. 

II If S(n) is assumed to be true for an integer n = k, then it is necessarily true for the 
next integer, n = k + 1. 

Under these circumstances it follows that S(n) is true for every positive integer n. 


*As a trivial illustration of this point, the equation 

n 2 - 1 = (n + \)(n - 1) + [(n - 1 )(n - 2) • • • (n - 1000)] 

is clearly true for the first thousand values of n, because the bracketed expression is zero, and yet it 
is false for n = 1001, 1002, .... 

f This means that for each specific value of n, S(n ) is a statement that is either true or false, without 
any ambiguity. 
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Briefly, if we write down the propositions S(n) in order, 

5( 1), 5(2), 5(3),..., 


then the process of verification is started by I, and II is a link from each to the next guar¬ 
anteeing that the process continues without end. 

The idea of induction can be illustrated in many nonmathematical ways. For instance, 
imagine a row of dominoes standing on end. Suppose they are spaced in such a way that 
if any one of them falls, then it will knock over the next one. Suppose further that we ac¬ 
tually knock over the first domino. In this situation we know that all the dominoes will 
fall. Our knowledge is based on two facts, which are closely analogous to I and II: 

(i) The first domino does fall, because we knock it over. 

(ii) //any domino falls, then it will knock over the next one. 

We must be careful with the meaning of (ii); it does not state that any domino actually 
does fall, only that each domino is related to the next one in a certain way. 

We continue with two additional examples of the method, in which we establish two 
formulas that are needed in Chapter 6. These are formulas for the sum of the first n pos¬ 
itive integers, and for the sum of the first n squares: 



(4) 


and 


n{n + 1)(2 n + 1) 
6 


l 2 + 2 2 + 3 2 + • • • + n 2 = 


(5) 


Several of the remarks and problems that follow are concerned with the natural question 
of how such formulas can be discovered and understood. For the moment, however, we 
confine our attention to proving them by the method of mathematical induction. 

Example 2 To prove (4) by induction, we start by verifying I in this case, that is, we 
note that (4) is obviously true for n = 1: 


To verify II, we begin by assuming (4) for n = k, 


l+2 + 3+ -- -+ * = + 

O ’ 


( 6 ) 




(7) 


By using (6) we can write the left side of (7) as 


1 + 2 + 3 + •■■+&+(& + 1) = — ' — —- + (k + 1) 



(*+!)(*+ 2 ) 
2 


Condition II is therefore satisfied, so by induction (4) is valid for all positive integers n. 
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Example 3 The proof of (5) is just as easy. To verify I, put n = 1: 


1 2 = 


1-2-3 
6 ' 


To verify II, we must assume 


l 2 + 2 2 + 3 2 + • • • + k 2 = 


k(k + 1)(2 k± 1) 


and use this to prove 

l 2 + 2 2 + 3 2 + • • • + k 2 + (k + l) 2 = 
The details are routine, 


(k + 1)(* + 2)(2 k + 3) 


l 2 + 2 2 + 3 2 + • • • + k 2 + (k + l) 2 = k(k+ 1} i 2<: + - + (k + l) 2 


= (k + 1 ) 


= (*+ 1 ) 


k(2k + 1) 


+ (k+ 1 ) 


2k * 2 + Ik+ 6 


(k + l)(k + 2)(2 k + 3) 


so by induction the proof of (5) is complete. 


Remark 1 Mathematical induction is a venerable method of proof that every student of 
mathematics ought to understand. Our purpose here has been to explain this method, and 
also to illustrate its use by proving two formulas [(4) and (5)] that are necessary for other 
parts of our work. However, much remains to be said. 

Proofs by induction produce belief without insight, and are therefore fundamentally un¬ 
satisfying. It is important to know that a mathematical theorem is true, but it is often more 
important to understand why it is true. There are other proofs of formulas (1), (4), and (5) 
which convey much more insight into these formulas, and which also suggest how they 
might have been discovered in the first place. We begin with (4). 

If we denote the sum of the integers from 1 to n by S, so that 

S = 1 + 2 + •■• + (« — 1) + m, 


then it might occur to us to write this sum in the reverse order, as 
S=H + (rt — l) + -** + 2 + l. 

If we now notice that the two first terms on the right add up to n + 1, and also the two 
second terms, and so on, then it is natural to add these two equations together to get 


2 S = n(n +1) or 


n(n + 1) 
2 


These ideas serve to discover formula (4) and also to prove it, simultaneously. 

We next turn to formula (1), which again we discover and prove at a single stroke. Con¬ 
sider the sum of the first n odd numbers, 

1 + 3 + 5 + • • • + (2 n — 1). 

We notice certain obvious gaps in this sum, where the even numbers ought to be. If we 
fill in these gaps, and at the same time compensate for this filling in, then, using (4), we 
easily obtain 
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1 + 3 + 5 + ■ • ■ + (2n -‘1) = (1 + 2 + 3 + ■ • • + In) - (2 + 4 + • • ■ + 2n) 

= (1 + 2 + 3 + ■ ■ ■ + 2n) - 2(1 + 2 + • • ■ + n) 

_ 2n(2n + 1) n(n + 1) 

“2 '2 

= In 2 + n — n 2 — n = n 2 , 


which is (1). 

We have discovered and proved the formula for the sum of the first n positive integers, 

1 + 2 + 3+ -- ' + n = + ^ (4) 

2 • 

It is somewhat more difficult to discover (5), that is, a formula for the sum of the first n 
squares, 

l 2 + 2 2 + 3 2 + • • • + n 2 . 


We know the answer by Example 3, but let us disregard this for a moment and try to think 
how we might discover it. It is natural to consider the two sums together: 


n 

1 

2 

3 

4 

5 

6 

1 + 2 + ■ ■ • + n 

1 

3 

6 

10 

15 

21 

l 2 + 2 2 + ■ • • + n 2 

1 

5 

14 

30 

55 

91 

How are these sums related? It might occur to us to consider their ratio: 

n 

1 

2 

3 

4 

5 

6 

1 + 2 + ■ ■ ■ + n 

1 

3 

3 

1 

3 

3 

l 2 + 2 2 + • • • + n 2 

1 

5 

7 

3 

11 

13 

If we write these ratios in the form 







3_ 3_ 

_3 

3 

3 

3 



3 5 

7 

9 

11 

13 

* * 



then it is difficult to miss the pattern that emerges. It seems clear that 

1 + 2 + • • • + n _ 3 

l 2 + 2 2 + • ■ • + n 2 ~ 2n + \* 

and by using (4) we easily find that 

12 + 2 2 + ... +b 2 = "(”+1)(2«+1) j (5) 

6 

This is certainly not a proof of (5). Nevertheless, it presents us with a plausible conjec¬ 
ture which we can then try to prove by induction, as we have done in Example 3. 


Remark 2 There is another very ingenious way of discovering (5) that also constitutes 
a proof. It begins with the expansion 

(k + \) 3 = k 3 + 3 k 2 + 3*+ 1, 

expressed in the more convenient form 

(k + l) 3 -k 3 = 3 k 2 + 3* + 1. 

If we write down this identity for k = 1, 2,. . . , n and add, then by taking advantage of 
wholesale cancellations we find that 
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2 3 - l 3 = 3 • l 2 + 3 • 1 + 1 
3 3 — 2 3 = 3 • 2 2 + 3 • 2 + 1 


(ft + 1 ) 3 - n? = 3 • ft 2 + 3 • n + 1 

(n + l) 3 - l 3 = 3(1 2 + 2 2 + • • • + ft 2 ) + 3(1 + 2 + ■■■+«) + n 

This enables us to obtain a formula for the sum of the squares in terms of our known for¬ 
mula (4) for the sum 1+2 + ••• + «: 

l 2 + 2 2 + • • * + n 2 = }[ft 3 + 3 n 2 + 3 n - f n(n + 1) - n] 

= ^(2ft 3 + 6 n 2 + 6 n — 3 n 2 -3 n— 2 n) 


= ( 2 n 2 + 3 n + 1 ) 

6 


n(n + 1)(2 n + 1) 

6 


The idea of this proof is due to the great French writer-scientist-mathematician-theolo- 
gian Blaise Pascal. It can be extended quite easily to yield the sum of the first n cubes, 


l 3 + 2 3 + • ■ ■ + ft 3 


n(n + 1) 1 2 
2 


the sum of the first n fourth powers, and so on indefinitely. 


( 8 ) 


Remark 3 Mathematical induction as a method of demonstrative proof originated in the 
work of Pascal on the binomial coefficients. The interested reader will find this work de¬ 
scribed and quoted in vol. 1 of G. Polya’s remarkable book. Mathematical Discovery (Wi¬ 
ley, 1962), pp. 73-75. 


PROBLEMS 


1 Use (1) and (4) to find a formula for each of the following: 

(a) 2 + 4 + 6 + • • • + 2 n\ 

(b) (ft + 1) + (ft + 2) + (n + 3) + • • • + 3ft; 

(c) 1 + 3 + 5 + • • • + (4ft - 1); 

(d) (2ft + 1) + (2ft + 3) + (2ft + 5) + • • • + (4ft - 1); 

(e) 3 + 8 + 13 + • • • + (5ft - 2). 

2 Discover a formula for l 2 + 3 2 + 5 2 + • • • + (2ft - 1 ) 2 by 
using its relation to the sum l 2 + 2 2 + 3 2 + • • • + (2ft) 2 . 

3 Prove each of the following by induction: 

( , _J_ 1 _1_ = n 

W 1 *2 2-3 3-4 ft(ft+l) ft + 1’ 

(b) 1 ■ 2 + 2 ■ 3 + 3 • 4 + • ■ • + «(« + l) = 

ft(ft + l)(ft + 2) 

3 ; 

(c) 1 • 3 + TT + Tl + " ' + (2n - l)(2n + 1) " 

ft 

2ft + r 

(d) 1 ■ 3 + 3 ■ 5 + 5 ■ 7 + • • • + (2ft — l)(2ft + 1) = 

ft(4ft 2 + 6ft — 1) 


Prove (a) without using mathematical induction, by means 
of the algebraic identity 


1 


1 


1 


«(« +i) ft ft + r 

also, devise a similar method of proving (c). 
4 Prove each of the following by induction: 


(a) !+{ + } + 


(b) 1 + r + r 2 + 


+ — = 2 -f 

2 n 2 n 


+ r n = ■ 


1 


y.n-\~ 1 


1 - r 


(r± 1); 


1 2 3 

(c) 2 + 2^ + 


n_ _ j _ (” + 2 ) 
2 " 2 " ’ 


(d) r + 2r 2 + 3r 3 + ■ ■ • + nr" = 
r- (n + \)r n+l 


(e) - + —|-j- + 


(1 - r ) 2 
+ •••+■ 


2 ft - 1 


(r+ 1); 


+ ,. + . 
2 3 4 


3 


2ft - r 
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(f) 


2 4 2" 

+ . . , + ——r+ ■ ■ • + 


1 + * 1 + x 


1 + x 4 

_1 _ 2 n+l 

c — 1 + 1 — X 2 "* 


1 + X 2 " 

(x±±\y. 


(g) 1 3 + 2 3 + 3 3 + • • • + n 3 = 

(h) l 4 + 2 4 + 3 4 + • • • + n 4 = 

1 


—n(n + 1) 


— n(n + l)(6n 3 + 9 n 2 + n - 1). 


(1 -i)d - t)(1 -?) = {, 

(1 - 1)(1 - j)(l - |)(1 - i) = j. 

7 Guess the formulas that simplify the following products, 
and prove them by induction: 



(b) (1 - *)(1 + x)(l + x 2 )(l + x 4 ) • • • (1 + x r ). 
8 Let S(n) be the following statement: 


5 Use the method of Remark 2 to discover and prove the 
formulas in parts (g) and (h) of Problem 4. 

6 In each of the following, guess the general law suggested 
by the given facts and prove it by induction: 

(a) 1 = 1, 

1 - 4 = -(1 + 2), 

1 —4+9 = l+ 2 + 3, 

1 - 4 + 9 - 16 = -(1 + 2 + 3 + 4); 

(b) 1 - 1 = b 

(i -i)d -T) = T. 


+ 2 + 3 + 


+ n = 


(n - l)(n + 2) 


(a) Prove that if S(n) is true for n = k, then it is also true 
for n = k + 1. 


(b) Criticize the assertion, “By induction we therefore 
know that S(n) is true for all positive integers n.” 











CHAPTER 1 
Section 1.2, p. 8 

1- (a) Rational; (b) integer, rational; 

(c) integer, rational; (d) rational; 

(e) integer, rational; (f) irrational; 

(g) integer, rational; (h) irrational; 

(i) rational; (j) rational. 

3-11. 5. tt — 3 . 

7-5-jc. 9. x 2 + 10 . 

11 3jc 2 — 1. 

13. (a) x < 0 and jc > 1 ; (b) -2 < 
jc < 1; (c) jc < — 7 and jc > 3 ; 

(d) -1 < jc < 1; (e) -3 < x < {; 

(f) all x. 

15- (a) jc > 0 ; (b) -2 < jc < 0 and 
jc > 2; (c) jc < — 1 and jc > 3; (d) jc < 
— 1, 0 < jc < 1, and jc > 3. 

17- a = b. 

19. (a) Vertical; (b) horizontal; 

(c) horizontal; (d) vertical; 

(e) horizontal; (f) vertical; (g) vertical; 

(h) horizontal. 

21 - (a) 5V2; (b) Vl3; (c) V89; 

(d) | a - b\ V 2 . 

27. Center ( — 2 , f), radius yV 113. 

29 - (- 1 . - 1 ). 

31 ■ Symmetric with respect to the 
straight line through the origin that 
bisects the first and third quadrants. 

33. ^V 2 h. 

Section 1.3, p. 14 

1 (a) (b) f; (c) i; (d) - 1 ; (e) 0 ; 

(f) 10. 

3- (a) Yes; (b) no; (c) no; (d) yes. 

7- (a) y = —4* + 5; (b) 3* + 7y = 2; 


ANSWERS TO 
ODD-NUMBERED 

PROBLEMS 


(c) 2jc - 3 y = 12; (d) y = -4; 

(e) jc = 1; (f) jc + 3y + 2 = 0; (g) jc + 
2y = 11; (h) 3y — 2x = 17; (i) jc + 

2y = 9;(j)* + y = 1. 


13. p = fC + 32 or C = f(F - 32). 

Section 1.4, p. 22 

I • (a) (jc — 4 ) 2 + (y — 6) 2 = 9; 

(b) (jc + 3 ) 2 + (y - l) 2 = 5; 

(c) (jc+ 5 ) 2 + (y+9 ) 2 = 49; 

(d) (jt — l ) 2 + (y + 6) 2 = 2 ; 

(e) (jc — a) 2 + y 2 = a 2 or x 2 + y 2 = 
2ax\ (f) x 2 + (y — a) 2 = a 2 or x 2 + 
y 2 = 2 ay. 

3- (a) Circle, center (2, 2) and radius 
2 V 2 ; (b) point (9, 7); (c) circle, center 
(—4, -5) and radius 1; (d) circle, 
center ( — f, 4) and radius 3; (e) empty; 

(f) point ( 7 V 2 , -yV 2 ); (g) circle, 
center ( 8 , -3) and radius 11. 

5- Distinct real roots, b 2 - 4 ac > 0; 
equal real roots, b 2 — 4 ac = 0 ; no real 
roots, b 2 — 4 ac < 0 . 

7- y = ± 2 V 2 r + 4. 

9- (a) y 2 = - 12 *; (b) * 2 = 4y; 

(c) y 2 = 8 *; (d) 3 * 2 = —4y; (e) y 2 + 
12 * + 12 = 0 ; (f) * 2 - 6 * - 8 y + 

17 = 0. 

II 20 ft. 


Section 1.5, p. 28 

I- (a) 42; (b) 17; (c) -3; (d) 32; 
(e) 5a 2 + 30a + 42; (f) 125r 2 - 3. 
3-5. 5.2* +A. 


jc(jc + h) ' 

9- /(1) =0, /(2) = 2, /(3) = 10, 

/( 0 ) = — 2 ,/(-l)= — 10 ,/(— 2 ) = 
-30. 

13. (a) * > 0 ; (b) * s 0 ; (c) all *; 

(d) jc < - 2 , jc > 2 ; (e) all jc except 2 , 
- 2 ; (f) all x; (g) x < - 2 , jc > 1 ; 

(h) jc < —2, jc > 1 ; (i) -3 < jc < 1; 
(j)x< 0 , x>2. 

15. /( 0 ) = 0 ,/(l) does not exist, 

/(2) = 2,/(3) = y,/(/(3)) = 3. In the 
last part, it is tacitly understood that jc 
is restricted to those values for which 
/(/(jc)) exists: that is, jc ^ 1 . 

1 ^• /( 0 ) = 1 , /( 1 ) does not exist, 

/(2) = —1,/(/(2)) = y,/(/(/(2))) = 2. 
19 -/(*,)/'(*2 )=/(*. +* 2 ). 

2 1 • No; it is true if and only if ad + 
b = be + d. 

23. (a)a=4,fr = -5, c = 3. 

25. y = —x + V3 — x 2 and y = -x - 
V3 - x 2 

27. A = {jcVTb^. 

29. A = JcV4a 2 - jc 2 . 

c 2 

31- (a) Yes, A = -—; (b) yes, A f= 

leP ; (c) no. _ 

33. \/ = 27 rr 2 \ a 2 — r 2 , A = 2irr 2 + 

47 rr Vo 2 — r 2 ; V = ^ 7 r( 4 fl 2 /z — /z 3 ), 

A = y7r(4fl 2 — /z 2 ) + irh^/Aa 2 — h 2 . 
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ANSWERS TO ODD-NUMBERED PROBLEMS 


35. A = 2m 1 + —. 

r 

37. (a) Largest area = 625 ft 2 , both 
sides = 25 ft; (b) A = 1 OOx - lx 2 = 
1250 - 2(x — 25) 2 , largest area = 
1250 ft 2 , sides = 25 and 50 ft. 

Section 1.6, p. 37 

!• (a) i 






3- (a) 


1 2 3 


(b) 

Ml 

i i 

i i 


1 2 3 

(c) 

l 

(d) 

1 

II 
/ 1 
/ 1 
/ 1 
/ 1 
/ 1 

1 

1 

/ 1 

/ 1 

r i 


3 




7. Only (b). 


Section 1.7, p. 45 

77 ^ 577 
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ANSWERS TO ODD-NUMBERED PROBLEMS 


3. (a) -y; (b) yV3; (c) -fV3; 

(d) {V2; (e) -h (f) -1V2. 

7. (a) sin y; (b) sin 0; (c) -sin y; 

(d) —sin y; (e) cos 0; (f) cos y; 


(g) -cos y; (h) sin y; (i) cos y. 

11 - sin 15° = f V 2 - V3; cos 15° = 

fVTTvf. 

1 3. (a) cos y = yV 2 ; (b) cos yy = 


-V2. 


15. (a) sin 


27T _ V5 


1 . /r . . 17^ 

V 2 ; (c) sin — 

19. iV2(V3 + 1). 


2 

17tt = 1 


■; (b) cos ■ 


' 2 * 


Additional Problems, p. 47 

9. No, to both questions. 

15. ( y\ - y 2 )x + (x 2 - *i )y = x 2 y\ - 


xiy 2 - 


(C) 


l a + b 
l 3 ’ 3j- 


23. (a) * — 7;y + 5 = 0, 7* + y — 
1 5 = 0 ; (b) x = ( 1 ± V 2 )y. 

25. \b\ < 2Vl0. 

27. (a) (* - fa ) 2 + y 2 = fa 2 ; 

(b) (* 2 + t 2 ) 2 = 2 a 2 (* 2 - /). 


31. lx + y = 10 and x - y + 2 = 0 . 
33. y = —2x + 2; (0, 2) and (f, y). 
35. x 2 + y 2 - 2xy - 4x - 4y 4- 4 = 


0 . 

37. The line is x = 2/?m. 

41. No. 43. g(x) = x 3 . 

45. V = yAr — 77T 3 . 


47. 

49. 

y = 


2 

V=-na 


ad ~ be 


r 2 — a 2 

‘.P = 


-b 


ad — be ’ 


—c 


ad — be 


, 5 = 


ad~ be' 


51. (x — l)(x - 2 ) ••■(*- n); 

+ 1 ; 

53. (a) Odd; (b) even; (c) even; 


(d) odd; (e) neither; (f) odd; 

(g) neither; (h) neither. 

55. (a) Even; (b) even; (c) odd. 
57. j = 275(* - \)(x - 2) - 




(d) 



3). 





67. > 2 . 


CHAPTER 2 


Section 2.2, p. 57 

I . (a) Ax + y + 4 = 0; (b) 8* - y = 
16; (c) 8*-;y = 16. 

5. (a) 2*o - 4; (b) 2* 0 - 2; (c) 4* 0 ; 
(d) 2* 0 . 

7. 8* + ;y + 7 = 0. 

II. y = 4x + 1, y = — 4x + 25. 


Section 2.3, p. 62 

3. -16*. 

7. -10 + 30*. 

II. 3500 - 14*. 
15. -32*-40. 
19. (3, 0). 

23. 5 - 3* 2 . 

27. 1 + X. 

X 1 


5. -72. 

9. 6y + 7. 


13. 

17. 

21 . 

25. 

29. 


\0x + 25. 

( 0 , 6 ). 
( 10 , 100 ). 

6x 2 - 6x + 6 . 
1 

(x+ 1 ) 2 ‘ 



35. 


- 2 Qc 2 + 1 ) 
C * 2 - l ) 2 ' 


33. 

37. 


—2x 

(x 2 + l) 2 * 
1 

2Vx - 1 


39. (b) Area = 2. 

43. g'(0) = 0;y = 3. 

45 . ( 1 , 1 ). 

Section 2.4, p. 67 

I . v = 6t - 12 ; (a) t = 2 , (b) t > 2 . 

3. v = At + 28; (a) t = -7, 

(b) t > -7. 

5. v = 14r; (a) t = 0, (b) t > 0. 

7. v= 8t - 24; (a) r = 3, (b) f > 3. 

9. (a) 7 seconds; (b) 48 ft/s; 

(c) 176 ft/s; (d) 224 ft/s. 

II. (a) 12 ; (b) 6 ; (c) 18. 

13. 10 seconds. 

15. (a) 3200 gal/min; (b) 2400 gal/min. 
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17. drldt decreases as r increases. 

Section 2.5, p. 73 

1 . 15. 3. -5. 

5. 3. 7. -3. 

9. 4. 11. 5. 

13. 0 15. f 

17. (a) 6 ; (b) 4; (c) -2; (d) 0; (e) does 

not exist; (f) 

19. (a) 5; (b) {; (c) 0; (d) 1; (e) 1; 

(f) t; (g) l 
23. See Fig. 2.20. 

25. (a) Limit = 1; (b)* = 0.4. 

Section 2.6, p. 79 

1 . (a) None; (b) 1, -1; (c) 1; (d) all 
x<0; (e) all * < 0; (f) none; (g) 3, 
-4; (h) none. [Remember that a 
function is automatically discontinuous 
at every point not in its domain; thus, 
1/x is discontinuous at x = 0 even 
though it is a continuous function.] 

3. i 5. 3. 

7. i 9. 1. 

15. (a) y at x = ir/ 6 ; (b) yV2 at x = 
7t/4; (c) 1 at x = 7t/2. 

17. (a) Yes; (b) yes. 

19. (a) Yes, at * = 0; (b) yes, at x = 0; 

(c) no; (d) yes, at x = 0 . 

21. (a) No; (b) yes, at* = 0. 

23. No maximum, minimum = 1. 

25. Maximum = 1, no minimum. 

27. Maximum = —3, minimum = - 8 . 
29. No maximum, minimum = 2. 

Additional Problems, p. 81 

1 . b= - 6 . 

5. (b) Drop the perpendicular from P 
to a point A on the axis of the 
parabola. Draw the circle whose center 
is the vertex V and which passes 
through A. Let B be the second point 
at which this circle intersects the axis, 
and draw the line PB. This line will be 
tangent to the parabola at P 
7. (a) x = 0; (b) x = ±2; (c) x = y; 

(d) differentiable at all points. 

13. m = 2a, b = —a 2 . 

15. When t = {; 8 ft/s. 

19. Does not exist. 

21. -5. 

23. Does not exist. 

25. Does not exist. 

27. 2. 29. 2. 

31. -3. 33. j. 


35. -5. 37. y. 

39. 4. 41. 3a/2. 

43. 1. 45. 0. 

47. Does not exist. 

49. Does not exist. 

51. 3. 53. 0. 

55. 0. 57. 1. 

59. lim /(*), lim /(*), andlim/(*) 

; c -» 0 + x —> 0 - *-»0 

do not exist. 

61. Because there are rationals as 
close as we please to every irrational, 
and irrationals as close as we please to 
every rational. 

63. Slope = 0.693. 


CHAPTER 3 

Section 3.1, p. 87 

I . (a) 54* 8 ; (b) 0; (c) -60* 3 ; 

(d) 1500x"(x 40 ° + 1 ); (e) 2 x - 6 ; 

(f) x 4 + x 3 + x 2 + x + 1; (g) 4x 3 + 
3x 2 + 2x + 1; (h) 5x 4 - 40x 3 + 

120 x 2 - 16Qx + 80; (i) 12 x(x 10 + 
x 4 x 1); (j) 18x 2 — 6 x + 4. 

3. (a) v = — 6 + 6 r, a = 6 ; (b) v = 

-9 + 18r 2 , a = 36 r, (c) v = 18r - 12, 
a = 18. 

5. y = lx — 10. 

7. (1, -2) and C-h “§)■ 

II. a = 1, 3. 



15. (a) 3 ac < b 2 \ (b) 3 ac = b 2 ’, 

(c) 3ac > b 2 . 

17. y — a 3 = 3 a 2 (x - a)’, all a =£ 0 . 
19. y = 12* — 16 and y = 3* + 2. 
23. (-1, -2). 

Section 3.2, p. 91 

1 . 2*. 

3. 15* 4 + 57 * 2 + 6 . 

5. 18* 2 + 2*- 1. 

7. 72* 5 + 20 * 4 + 6 * + 1. 

9 ~ 2 

U - l) 2 ’ 

4x 3 + 12x 2 - 1 

' ' (x+ 2f ' 

3-6x2 -4* 

1 (1 + 2 x 2 ) 2 ’ ( 1 +x 2 ) 2 ’ 

—4x -3 

(x2-l) 2 ' ly ' (2x-3) 2 ' 

-x 2 + 2x + 3 

*' (x 2 + 2 x+ l) 2 ' 


10x 3 - 36x 2 + 42x 
(5x - 7) 2 

288x 10 - 360x 5 
(24x 5 - 5) 2 ' 


27. 

29. 


—x 2 — 2 x + 1 
’ x 2 (x - l ) 2 


2 x 4 - 2 


33. 

* J 

35. (3, 2) and (-3, -2). 

37. Two; (-3 ± V5)/2. 

39. 4* + 5y = 13, 5* - 4 y = 6 . 
41. 2y = x + 2 . 

43. * - 3y = 2, 3* + y = 6 . 

45. Area = 1. 

47. (0, 2), (±1, 1). 


Section 3.3, p. 97 

. 10 

' (2 — 5x) 3 ' 

3. 6(x + 2)(x 2 + 4x - l) 2 . 

5. (13 - 8x)(5 - x) 2 (4 + x) 4 . 
-12 


7. 


(3x+ l) 5 ' 
12 (x 3 - l ) 3 


9. -36(1 - 6 x) 5 . 


,13 


13. 16(2x + 1) 3 [1 + (2x+ l) 4 ]. 

,, —5(x 5 — 5)(x 5 — l ) 3 

15. ^ 26 

17. 4(x 5 - 3x ) 3 • (5x 4 - 3). 

19. 6 (x + x 2 — 2x 5 ) 5 • (1 + 2x - 10x 4 ). 
4x 

21 - 

(12 -x 2 ) 3 ' 

23. 7(x 2 + 3x - 5 ) 6 • (2x + 3). 

25. -6(3x 2 - 5x + 2 ) -7 • ( 6 x - 5). 

27. 4(5x + 3) 3 (4x - 3) 6 (55x + 6 ). 
2x(x 2 + 9) 


(9 - x 2 ) 3 


31. 2(2x - 3) 7 (3x 2 - x + 2) 9 (84x : 
108x + 31). 

-2(2 1- \) 2 (t 2 — 2t — 9) 


2 _ 


33. 

35. 


(t 2 + 3 ) 3 


72 


(5 - 4r) 4 ' 

37. 5(2x 2 + 5x - 3) 4 (4x + 5). 


39. 


20(3x + l ) 3 


41. 50x - 55. 


(1 - 2 x ) 5 ’ 

43. y = 16x - 15. 

45. (a) 3 « 2 (b) 4(2« - 1) 

dx dx 
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(c) 4 u(u 2 — 2) 


du 

dx‘ 


Section 3.4, p. 101 

1. 5 cos (5x — 2). 

3. -cos (cos x) • sin x. 
5. 3 sin 2 x ■ cos x. 


7. -. 9. 3x 2 cos x 3 . 

1 + cos x 

11.5 sec 2 5x. 

13. sec 2 (sin x) • cos x. 

15. 8 x(l + tan 2 x 2 )tan x 2 • sec 2 x 2 . 

17. 15(cos 3x + sin 5x). 

19. —45(5x - 3 ) 2 sin 3(5x - 3) 3 . 


2 \. __ 23. 

(1 — sin x) 2 ' 

lt . —x sin x — cos x 


3 

1 + cos 3x‘ 


27. 3x 2 sin -y - 2 cos 

x z x z 

29. 6 sin x cos x (2 - cos 2 x) 2 . 

31. cos (tan x) • sec 2 x. 

35. 2tur + or 2nir — n an 
6 6 

integer. 

37. (a) The particle starts at s = A 
when t = 0, moves to s = —A when 
t = 7 r/k, and moves back to s = A 
when t = 27 r/k. This oscillatory motion 
continues with period 27 r/k. (b) v = 
—Ak sin kt. (c) v = 0 when x = ±A 
and \v\ has its largest value Ak when 
s’ = 0. (d) a = -Ak 2 cos kt = — k 2 s. 


Section 3.5, p. 107 

3x^_ 

4y 2 


1 . 


3. 


3x 2 ~ 4 y 
3 y 2 + 4x' 


7. 


1 


1 - ly 6 ' 



9. 



11 


2 

3x 2 ’ 


13. 


±3 

2V3x - 1 ’ 


15. 


-2 

(1 + x) 2 ' 


17. 


±9x 


2V36 - 9 x 2 


19. fx-' /s . 21. -fx 

23. fx- 3/5 . 

2 , 3(x 3 - 16) J x 2 

4x 3 V x 3 + 8' 

9_ (x + 2) l/2 
2 (x - 1) 5/2 


-7/4 


29. (a) x + 4y = 7; (b) x + 2y = 4; 
(c)x+ 3y = 0; (d) x + 3y = 19. 

33. (a) 2 x 3/2 ; (b) 2x 5/2 . 


35. 


39. 


y 

cos y — x ’ 


cos x 
3y 2 + 2y 


41. 


cos Vx 
2Vx 


^ tan Vx sec 2 Vx 


5 sin 2x 
V6 — 5 cos 2x 

3 sec 2 (3x — 1)~ 1/2 
2(3x - 1) 3/2 


Section 3.6, p. 110 

I. (a) 8, 0, 0, 0; (b) 16x - 11, 16, 0, 
0; (c) 24x 2 + 14x - 1,48x + 14, 48, 
0; (d) 4x 3 - 39x 2 + lOx + 3, 12x 2 - 
78x + 10, 24x - 78, 24; (e) fx 3/2 , 

il v l/2 i5 _|/2 _ 15 -3/2 

3. (a) n!(l - x)-(" +l> ; 

(b) ( —1 )"n! 3"(1 + 3x) -(n+1) ; 

(c) (— l)" +1 n!(l +x)-(" +1) . 

, (n — 1 )a n x n ~ 2 

y 2«-i • 

7. (a)r = y, s = 0, v= 12; (b) r = 4, 
s = 32, v = 6; (c) t = 1, s = 6, v = 
-3. 

9. 3, j. 

13. (a) sin x, cos x; (b) — sinx, —cosx; 
(c) sin x, cos x; (d) — cos x, sin x. 


Additional Problems, p. Ill 

I. (-1, 10) and (3, -22). 

3. (1, 2) and (-1,-2); the smallest 
slope = 1 , at ( 0 , 0 ). 

5. Slope = 4x 3 — 4x; x = 0, ± 1; 

-1 < x < 0 , x > 1 . 

7. a = 1, b = 1, c = 0. 

9. a = 1, b = 0, c = -1. 

13. a = 1 ,b= ~2,c = 2,d= -1. 
17. ( 6 , 9), (-2, 1), (-4, 4). 


19. 


(a) 


—4x 

(x 2 - D 2 


; (b) 


—4(x + 1). 

u - i ) 3 ; 


(c) 

(e) 


x(4 - x 3 ) 
(x 3 + 2) 2 


; (d) 


x 2 (3 - x 2 ) 
(1 -x 2 ) 2 


; (f) 


- 2 x 2 — 6 x — 11 
(x 2 + x — 4) 2 ’ 

- 2 

(I +X) 2; 


(g) 


1 8 x 4 - 24x 3 - 9 
(x - 1 ) 2 


; (h) 


-10C* + 3) 
(x - 2) 3 


23. (a) (x + 2)(x + 3) + (x + 1 )(x + 3) + 
(x +!)(* + 2); (b) (x 3 + 3x 2 )(x 4 + 4) X 
(2x + 2) + (x 2 + 2x)(x 4 + 4) X 
(3x 2 + 6x) + (x 2 + 2x)(x 3 + 3x 2 ) X 
(4x 3 ). 

25. (0, I0V5), (±3, V5). 

27. (2, -2) and (-10, 1). 

29. (a) -6(1 +2x) 2 (4- 5x) s (15x+ 1); 
(b) 10x(x 2 + l) 9 (x 2 - l) l4 (5x 2 + 1); 


(c) 


— 2 x( 2 x 2 - 19) 


; (d) — 3x 6 (3 - 2x) 2 


(16 + x 2 ) 4 
X (4x - 9)(32x 2 - 96x + 63). 

31. (a) y = (x 4 4- l) 3 ; (b) y = 

2 (x 6 + 1 ) 6 . 

33. (a) 3 sin (1 — 3x); 

(b) — 7 x 6 cos (1 - x 7 ); 

(c) sin (cosx) • sin x; 

(d) sin [sin (cos x)] • cos (cos x) • sin x; 

(e) —4 cos 3 x • sin x; 

(f) 90x(l — 3x 2 ) 2 cos 4 (1 — 3x 2 ) 3 • 


sin (1 - 3x 2 ) 3 ; (g) -- : -; 

1 — sin x 

(h) 15 sin 4 3x • cos 3x; (i) -4x 3 sin x 4 ; 

(j) 15 sin x • cos 4 x ■ (1 — cos 5 x) 2 ; 

(k) -3 sec 2 (1 - 3x); 

(l) — 24x 2 tan 3 (1 - 2x 3 ) • 

sec 2 (1 - Zv 3 ); (m) -sin (tan x) • sec 2 x; 
(n) -cos [cos (tan x)] • sin (tan x) • 
sec 2 x; (o) 20 x 4 tan 3 x 5 • sec 2 x 5 . 


77' 

35. 2nir ± —, n an integer. 

39. (a) y —» 0 as x —> ±°o; 

(c) 2 x sin -— cos —; (d) 0 . 
x x 


41. (a) 


6 x+ 5 


; (b) ■ 


3(3x 2 + 5x - 1) 2/3 ’ v ' 5x 3/5 


x 1/2 

(C) 1, -i; (d) - (8 _ jr 3/2 ) i/3- 

43. (a) y = lOx — 19; (b) x - 4y + 
9 = 0; (c) 12x — 13 y = 11; (d) y = 
— 2x - 15. 

47. (a) -2(1 + 3x) _5/3 ; 

(b) 

(d) fx 3/2 ; (e) -i *" 3 ' 2 + |x“ 5/2 ; 

(f) 20(x 2 + 1 )(x 2 + 4) 1/2 . 

53. (a) —20 cos x + x sin x; 

(b) -3 62 sin 3x. 


27. 
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CHAPTER 4 
Section 4.1, p. 119 











23. (0, 3), (t r, -1), (2 t r, 3) and 

(1 3 X /4 3 v 

G 77 * 2)* G 77 * 2)* 


27. max = fV3” at x = min = 

—|V3 at * = —- 7 -. 

0 

29. 125. 

Section 4.2, p. 122 
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Section 4.3, p. 129 


13. 


17. (2,0). 19. <2=3. 

3. (a) a > 0 ; (b) a < 0. 



(c) 



27. (a) PI x = 77, CU 0 < x < 77, 

CD 77 < x < 277; (b) PI x = 

CU 0 < x<-j, CD -2- < 

4 4 4 4 

377 377 577 577 

x< — cu x T’ cd X 

< x < —~r, CU —y- < x < 2it; 

4 4 

(c) PI x = 77 , CD 0 < x < 77 , CU 77 < 

X < 277. 


7. alb . 


9. $8.50. 


13 

17 

21 

25 

29 


4, 4. 

4 by 8 in. 
24 in. 


11 

15 

19 

23 

27 


2 p.m.; 30 mi. 
108. 

1. 

V3. 

1. 


(a 2/3 + b 2 / 3 )3/2 (convince yourself 
that this and the preceding problem are 
essentially the same). 

A _ 

8 

33. \(B + H ) 2 . 


3 1 • ^min 


277 


Section 4.4, p. 137 

I. 1. 

5. 4 by 4 in. 

9. (1, 1). 

II . (a) 1 mi; (b) 1 
(c) 8 min longer. 

13. 15/VTo mi/h. 
19. 1. 

23. * = y V2 a. 

27. (2, 4). 


f R- 
2- 


h and 44 min; 


21 . a = 2 . 

25. (a) 0; (b) 1. 

29. A max = }V3. 

31. The spider should walk straight to 
the midpoint of a side not containing S 
or B, then straight on to B. 

Section 4.5, p. 142 

1. (a) 12077 ft 2 /s; (b) 240 t 7 ft 2 /s. 

3. 2/77 ft/min. 

5. 4 ft/s at each of the stated 
moments. 

7. 3 ft/s. 

13. 52 mi/h. 

15. 7 lb/in 2 per min. 

1 


9. 4 7 ft/s. 


17 


19. 


(a) — in/min; (b) 
77 

■j— in/s. 

577 


1 

2^277 


in/min. 


21 . 2/(V2- 1 ) = 7.69 h. 

23. 40V2 ft/min. 

Section 4.6, p. 146 

3. 0.618034. 5. 2.154435. 

7. 1.305407 ft. 

II. 0.918247 and 2.863580. 

13. (a) 1.236123; (b) 0.876726. 

Section 4.7, p. 155 

1. $43/unit; $43.03. 

3. $80/unit; $80.14. 

5. $193.89; * = 89. 

7. $0.85; * = 232. 

9. $0.31; x = 93. 11.200. 


15. 345. 
2 yx; (b) $55. 


5000. 

17. (a) p = 110 
19. $160. 21. $16 

$573.33; x = 20. 

636. 


23. 

25. 



5. 

( K3 y 

-2 

Y i 

-? ; 

1 t 

1 / 

1 / 

1 

1 / 

v 1 y 

(-2.-24) 




9. 
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19 . 

2 

5 








— i 1 



37. (a) Point of inflection at a: = 1 ; 

(b) points of inflection at x = 1 , 2 ; 

(c) points of inflection at x = - 2 , 0 , 1 , 
2, 3. 

39. a = -3. 41. yV3. 

43. x = 2l,y = 35. 

45. 18 = 16 + 2. 47. 5, 5, 5. 

51. 7 V 3 a, \b. 59. 2 in. 

63. 4000 knives at a price of $18 
apiece. 

65. 20 days. 

67. (a) 120 ft; (b) 312 ft. 

69. i 71. V2. 

73. tt/4. 75. 4. 

77. 4/tt. 


79. A square with side \{a + - 

Vg * 2 - afc + b 2 ). 

81. V2. 

83. 3 by 6 by 12 inches. 
bs 

85. a - . m, if this number 

is positive. 

87. a: 2 + y 2 = 32. 

89. (3,3). 

91. (5, 0) and (-5,0). 

99. (a) 12 ft/s; (b) 3 ft/s. 

101. l/ 77 ft/min. 103. At least 9 ft. 


109. 0.32 lb/min. 

111. 14477 m 3 /min. 

115. Decreasing 1 in 2 * * * /min. 

117. When t = R ° V ^ - roV ^ • 
a\^a - b\b 

119. (a) 3.316625; (b) 1.903778; 
(c) 2.087798. 

123. 1.856636 in approximately. 
125. $42. 127. 30. 

CHAPTER 5 

Section 5.2, p. 169 

1. (63a: 8 - 15a: 4 ) dx. 

(2x - 3a: 3 ) dx 

' vr^ • 

^ (2 - x) dx 
Vhc - X 2 ' 

7. (2 * _1/3 + 2x~ 415 - 17) dx. 
g (15a: 2 + 8a:) dx 
2V3x + 2 


— 15(3m 2 - 2u + 1)a: 2 (a: 3 + 2 ) 4 

(m 2 - m ) 2 

13. AA = 277r Ar + 77 Ar 2 and dA = 

2 / nrdr = 277T Ar, since dr = A/: 

Imagine that the thin circular ring is 

cut across and unrolled—with slight 
distortions—into a long thin rectangle 
with length 2vr and width Ar. 

15. 12, 12.048064. 

17. 16.167, 16.166236. 

19. 26.75,26.749612. 

21. 6.019, 6.018462. 
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23. 0.849, 0.848048. 

25. 1.037 mi. 27. 125.66 ft. 


Section 5.3, p. 177 

1 . {x 2 + X + c. 

3. yx 3 + yx 4 E {a 5 + c. 

5. 2 Vx + c. 7. *x 7/4 + c. 

9. y x 223 -E c. 

11 . f x 212 — 2 Va + c. 

13. 6 Vx + yx 3/2 + c. 

15. jx 3 + \x b + c. 

17 . lx + jx 2 + c. 

19 jx 6 + 3a 2 — 5a + c. 

21. —2x -2 + c. 

23. fx 3/2 — 4x 7/2 — 3x 1 + c. 

25. 9x l/3 - 16x 1/4 + c. 

27. 2Vx - 4x l0/3 + c. 

29. a 4 — 4a 2 + 17x + c. 

31. ^a 10/3 + fx m + 3a 473 + c. 

33. |x 3 ' 2 -fx 7/2 + |fx ' 1/2 + c. 

35. 4 a S00 + c. 

37. 4 (3 + 4a) 3/2 + c. 

39. — j(2x- 3)“' + c. 

41. -yV5 - 4 a 2 + c. 

43. f(l + Vx ) 5/4 + c. 

45. 4(1 + * 2 ) 3/2 + c. 

47. i(7 - A )" 6 + c. 

49. —y(2 — x 2 ) 3/2 + c. 

51. 2 Vx 3 - 5 + c. 

53. no(IOx + 10 ) 11 +c. 

55. 4<(3t 2 + 4) 5/2 + c. 

57. 4V3a 3 — a + 2 + c. 

59. {a 15 + c. 61. y = a 3 + 2. 

63. (a) y sin 2a + c; (b) —y cos 5a + 
c; (c) 2 sin 2x — 3 cos 5x + c; 

(d) —y cos 2x + j sin 5x + c. 

65. (b) yx — y sin 2x + c. 

67. (a) y sin 5 x + c; (b) -7 cos 6 x + 
c; (c) sin (sin x) + c. 

69. (a) y sin 2 x + c; (b) —y cos 2 x + c. 
They differ by a constant. 


Section 5.4, p. 181 

1. y = 2x 3 + 2x 2 — 5x + c. 

3. y = 6 x 4 + 6 x 3 — 4x 2 + 3jc + c. 
5. 3y 2 — 4y 3/2 = 3x 2 + 4x 3/2 + c. 

_ 1 1 , 

7 x 2 


9. v = 


y = ■ 


2x 2 — 31 
33 - 2x 2 ' 


13. 3 Vy = xVx - 3. 

Section 5.5, p. 187 

I. 8 s; velocity = -128 ft/s and 
speed = 128 ft/s. 

3. v= -32 t + 128; 7 = — 16r 2 + 
128f. 

5. loVlO s. 7. 40 ft/s. 

9. 96 ft/s. 

II. i/ 0 2 /64 ft; 96 ft/s. 

Additional Problems, p. 188 

I. fx 5 — yX 4 + lOx + C. 

3. yx 3 — \x 2 + x — 4 Vi + c. 

5. yX 4 + fx 3 + yX 2 + C. 

7. 1 lx 3 - 27x 4 T c. 

9 . 6x — fx 3/2 ~ 7X 2 + c. 

II. —1(2 — 3x) 3/2 + c. 

13. y^(5x + 2) 165 + c. 

15. 5Vl + x 2 T c. 

17. }V2x 3 - 1 T c. 

19. f(x 2 — 2x + 3) 2/3 4- c. 

21 . -jV2 -x 2 + c. 

21 ~i( ,+ jf +c - 

25. f(x + 1 ) 7/3 + c. 

27. |(1 4- x) 7/3 — y(l 4- x) 4/3 4- c. 
29. ff(x 3 4- x 4- 32) 11/2 4- c. 

31. yg(jt 3 - 1 ) 4/3 (4x 3 + 3 ) + c. 

31 <»»- J¥v-- 

(b) 3Vy - 4 = (a - 1) 3/2 - 2 . 

35. a 2 — _y 2 = c - 

39. (a) 25 s, 1200 ft/s; (b) 25 V2 = 
35 s, 800V2 = 1120 ft/s. 

41. 30 m/s. 

43. (a) 44 ft; (b) 680 ft. 

45. About 1.86 mi. About 0.36 in. 
47. About 1 79,427 mi/s. 


CHAPTER 6 


Section 6.3, p. 196 

1. (a) 55; (b) 62; (c) 206; (d) 0; (e) 0; 
(f) 1500; (g) 7. 

5. (a) (b) -—————————————— * 


(c) 


2 

0 n — 1 )n 


Section 6.6, p. 212 

1. 9. 3. f. 

5. 12. 7. f. 


9. 48V2/5. II. f- 


13. 

21 

2 • 

15. 

26 

3 ’ 

17. 

33. 

19. 

26 

3 • 

21 . 

4 

9 • 

23. 

2 

3 ■ 

25. 

1. 

27. 

13 

3 • 

29. 

5 -2 

48<3 • 

31. 

1 

6- 

33. 

« 4 /4. 

35. 

b 2 / 6 . 

37. 

1 

30- 

39. 

i 

2- 

41. 

3. 




Section 6.7, p. 216 

1. (a) 7; (b) f; (c) 19; (d) 7 
3. y. 13. na 2 /2. 


Additional Problems, p. 217 

7. (a)f;(b)f;(c)12;(d)5V5/3; 
(e) la- 

9. (a)4(2V2- 1); (b) f; (c) 3; (d) 
4a 3 „, 2a 


II. (a) 


(c) 


1 +x 4 
3a 2 


; (b) 


; (d) 


1 + A 2 ’ 

5x‘ J 


V3a 3 + 7 ’ Vl + x 10 ' 


CHAPTER 7 


Section 7.2, p. 224 

i 1 


I- f- 

3. f. 

5. f. 

7. tjV2. 

9. 4. 

11. 36. 

13. f. 

15. 8. 

17. -fA/2. 

19. 1?. 

21. f. 

23. (a) 7(b) 

25. 

27. 2(V/> - 

1) —> 00 as Z? —> 00 . 


29. (a) 2(V2 - 1); (b) y(3V3 - 
Vl - 3); (c) + y- V2 - 2 . 

33. 4V ^~ 5 a 2 = (0.21 895 )a 2 . 

Section 7.3, p. 229 

I. (a) 87 t; (b) 16 tt/ 15; (c) 37 t/ 5 ; 

(d) 2 tt/ 3; (e) 8 tt/ 3; (f) 16 to 3 /105. 

5. 47t. 7. y a 3 . 

9 . f« 3 . 11 . 2ir 2 a 2 b. 

13. lV3a 3 . 15. 9V2/2. 

17. fa 3 . 19. 577 A 2 in 3 . 

CH 

21. (a) The volumes are A(x) dx 

CH 

and J o B(x) dx , which are equal 
because A(x) = B{x) for every x. 
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23. \TTa 2 h. 25. y — ax 4 . 

Section 7.4, p. 235 

3. 128tt/5. 5. 486tt/5. 

7. 2-nib-a). 9. (2 - V2). 

11. (a) 8 tt; (b) 256 tt/15. 

13. rr/i 3 /6. 15. 877 / 9 . 

17. By a factor of 1.71, 
approximately. 

Section 7.5, p. 240 

i. jf (loVIo - i). 

^ 53 g 118 

A 6 ■ J . 9 . 

7. 12. 

Section 7.6, p. 244 

1. 253tt/20. 3. 1277. 

5. ^-(ioVTo- i). 

7. -y- (2V2 - 1 )p 2 . 


2 

13. (a) v = — *; (b) area = 4ir 2 ab. 

7T 

Section 7.7, p. 249 

I. 64 ft-lb. 3. 6000 ft-lb. 

5. 550 ft-lb. 7. 50,000 ft-lb. 

II. (b) 250 ft-lb. 13. 5 ft-lb. 

15. GMm/2a. 17. mgRh/(R + h ). 

19. 24077W’ ft-lb. 21 . j7Ta 3 w(h + a). 
23. About 118,500. 

25. m\ = \ni2- 


9, ^ 77 a 2 . 


11 . 0 


Section 7.8, p. 254 

I. 150 tons. 3. 125 tons. 

5. | ton. 7. | tons. 

9. 3007rlb. 

II . 10 f 6 tons; 11 ft. 

13. 300V2w or approximately 
13.2 tons. 

Additional Problems, p. 254 

1 l ^ 128 

I . 6 . .V i5 . 

5. 36. 7. 

9. 18. 11. f. 

13. 64. 15. f. 

17. 

19. (a) 5677 / 15 ; (b) 56 tt/15; (c) 32t7/3; 

(d) 4 877 / 5 ; (e) tto 3 /15. 

21. (a) 51277/15; (b) 128 tt/3. 

23. f a\ 25. \7r(b 5 - a 5 ). 


27. (a) 277tf 3 ; (b) 577 a 3 ; (c) { 5 - 77 a 


29. a 2 /*. 

33. 12877/3. 
37. 13577/2. 
43. t- 


51. AB = 


31. 

35. 


Wh. 

1177/15. 


168 77. 

7 ft-lb. 

945 • 2 13 ft-lb. 

67 D 

324 a B- 
21 


39. 277. 

45. 

49. 

53. 9077. 

57. 7Ta 2 H. 

61. 2325 ft-lb. 


il 
3 ■ 
495 
8 • 


55. 

59. 

63. 

65. 

67. yy c, where c is the constant of 
proportionality. 

69. 12577W/6. 


73. 10 tons. 

77. 187.5 tons. 
81. 40961b. 


71. f ’ 8 3 7tw ft-lb. 
75. 1 ton. 

79. 60 tons. 


CHAPTER 8 

Section 8.2, p. 263 

1. (a) log 4 16 = 2; (b) log 3 81 = 4; 

(c) logs, 9 = 0.5; (d) log 32 16 = f. 
3. (a) 4; (b) 6; (c) -4; (d) f 

5. (a) a = 32; (b) a = yy; (c) a = 6; 

(d) a = 49. 

9. (a) 7; (b) acidic pH < 7, basic 
pH > 7. 

Section 8.3, p. 269 

1 l (e x _ e -xy 3 ( X 2 + 2*)^. 

5. 7. Jce ax . 

9. 4x 2 e 2 T 1 1. ye 3 * + c. 

13. 5e* /5 + c. 15. 2e x 3 4- c. 

17. (a) Max. pt. (0, 1), no min. pt.. 


pts. of infl. 




max. pt., min. pt. 


-3. 

e 


(b) no 


pt. of 


infl. 


-6- -pr 

b _ „-b 




19. f(e 

23. Area =1 — e~ b — » 1 as /? —> °°. 
25. (a) e; (b) e; (c) e; (d) e 2 ; (e) Ve. 
29. 8 %. 


Section 8.4, p. 276 

I. (a) 2; (b) 3; (c) 1/x; (d) l/x; (e) -x; 
(f) 1/*; (g) x; (h) 3x; (i) 0; (j) y; (k) y; 
( 1 ) 0 ; (m) x 3 }’ 2 ; (n) 


(o) 2 e 3 ; (p) x 2 e ■* 


3. (a) 


y( 


+ 2 x) y(l + xy) 

x(3y- 1)’ x(l -xyY 


5. (a) y In (3x + 1) 4- c; 


(b) y In (3jc 2 + 2) + c; (c) fx 2 + 

2 In x + c\ (d) x + In x + c\ 

(e) x - In (x + 1) + c; 

(f) y In (x 2 + 1) + c; 

(g) In (3 - 2x 2 ) + c; 

(h) In x(x — I) + c; (i) y(ln x) 2 + c; 
(j) In (In x) + c; (k) 2 In (Vx + I) + 
c; (1) In (e* + e~ x ) + c. 

9. No max., no pt. of infl., min. 

(3, 9 — 18 In 3) = (3, -11). 


11. 77 In 4. 
17. I/Ve. 


15. min. 




19. (a) y 1 + 


lb)£ 


2 x 


6x ■ 


2x 


1 


5 \x 2 + 3 x + 5/’ 


21 . (a) x x *x x 

1 - 


(b) 'Vx 


\_ 

X 

In x 


+ (In ;t)(l + In x) 
. Max = Ve. 


Section 8.5, p. 282 

1. (a) 2520; (b) 13.8. 

3. 20.1. 5. 95.8 percent. 

7. When t — 6 . Can you solve this 
problem without calculation, by 
merely thinking about it? 

9. jc = 10(f)'. 11.2 more hours. 

13. (a) About 3330 years (1380 B.C.); 

(b) about 3850 years (1900 B.C.); 

(c) about 10,510 years; (d) about 7010 
years. 

15. jc->A if A < B\ x^B if A > B. 


Section 8.6, p. 287 

3 = *o*i 

'' X *o + (*i - xo)e ~ cx 4 ' 

5. j = —(1 — e~ ct ). 
c 

1. When v< 1, the resisting force in 
the second case becomes very small. 
9. About 53.4 lb. 


Additional Problems, p. 288 

1. ~xe^ x ~ x2 l Vl - x 2 . 

3. (2x-2)e x2 ~ lx+[ . 

c e^ x , 1 \/~~x 

5. —- + —\e x . 

2 \Tx 2 


7. 


_> -3.r 


+ C. 


II. 2\/e x + 1 + c. 
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ANSWERS TO ODD-NUMBERED PROBLEMS 


13. ( 1 /a, e). 17. jie 6 - 1). 


23. (a) 


y(3x + 1 ) 
42T 2 - 1) 


; (b) 


y(24 + 1 ) 
41 - 3y) ' 


25. (a) y In (1 + 2x) + c\ (b) -y In 
(1 -34 + c; (c)y In 2; (d) y In 10; 

(e) In 3; (f) 2VIn* + c; (g) y In 7; 

(h) -i In (1 -x 2 ) + c; (i) y (In 3) 2 ; 

(j) i In (34 - 3* + 7) + c; (k) In 
(e* + 1 ) + c; ( 1 ) In (x + 1)4 + 2 ) + c; 
(m) y (In 4 3 + c; (n) 7 (In x) 2 + c; 

(o) y- [In (In a :)] 2 + c; (p) -y(lnjr ) 2 + c. 

29. ^- + t In 2; a = 

2 a 4 

31. (a) (In 10)10*; (b) (In 3)3*; 



(c) (In 7T)7T*; (d) (3 In 7)7 3 *; 
(e) (In 6)(2x - 2 ) 6 * 2 " 2 *; 


(f) 


\TsTxr 


33. Max. at x = -——; pts. of infl. at 
In 5 

2 ± V2 

X In 5 ' 

35. (a) (In x) x 7 ^- 
In * 

fln Y \lnx 

(b) (2 In 4 a" 1 *" 1 ; (c) u - X 


+ In (In 4 


4 ^ 


[1 +ln (In 41; (d) 

( e ) ^2/3 0 3 ■*)• 


(1 + 7 In x); 


37. In the year 3524, approximately. 
39. In 4 more hours. 

41. 73.12°F. 

47. 17 more days. 

49. v 2 = — (1 - e~ 2cs )\ v—> J— as 

C V c 

S —> °o. 

51 . When r = 4.86 min; when t = 
21.50 min. 

53. About 1.39 h. 


CHAPTER 9 

Section 9.1, p. 299 

1 . (a) tt/ 12; (b) 7 tt/ 12; (c) 2 tt/3; 

(d) 5 tt/ 12; (e) 5 77 / 6 ; (f) 377 / 4 ; 

(g) 577 / 4 ; (h) 777 / 6 ; (i) 7 t 7 / 2 ; (j) 5w. 
3. 0 = 2 radians. 

5. A = 25 cot {0. 

I. H = L tan 6. 

II . sin 36 = 3 sin 6 - 4 sin 3 6 , 
cos 3 6 = 4 cos 3 6 — 3 cos 6. 


13. sin 40 = (4 sin 6 - 8 sin 3 6) cos 0. 
15. (a) t(V 6 - V 2 ); 

(b) 7 V 2 — V3. Show that these 
numbers are equal. 

17. (a) 0, 2 t 7 ; (b) 77 / 2 , 377 / 2 ; (c) 77 . 


'77 , 27771 '77 , 27771 

T + — or f2 + — 


19. (a) 6 

for all integers w; (b) 0 = (1 + 2 /i) 57 r 
for all integers n\ (c) 0 : 


1 IT 27771 

11 *7 30 5 

1 1 77 27771 

or H-— tor all integers n. 

35. For a proof by geometry, use the 
fact that the vertex opposite the fixed 
side must lie on a circle of which this 
side is a chord. 

37. |V2 ft. 


Section 9.2, p. 304 

I . 3 cos (3* - 2). 

3. 48 cos 16*. 5. 2x cos x 2 . 

7. 15(cos 3x — sin 5*). 

9. x cos x + sin x. 

II . } sin 12 x. 13. cos 5 x. 

15. 3e 2x cos 3x + 2e 2 * sin 3*. 

17. -tan x. 21. 45°. 

23. 5. 

31. 77 / 3 , tangent = 3. 

33. 1. 35. 1 . 

37. 7. 39. 1. 

41. 1. 43. -1. 


Section 9.3, p. 309 

I. —7 cos 5* + c. 

3. 7 cos (1 - 9x) + c. 

5. sin 2 x + c or -cos 2 * + c or 
-7 cos 2 x + c. 

7. 7 sin 4 2x + c. 9. 7 sin 8 \x + c. 

II. -2 cos Vx -I- c. 

13. 7 sin (sin 2x) + c. 



17. -In (cos x) + c. 

19. sin (x 2 + x) + c. 

21 . f. 23. V2 - 1. 

25. f. 27. 3. 

29. yir 2 . 


II. sec 2 x e tm *. 

13. — j cot 6x + c. 

15. —y cot 2 x + c. 

17. y tan 5 x + c. 

19. —y esc lx + c. 

21. y. 23. y(l7 — 2). 

25. 2. 27. 4V3;no. 

29. 3.277 mi/s. 31. (c) 66°. 

Section 9.5, p. 318 

I. yV3, -yV3, — V3, fV3, -2. 

3. (a) it; (b) tt/2; (c) 0.123; (d) 0.8; 
(e) 0.96; (f) 77/7; (g) 77 / 6 ; (h) 77 / 4 . 


5. 1/(25+4). 



7. 1 

VjcU + 1) 

9. sin 1 x. 


11. (sin -1 *) 2 . 

1 n ^ 


5 + 3 cos *’ 

15. 77 / 6 . 

17. 7 sin 1 

2x + c. 

19. 77 / 8 . 

21 . 7 sin -1 

Jx + c. 

23. 7 tan 1 f jc - 1 - c. 


25. -77/12. 

27. 77/4. 



29. (a) sin 1 7 ; (b) 7 rad/s. 

31. The formula is invalid, because 
the integrand 1 /V 1 — x 2 is 
discontinuous at the point x = 1 in the 
interval of integration. 

33. 4 77 2 £7 2 . 


Section 9.6, p. 323 

1. (a) x = 5V2 sin ~ 
A = 5V2, T= 2tt; (b) * = 
2 sin ^3f + yj, A = 2, T = 

(c) x = V2 sin fr + A 


277 
3 ; 

:V2, 


r= 277 ; (d) x = 4 sin |2r - — 

A = 4, T = 77. 

„ 4 VT45 . ^ 77 

3. A = —— in; T = - 

5. T= 277./— = 89 min. 

V £ 

7. About 39 in. 


Section 9.4, p. 311 

1. 8x sec 2 4x 2 . 

3. 2 tan (sin x) • sec 2 (sin x) • cos ;c. 
5. 0. 

7. 24 esc (-6x) cot (-6x). 

9. -Vcsc 2x cot 2x. 


Section 9.7, p. 329 

I. (a) !; (b) f; (c) ft 
13. 3x 2 cosh j: 3 . 

15. 6 csch 6x 17. 0. 
19. y cosh (5x - 3) + c. 

21. 2V2 sinh \x + c. 
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23. x - tanh x + c. 

29. 4r sinh 1. 
a z 

Additional Problems, p. 330 

1. -9 cos (1 - 9x). 

3. —2 sin x cos x = —sin 2x. 

5. —10 sin 5x cos 5x = —5 sin lOx. 
7. —6 sin 6x. 

9. —x 2 sin x + 2x cos x. 

1 1 . x cos x. 

13. (sin x)[sin (cos x)]. 

15. — (cos x)[sin (sin x)]. 

17. cos x. 25. 0. 

27. 1. 29. 2. 

31. I 33. 2. 

35. 77/4. 37. y sin 3x + c. 

39. -2 sin (1 - yx) + c. 

41. jg sin 6 3x + c. 

43. -f cos 3x + y cos 3 3x + c. 

45. y sin x 3 + c. 

47. y cos (cos 2x) + c. 

49. -{esc4x + c. 

51 • — -7 + c - 

2(3 + 2 cos x) 

53. -fV7 - sin 5x + c. 

55. j. 57. i 

59. 2V2. 61. 2177. 

67. 12 sec 2 3a. 69. — l ~ sc2 — . 

Vcot 2x 

71.4 sec 2 x tan a:. 

73. —10 cot 5* esc 2 5x. 

1 1 9 1 

75. tan-sec z —. 

XX X 

\/sec Vx + tan Vx 
4Vx 

79. sec 2 x sec 2 (tan x). 

81. -3 esc yx + c. 

83. -y cot 3x + c. 


107. — tan 1 V5x 4- c. 

V5 

109. y sin " 1 fx + c. 

111 . { tan -1 x 4 + c. 

113. 36 ft from the point on the road 
closest to the billboard. 

115. ^ rad/s. 

117. T= 2t7/V2^ = 0.56 s. 

119. A = 5,/= 1/tt. 


CHAPTER 10 

Add a constant of integration to the 
answer for each indefinite integral in 
this chapter. 

Section 10.2, p. 339 

1. —y(3 - 2x) 3/2 . 

3. y In [1 + (In x) 2 ]. 

5. —y cos 2x. 

7. y In [sin (3x - 1)]. 


9. i(A 2 + 1) 3/2 . 

11 . ye 5 *. 

13. -y cot (3x + 2 ). 

15. 2. 

17. 2Vl - cos x. 

19. £? tan_1 *. 

21. y sec 5x. 

23. y(ln x) 2 . 

25. In 2. 

27. sin -1 

29. y sin 3 x. 

31. In (1 + e x ). 

33. -y In (cos 3x). 

35. 4Va 2 + 1 . 

37. tan -1 e x . 

39. \{e x + l) 7 . 

41. y tan 5x. 

43. — j esc 2a. 

45. 4(V2 - 1 ). 


h/. 3 . 

49. (a) n = 3, }e x4 ; (b) n = 2, j sin x 3 ; 
(c) n = - 1 , y(ln x) 2 ; (d) n = -y, 

2 tan Vx. 


Section 10.3, p. 344 

1 . yx - { sin 2 x. 

3. ^x + { sin 2x + sin 4x — 


85. -{ esc 4 x + c. 

87. —{ cot 4 x + c. 

89. 477/3. 91. 300 km/h. 

93. (a) - 77 / 3 ; (b) 77 / 3 ; (c) - 77 ; 
(d) 0.7; (e) 0.7; (f) - 1 ; (g) tt/3. 


95. 

1 

97. 

A 4 

V25 - a 2 

1 + A 10 ' 

99. 

1 

101 . 

1 +A 

xVx 2 - 1 

1 + A 2 ' 


Vx 2 "-^ 


105. tt/2. 


yy sin 3 2 x. 

5. —y cos 3 x + y cos 5 x. 

7. sin x — y sin 3 x. 

9. y sin 372 x - y sin 772 x. 

11 . yx — sin 12 x. 

13. i 

15. y sec 7 x — f sec 5 x + y sec 3 x. 

17. —cot x — x. 19. —{ cot 4x. 

21. -y cot 2x — y CSC 2x. 

23. y sin 3x. 

25. (a) tan x — x, y tan 3 x - tan x + x, 
y tan 5 x — y tan 3 x + tan x - x; 


(b) y tan 2 x + In (cos x), { tan 4 x — 
y tan 2 x — In (cos x), y tan 6 x — 

{ tan 4 x + y tan 2 x + In (cos x). 

27. (a) 77 2 /2; (b) 77 ; (c) (4 t 7 - 77 2 )/8; 
(d) 3 77 2 /l 6. 

29. y[sec x tan x + In (sec x + tan x)]. 


Section 10.4, p. 348 

1 . —sin 


! X Vfl 2 ~ X 2 


a 


o 1 „i x , x 

3. —r tan 1 —I- _ , , —y-. 

2 a 3 a 2 a\a 2 + x 2 ) 


5. -jV9 - x 2 (x 2 + 18). 


7. - In 
a 


+ Vtf 2 -I- x 2 / 


9. In (x + Vx 2 — a 2 ). 

11. {xVtf 2 + X 2 + 


y a 2 In (x + V<z 2 + x 2 ). 

1 1 a + x 

13. — In-. 

2a a — x 


15. \Ja 2 + x 2 - 

.ln( a + V f T ^ ). 

17. In (a: + Va 2 - a 2 ) - ■ - ■ — 

X 

19. y- [a 4 sin -1 — + 
o a 

Va 2 — x 2 ( 2 a 3 - a 2 x)]. 

21. — V4 — x 2 . 23. — tan -1 —. 

a a 

25. -{(9 - x 2 ) 372 . 

27. V9 + x 2 . 31. 2iz 2 ba 2 . 

33. 3 - V2 + In (1 + yV 2 ). 


Section 10.5, p. 350 

I. sin -1 (x - 1). 3. tan -1 (x + 2). 

5. —V2x — x 2 + 2 sin -1 (x - 1). 

7. f sin” 1 - 6V6jc - a 2 - 

j(a — 3) V 6a — a 2 . 

9. j In (a 2 + 2a + 5) + 

3tan - 1 (£±i). 

II . In (a - 1 + Va 2 - 2 a - 8 ). 

13. £ In (2a + 1 + V4a 2 + 4a + 17). 


4Va 2 - 2a - 3 


103. 


X 
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Section 10.6, p. 356 

1. (a) * + 1 + 


x - V 

iv2 1 


yjc 2 + X + 


In (jc — 1); (b) jx 2 — f .jc + yy — 


3 1 2 _i_ 4 

~ ~X Z + 


3x + 2 ’ 9 " 

& (3-* + 2); (c) x - 


1 2 1 
— — — 


In (jc 2 + 1); (d) 1 + 


jc + 2’ 


jc 2 + T 


x + In (jc + 2); (e) 1 — 
x — 2 tan -1 x. 

3. 3 In (jc — 3) + 4 In (jc + 2). 
5. 5 In (jc — 7) — 3 In x. 

7. 2 In jc — 4 In (jc + 8) + 

3 In (jc ~ 3). 

9. 3 In x + 2 In (jc + 13) — 

In (jc - 3). 

2 


I. —In (jc + 1) — 


x + 


- 3 In x. 


13. 2 In x + y In (jc 2 + 2x + 2) - 
6 tan -1 (jc + 1). 


5 


15. x + y In (x — 1) - ^ _ 2 
f In (jc 2 + 1) + 2 tan" 1 x 
17. jx 2 — 2jc + 4 In (jc + 2). 

1 9. yjc 2 - 2 jc + 5 In (x + 2). 

1 / sin 0—1 

21. — In . ^ . 

5 \ sin 0 + 4 


„ 1 . (e x -2 

23 - 4 ln tn r 


29. jc = 


*0 


Xo + (1 - xo)e 


-kr 



3. jx 2 tan 1 jc — jx + \ tan 1 



1 1 . yjc sin (3jc - 2) + y cos (3jc — 2). 
13. x tan x T- In (cos x). 

15. x In ( a 2 + x 2 ) — lx + 2a tan" 1 —. 
17. y(ln x) 2 . 19. 77 ( 77 - 2 ). 

23. (b) jc(ln x) 5 - 5jc(ln x ) 4 + 

20jc(ln x ) 3 — 60jc(ln x) 2 + 120jc In x — 
12 Ox 

25. 2t 7[V2+ In (Vl + 1)]. 


Section 10.8, p. 368 

1. - V1 - x 2 . 

3. y sin 3 jc — y sin 5 x + 7 sin 7 x. 
5. 2Vl + In jc + 


]n ( Vl + In jc - 1 
Wl + In jc + 1 


7. ~lVx cos Vx + 2 sin Vx. 

9. —cos x. 

1 1 . y(jc 2 — x sin 2x — y cos 2x). 
13. e x - In (e x + 1). 

t( 


15. f(x — 1) 8/3 + f(x - 1) 5/3 + 

!u- d 2/3 . 

17. 2Vx tan -1 Vx — In (1 + x). 
19. 3x+ 11 In (x — 2). 


21. 2 In 


2 - vTT 


VT 


23. -je~ xl (x 3 + 1). 


25. In (x + 3) + 


x + 3' 


1 , (x 2 + 1 

27 ? tan_l hr 


29. In jcVjc 2 — 1 — Vjc 2 — 1 + 
tan" 1 Vx 2 - 1. 


1 x 2 

31. -y cos jc 3 . 33 — In l-^y- 


+ 1 
+ 4 


35. In (jc — 1) — 


x- 1 2(jc — l) 2 ' 


37. y tan 6 jc + y tan 4 jc. 


39. -In (1 + Vl - jc 2 ). 

41. yV(JC — y sin 4jc + y sin 3 2jc). 
43. 2Vx - 1 - 
Vjc - 1 


4 tan 


45. jc In (jc 2 + 3) — 2jc + 

2V3 tan" 1 —. 

V3 


47. (3 sin 3jc + 5 cos 3jc). 

1 / jc 2 — 3 

49. 4- In 1 


8 \jc 2 + 1 
51. j In (x 5 + 5x + 3). 

53. -4 In (Vx + 1 + 2) + 

6 In (Vx + 1 + 3). 

55. — —L tan -1 (V3 cos x). 
V3 


57. —{(x + I)" 4 + j(x + I) -5 - 
l(x + I)” 6 + f(x + I)" 7 . 


59. t tan 5 x 


,5 v - 4 


tan 3 jc + tan jc 


61. —2 In (jc + 2) + 3 In (x + 3). 
63. x In V2x — 1 — yjc — 
i In (2x - 1). 


65. sin" 


• vT 


67. yx ” y|o sin 20x 
69. y[ln (sin x)] 2 . 

1 1 y CSC 3 2jc — CSC 5 2x 

73. jc In (2jc + x 2 ) - 2x + 2 In (x + 2). 

7S l r 4/3 _ A r 11/6 

77. jc In (1 + jc 2 ) — 2jc + 2 tan" 1 jc. 

79. jc tan jc — yjc 2 + In (cos jc). 

81. 7 sec 7 jc. 


83. y(2jc 2 sin' 

jcVl - JC 2 ). 

85. y tan -1 jc' 
89. 2*?^. 


jc - sin 1 jc + 

87. In (tan jc). 
91. 5(ln jc) 2 . 

■ 4 tan 1 (|Vjc 


93. 2Vjc - 2 
95. —cot jc. 

97. y(sin -1 3 jc + 3jcV 1 — 9x 2 ). 
99. In (jc 2 + 5jc + 6). 

1 / r 2 - 3 

101. 4ln' 


2 ). 


8 


x 2 + 


103. \x 2 + 7 In (x — 1) — j In (x 2 + 
1 / 2 x + 1 

l vy 


jc + 1) + - tan 

V3 


105 

107 

109 


113 
1 15 
1 17 

1 19 

121 

123 


y sec 4 jc - y sec 2 JC. 

COS JC — y COS 3 JC. 


I(X~ D : 


+ fu¬ 


ll 2 ' 3 . 


— (ax + b) In (ax + b) - x. 
a 


sin e x . 

|jc 3/2 (3 In jc — 2). 
tan jc — cot x 

x — 


7ln, x + 3/ x + 3' 

(x — 1) In (1 — Vjc) — yjc — Vx. 
y(jc — l) 2 tan -1 (x — 1) — 


y(x - 1) + (x - y) tan -1 (jc - 1) - 
T In [(x - 1) 2 + 1 ]. 

125. y(xV 1 — x 2 - sin -1 x). 

Section 10.9, p. 374 

1. (a) 0.643; (b) 0.656. 

3. 2.2845. 5. 0.881. 

7. 3.14156. 

9. About 23,630 yd 2 . 
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Additional Problems, p. 375 

I. f(3x + 5) 3/2 . 3. In (1 + 3x * 2 ). 

5. —{ sin (1 - 5x). 

7. 2 sec Vi. 9. tan -1 x 2 . 

II. { In (sin 4x). 13. — 1/ln x. 

15. In (tan x). 


• 2 / 3x — 5 

17 C ° S \ 2 
19. -2cscx 3 . 21 . tan " 1 (In x). 


““ 3(3x + 5) ’ 

25. -{ In (3 - 2x). 

27. { sin (1 + x 3 ). 

29. — { cot (x 2 + 1). 

31. tan " 1 (sin x). 

33. 7 In (sin 2x). 35. {(tan -1 x) 2 . 

37. 7 In (2x + 1). 

39. 3e xl3 . 41 . tan (sin x). 

43. { sin " 1 5x. 45. tan -1 (sec x). 

47. {(In x) 3 . 

49. -In (1 + cos x). 

51. -je' 3x 53 . -cos (In x). 


55. esc —. 57. 7 tan 1 e 2x . 

x 

59. {(2 +x 4 ) 372 . 61. In (e x + x). 

63. -4/V?. 65. cot x. 

67. -{ In (cos x 2 ). 

69. 7 In (1 + x 2 ). 

71. je 3x2 ~ 2 . 73. tan x + sec x. 

75. f(l + x 5/3 ) 3/2 . 

77. e tanx . 

79. -{(1 + cos x) 5 . 

81. sin (tan x). 83. n/6. 

85. {• 87. i f. 

89. 7 X — 7 ) sin lOx. 

91. {x + ^ sin 14x. 

93. —7 cos 3 x + 7 cos 5 x - 7 cos 7 x. 
95. { sin 4x - -}7 sin 3 4x. 

97. esc x - 7 esc 3 x. 

99. f sin 8/5 x. 

lOl.y tan5 x + 7 tan 3 x + tan x. 

103. 7 sec 9 x — 7 sec 7 x. 

105. -7 cot 4 x + 7 cot 2 x + In (sin x). 
107. 7 tan 3x - 7 cot 3x - 
7 In (esc 6 x + cot 6 x). 


109. - 7 - sin 1 —— + - 7 - xV3 — x 2 . 
2 V3 2 


x 

111 . x — a tan ” 1 —. 

a 


113. — y{a 2 - x 2 ) 3/2 (3x 2 + 2tf 2 ). 


115. In (x + Va 2 + x 2 ) — 


Va 2 + * 2 


1 17 ' _ 3 a ^ 3 "^ a2 " • x2 ( 2x2 + « 2 )- 


, 1 x X 

119. — tan 1 - —j - 7 -. 

2a a 2 (a 2 + x 2 ) 

\/x 2 — 9 v 

121. 123. 


9x 


Vl - 9jc 2 


,25. 4m P + 7^ ). 


127. - - sin " 1 -. 

Vtf 2 - x 2 a 


129. In (x + Vtf 2 + x 2 ) 


Va 2 + x 2 ' 


131. {xVx 2 - a 2 + 


7 a 2 In (x + Vx 2 - a 2 ). 


133. sin 


135. &V 2 tan -1 


-1 / x + 1 


V2 


137. - sin 


1 — 1 / 3X 1 


V? 


139. f sin " 1 (jc - 1) - 2V2x - a: 2 - 

i(jc — 1)V2 jc - x 2 . 



145. 3Vjc 2 + 4* + 8 + In (x + 2 + 
Vjc 2 + 4x + 8 ). 


4V* 2 + 2x - 3' 


149. 19 In (jc — 4) — 3 In (x + 3). 
151. 3 In (2x + 1) - 5 In (2x - 1). 
153. 5 In x + In (x + 4) — 

3 In (x - 3). 

155. -2 lnx + 3 In (x + 3) - 
3 In (x - 3). 

1 3 

157. 2 In x H- —y - 

x 2x 2 

5 In (x + 1). 

159. -lnx + In (x 2 + 4x + 8) - 



x + 2 


161. {x 3 tan 1 x — {x 2 + 7 In (1 + x 2 ). 
163. {x[cos (In x) + sin (In x)]. 

165. x 3 sin x + 3x 2 cos x - 6 x sin x - 
6 cos x. 


167. — * n X + In x - In (x + 1 ). 
x + 1 

169. {V1 + x 2 (x 2 - 2). 

e ax 

171. 9 , . 7 ( a sin bx — b cos bx). 

a* + b^ 

173. -xe~ x - e~ x . 

175. — {x 3 ^ -2 * — \x 2 e~ 2x — jxe~ 2x — 
le ~ 2x . 

177. 2 7 r. 179. a 1 ' 2 . 

183. (b )^2lM)i_A^L£)i + 

x 6 In x _ X 6 
36 216' 

185. (b) {[sec x tan x + In (sec x + 
tan x)]. 


CHAPTER 11 
Section 11.2, p. 391 

I. liix x(°44 

5. (0, ja). 7. (f. f). 

9 ' (3(4-7r) fl ’ 3(4 - 77 ) a } 

II. (ja,ja) 

4 a 2 + ab + b 2 . 2 

13. -—- — -; this —> — a as 

37 t a + b 7 t 

b -» a. 

15. (a) On the axis, a distance 7 h from 
the center of the base; (b) on the axis, 
a distance \a from the center of the 
base. 


Section 11.3, p. 393 

1 < * ) ( o 44 (b) 



3. (a) y-ira 2 -, (b) 6\/27 ra 2 . 
5. 77 ra 3 ‘, 6 V 37 ra 2 . 

7. (a) 7 rr 2 h\ (b) {7 jr 2 h. 
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ANSWERS TO ODD-NUMBERED PROBLEMS 


Section 11.4, p. 396 

I . \Ma 2 . 3. jMh 2 . 

5. jMa 2 . 7. \Ma 2 . 

9. ±Ma 2 . 

II. (a) yV2a = 0.707a; 

(b) ^\40a = 0.548a; 

(c) iVTOa = 0.632a. 

Additional Problems, p. 396 

3. (a) (i f); (b) (0,1); (c) (1, f); 

(d) (|, b\ (e) (0, t); (f) (if, If); 

( 8)(t^7' 0 )- 

5. Sirabc, 87 r(a + b)c. 

7. | Ma 2 . 


3. (a) m = £ (b) m = 0 . 

5. 16, 34, 34, 14, 2 percent. 

Additional Problems, p. 424 

1 . f. 3 . 44 . 

5 . 12 . 7 . i 


Q 

75* 

11. 

oo. 

13. 6. 

15. 

3. 

17. i 

19. 

oo. 

21. 0. 

23. 

1 

9 1 

25. 0. 

27. 

1 

24- 

29. {. 

31. 

9. 

33. 3. 

35. 

i 

3- 

37. f . 

39. 

1 

3- 

41 — 

16* 




CHAPTER 12 


Section 12.2, p. 403 


1. 3. 

3- it- 

5. i 

7. -f 

9. i 

1 1. -6. 

13. 3. 

15. 4. 

17. 

19. 1/77. 

21. 16. 

23. i 

25. 6. 


27. f(8) = ja 

; 2 (sin 6 - sin 6 cos 0), 

g(e) = fa 2 (0 

- sin 0 cos 0); limit = 

Section 12.3, 

p. 408 

1. -3. 

3. 1. 

5. 3. 

7. 0. 

9. 2. 

11. 1. 

13. 0. 

15. 0. 

17. 0. 

19. 2. 

21. i 

23. 1. 

25. 1. 

27. 1. 

29. 1. 

31. 1. 

33. 

35. 1. 

37. 1. 

39. 1 Ne. 

41. eP. 


Section 12,4, 

p. 413 

1. \/(2e 6 ). 

3. f. 

5. 1 — cos 1. 

7. 1. 

9. 0. 

11. lnV3. 


13. V2(ln 4 — 4). 

15 . 1 . 

17. Converges if p < 1, diverges if 
P ^ 1. 

19. (a) 7 t/ 5 ; (b) 7 r. 

Section 12.5, p. 423 

1 . k = 77 , k = —, no k. 

2 7r 


43. 0. No; instead, it emphasizes the 
logical point that L’Hospital’s rule 
makes a definite statement only when 
the limit on the right exists. 


49. 

0 . 

51. 

0 . 

53. 

0 . 

55. 

0 . 

57. 

0 . 

59. 

0 . 

61. 

P- 

63. 

i 

3- 

65. 

0 . 

67. 

0 . 

69. 

i 

6- 

71 

1 . 

73. 

1 

75. 

1 . 

77. 

1 . 

79. 

1 . 

81. 

1 . 

83. 

— oo. 

85. 

1 . 

87. 

1. 

89. 

1 . 

9L 

1 . 

93. 

e 2 . 

95. 

e 4 . 

97. 

e 3 . 

99. 

1 . 

101. 

. l/(3e 6 ). 

103 

. 1 . 

105. 

i 

■ 2- 

107 

. 7T/4. 

109 

. tt/8. 

1 1 1 

1 

• 3- 

1 13 

. 2. 

1 15 

. Diverges. 

1 17 

. Diverges. 

1 19 

. 3. 


CHAPTER 13 

Section 13.2, p. 437 

1. (a) D; (b) C, 0; (c) C, 0; (d) C, 0; 

(e) D; (f) C, {; (g) C, 0; (h) C, 

(i) C, 0; (j) D; (k) C, 0; (1)C,0; 

(m) D; (n) C, 0; (o) C, it; (p) C, 

5. (a) a/2; (b) 4a 3 . 

9. A decreasing sequence of positive 
numbers converges. 

Section 13.3, p. 444 

5. (a) C; (b) D; (c) D; (d) C; (e) D; 

(f) C; (g) D; (h) D. 

9. 40 mi. 


11. All terms must be zero from some 
point on. 

13. (a) 0.6000 ... ; (b) 1.666 . . . ; 

(c) 1.08000. ..; (d) 1.125000...; 

(e) 1.0384615384615 .... 

Section 13.4, p. 449 

3. (a) W < 1, ax/(\ -x 2 ); (b) \x\>l, 
\l(x - 1); (c) |l + x\ > 1, 1 + x (also, 
if x = 0 the sum is 0); (d) e~ l < x < e, 
In x/(\ - In x). 

5. x<0. 

Section 13.5, p. 454 

I. (a) D; (b) C; (c) C; (d) C; (e) C; 

(f) D; (g) C; (h) C. 

3. D. 5. D. 

7. D. 9. D. 

II. D. 13. C. 

15. C. 17. D. 

19. C. 

21. C if p > 1, Dif p< 1. 

23. C. 

25. Cifp> l,Difp< 1. 

29. In 2. 

Section 13.6, p. 460 

1. C. 3. D. 

5. C. 7. C. 


Section 13.7, p. 464 

1. C. 3. D. 

5. C. 7. C. 

9. D. 11. C. 

13. D. 17. D. 

19. C. 21. D. 

23. C. 

Section 13.8, p. 469 

1. CC. 3. D. 

5. AC. 7. AC. 

9. D. II. CC. 

13. CC. 15. AC. 

17. AC. 19. AC. 

21. CC. 23. D. 

25. CC. 

27. (a) F; (b) T; (c) F; (d) F; (e) T; 
(f)F. 

Additional Problems, p. 470 

1. (a) 0; (b) i; (c) 0; (d) 1. 

5. (a) i; (b) (c) 

A n — B n 

7. x n = -——, where A and B are 

V5 

the positive and negative roots of 
x 2 -x - \ = 0. 
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1 + Vl + 4 a 
2 

23. |*| > V2. 

„ *(1 — x n ) nx n+1 

25 ^ = 7r^“T^7- 

31. (a) -In 2; (b) 1 . 

35. (a) C; (b) D; (c) C; (d) D; (e) C; 
(f) D; (g) C; (h) D; (i) C; (j) D; (k) C; 
( 1 ) C; (m) C; (n) C; (o) C; (p) D; 

(q) C; (r) C; (s) C; (t) D; (u) C; (v) C. 
51. C. 53. Inconclusive. 

55. D. 57. C. 

59. D. 


3. R = 0. 

7. [-1, 1]. 

11. (-V3, V3). 
15. Kj]. 

19. [-1, 1]. 

23. R = 

27. (0, 2e). 


CHAPTER 14 

Section 14.2, p. 489 

1. (-4,4). 

5. [-1, 1). 

9. R = 0. 

13. [-1, 1]. 

17. [-1, 1). 

21 . ( 2 , 6 ). 

25. R = 0. 

29. (a) R = 1; (b) R = 

Section 14.3, p. 494 

1 . (a) 2 (-l )" +1 nx n ~', |*| < 1 ; 

(b) 2 (- 1 )" - - 2) 2 ( " + - *", |*| < 

1 1 + x e x — 1 

3 - (a) j 1 " (b)/(*) = 

if x ^ 0,/(0) = 1; 

x , , v x 


(c) 


(1 -x) 2 


(d) 


(1 -x 2 ) 2 ' 


Section 14.4, p. 503 

15. (a) * - 


3-3! 5-5! 

(b) X + JX 4 — 56* 7 + 

(c) * - + jfX 9 - • 

17. 3.14085; 3.14159. 


“»i 1+ -* ! V+W' 

aoe x2 ; (b) y = a 0 - (<j 0 - 1)* + 

i£ o ~ !) ..? _ ( fl o ~ D ..i .... 
2! 3 ! 


1 + («o — 1) x 


l ~ X+ 2\~U' 


1 + (a 0 - \)e~ x - 

5. y = 


a i 


, * 2 . * 3 . * 4 . 
x 1!2! 213! 314! 


X “"verges for 

all x. 


Section 14.7, p. 519 

I . * + * 2 + y * 3 - ^* 5 - -^ x 6 + ■ ■ ■ . 
31 . * + |* 3 + ^* 5 + jjy * 7 + • • • ; 

R = tt/2. 

35. (a) 1; (b) i 37. 272. 


Additional Problems, p. 523 

1 . (a) o°; (b) oo; (C) e. 

5. (-1, 1). 


<‘>£ 


tan 


dr, 


(b) (1 + x) In (1 + x) — x; 


(c) - J * 1 (d) -J In (1 - * 4 ); 


(e) 


x + 4x 2 + x 3 

(1 -x) 4 


-(f) 


4 — 3* 
(1 - x .) 2 ’ 


11 . fi(x) = 2x/(l - *) 3 . 
15. (a) -i; (b) 0; (c) 


CHAPTER 15 


Section 14.5, p. 509 

1. (a) 9; (b) 13. 3. 1.64872. 
5. 0.978148. 7. 0.848048. 



13. 1.0833, 1.0981, 1.0985, 1.0986. 
15. 0.000006. 

Section 14.6, p. 513 

1 . (a) y = 


Section 15.2, p. 534 

1. (a) Circle, center (1,3) and radius 
5; (b) empty set; (c) point (5, -1); 

(d) circle, center ( 8 , - 6 ) and radius 2 ; 

(e) point (-3, 7); (f) empty set. 

3 . (a) (- 2 , - 1 ), (- 2 , 0 ), y = - 2 ; 

(b) (3,1), (5, 1),*= 1; (c) (-2, 5), 
(-2, 1), y = 9; (d) (-2, 1), (-5, 1), 
*= 1 ; (e) (- 1 , 2 ), (~\,hy = i 

5. b 2 y = Ahx(b - *). 


Section 15.3, p. 541 

I . (a) * 2 /25 + >’ 2 /21 = 1; (b) * 2 /36 + 
y 2 l 52 = 1; (c) 4* 2 /9 + y 2 /4 = 1; 

(d) * 2 /16 + y 2 H = 1; (e) * 2 /27 + 
y 2 /36 = 1; (f) 24* 2 /2500 + y 2 /100 = 1. 
3. (a) (0, 0), (0, ±5), (0, ±4), e = f; 
(b) (0, 0), (±2, 0), (±V3, 0), e = 
V3/2; (c) (-2, 1), (-2, 1 ± V2), 

(- 2 , 2 ) and (- 2 , 0 ), e = V 2 / 2 ; 

(d) ( 1 , 0 ), ( 2 , 0 ) and ( 0 , 0 ), (1 ± 

0), e = V3/2; (e) (2, -1), 

(5, - 1 ) and ( — 1 , - 1 ), (2 ± V5, - 1 ), 
e = V5/3; (f) (0, 2), (±2V2, 2), 

(± 2 , 2 ), e = V 2 / 2 . 

7. (a) y 7 rab 2 \ (b) 71 ra 2 b. 

17. _ 

21 . y = mx ± v b 2 + a 2 m 2 . 

23. r\ + Y2- 

Section 15.4, p. 549 

I. (±2, 0), (±Vl3, 0), 2y = ±3*, 
e = V73/2, * = ±4VT3. 

3. (0, ±2), (0, ±VT3), 3>- = ±2*, 
e = Vl3/2, y = ±4/Vl3. 

5. (0, ±2), (0, ±2V5), 2y = ±x, 
e = V5, y = ±2/V5. 

7. ( 0 , ± 1 ), ( 0 , ±V 2 ), y = ±x, 
e = V 2 , V 2 j = ±*. 

9. y 2 l9 - * 2 /16 = 1. 

II. * 2 /9 - t 2 /36 = 1. 

13. * 2 /36 - y 2 /28 = 1. 

15. * 2 /36 - >> 2 /45 = 1. 

17. Hyperbola with center (1, -2) and 
horizontal principal axis. 

19. Two straight lines 5(y + 1) = 

±6(x + 2 ). 

3i. (±fVbHT 2 , -f). 

Section 15.6, p. 557 

I . e = 45°, / 2 /4 + y' 2 = 1, ellipse. 

3. 6 = 30°, / 2 /2 — *' 2 /2 = 1, 
hyperbola. 

5. 6 = 45°, *' 2 = 4V'2y', parabola. 

7. 6 = 45°, x' 2 /2 + y' 2 /4 = 1 , ellipse. 
9. 6 = 60°, / 2 /3 + y' 2 / 11=1, ellipse. 

II . 0 = sin -1 1 /Vlo, *' 2 + 3y' 2 = 1 , 
ellipse. 
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ANSWERS TO ODD-NUMBERED PROBLEMS 


Additional Problems, p. 558 

1. 4 p. 

9. \n(2 + Vl7) ft 3 . 


Section 16.5, p. 582 

1. \ttci 2 . 5. 77/4. 

7. TT + 3v7. 9. a 1 (- 


7 77 
~\2 



CHAPTER 16 

Section 16.1, p. 563 

1. (a) (V2, V2); (b) (2, -2V3); 

(c) (0, 0); (d) (M,T);(e) (0, -2); 

(f) (-2V2, 2V2); (g) (-3, 0); 

(h) (-3V2, 3V2); (i) (1, 0); (j) (0, 0); 
00(1, V5); (1) (5, 12); (m) (-2V3, 2); 
(n) (0, 3). 

3. (1,0), (1, 277/5), (1, 477/5), 

(1, 677 / 5 ), (1, 877 / 5 ). 

7. (x-2 ) 2 + (j - 2) 2 = 8; circle 
with center (2, 2) and radius 2V2. 

9. (a) Line y = 2; (b) line x = 4; 

(c) line y = -3; (d) line x = —2. 


Section 16.2, p. 567 

5. (a) r = 5 sec 0; (b) r = — 3 esc 0; 
(c) r = 3; (d) r 2 = 9 sec 20; 

(e) r = tan 0 sec 0; (f) r 2 = 2 esc 20; 
sin 2 6 

(g) r = • 


(h)r = 


cos 0 cos 20’ 
2 cos 0 
tan 2 0 - 1' 


9. (1, 277/3) and (l,4rr/3); 
11. r = a sin 20. 


13. x 3 = y 2 (2a - x). 

15. (x - a)(x 2 + y 2 ) = b 2 x 2 . 


Section 16.3, p. 573 

I. 9 = r 2 + 16 — Sr cos (0 — 77 / 6 ). 
3. r = 10 cos 0. 

5. r = 4V3 cos (0 - 77/3). 

7. r = a(\ + cos 0). 

9. (a) Via, Via; (b) yV5a, \Via. 

II. r = epl{ 1 + e sin 0). 

13. e = j. 15. e = j. 

17. ±V e 2 — 1. 


19. (a) 


1 - <?’ 


0 ; (b) 


ep 

1 4- e ’ 


21 . (a) x = j cot -p-; 

2a 

(b) r = — 0 esc 0. 

77 


Section 16.4, p. 579 

C 32 9 

5. -^77 a 1 . 

11 . (a) 477 a 2 ; (b) 477 2 a 2 . 

17. VI 


Additional Problems, p. 583 

1. (a) tan 0 = 4; (b) r 2 = 36/(4 + 5 X 
sin 2 0); (c) r = 2 cos 0-4 sin 0; 

(d) r = 3/(2 cos 0 - 5 sin 0); (e) r = 

4 cot 0 esc 0; (f) r = 1 + 4 sin 0; 

(g) r = 6 sin 20/(sin 3 0 + cos 3 0). 


3 . (a) (V 2 a, 77 / 4 ); (b) the origin and 


( 2-V2 
\ 2 



/ 2 + V2 
1 2 



(c) (a/2, ± 77 / 6 ); (d) (3V2, 77/4), 
( 3 V 2 , 377/4); (e) the origin and 
({, 2 77 / 3 ), ({, 477 / 3 ); (f) (±fl, 77 / 6 ), 
(±a, — 77 / 6 ); (g) the origin and 
(8a/ 5, sin -1 3/5); (h) (4, - 77 / 3 ); 

(i) (±2, 77 / 2 ); (j) (2, ±77/3), (—1, 77 ); 
(k) the origin and (f, ±77/3); 


( 1 ) the origin and 


2 + V2 77 \ 

2 a 'J)' 


( 2 -V 2 

l 2 


a, 



; (m) the origin and 


(±a, 77/4), (±a, 377/4). 

15. Larger angle = 277/3. 
17. }a[(4 + 4 tt 2 ) 3/2 - 8]. 
21. V3 - yir. 

25. ya 2 (3V3 — 7r). 


CHAPTER 17 

Section 17.1, p. 590 

1. (a) x + y = 2; (b) 2x — y = —4. 

3. x + y = 3. 

5. x — l — (y — 3) 2 . 

I. x 2 - y 2 = \. 9 . y= \ - 2x 2 . 

II. No; the second is part of the first. 
13. (c) 45°. 

15. x = a cos 0 + a6 sin 0,y = 
a sin 0 — ad cos 0. 

Section 17.2, p. 599 


a 

V2 ay — y 2 + x. 

7. 6a. 

11. x = 2/? cos 0 + Z? cos 26, y = 
2b sin 6 — b sin 20; ^-a. 


Section 17.3, p. 605 

I. (a) VTo, I 3 i - 34j, 4i - 13j; 

(b) V53, —36i - 4j, — 39i + 14j; 

(c) 6, lOi - 33j, —2i — 33j; (d) V34, 
-i + 55j, 20i + 22 j. 

5 (a, (b) ± tz3±m 

V74 25 


7. ±2(121 +5j). 
13. 6 = 45°. 


9. 


A A 

|A| + |B| ’ 


Section 17.4, p. 610 

I. The line through the head of A 
which is parallel to B. 

5. 2/i + j, 2i, V4f 2 + 1. 

7. i + (3r 2 - 3)j, 6tj, 

Vl + 9(r 2 - l) 2 . 

9. sec 2 t\ + sec t tan rj, 2 sec 2 t tan t X 
i + (sec 3 t + sec t tan 2 f)j, 

|sec r|V2 sec 2 t — 1 . 

II. R = 2 a * 2 i + v o^ + Ro- 

Section 17.5, p. 615 

1. (a) -2/(1 + 4x) 3/2 ; (b)cosx; 

(c) 2x 3 /(2x 4 — 2x 2 + 1) 3/2 ; 

(d) -jV2e->; (e) — 4r 2 /(4/ 4 + 1) 3/2 . 

3. (a) 1; (b) Ts5 5/4 6 1/2 ; (c) none. 

5. If (as usual) s’ increases on the 
circle in the counterclockwise 
direction, then k as calculated from (4) 
has the wrong sign on the upper half- 
circle, because s’ increases in the 
direction of decreasing x. Change the 
sign of this result to get 1 la on both 
halves of the circle. 

7. (-lln2,{V2),|V3. 

9. | a at 0 = 77/4. 

1 1. 4 a sin \6. 


Section 17.6, p. 619 

3. (— aco sin cot )i + {aco cos cot) j, 
(~aco 2 cos cot)\ + ( — aco 2 sin cot) j, aco, 
0, aco 2 . 

5. (- e 1 sin t + e l cos f)i + ( e x cos t + 
e l sin r)j, ( -2e { sin r)i + ( 2e l cos t) j, 
V2e r , V2e\ Vie 1 . 


4 1 


t 2 + 


- i + 


2r 2 


t 2 + 1 


2 40 


r 2 ) : 


C t 2 + l ) 2 


1 + 


8 1 


0r 2 + 1) 


rj, 2, 0, 4/(r 2 + 1). 
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9. a n = 0 when t = 0, a n = — I when 
t = 77/2. 

II. z; > 30\ 2. 


Section 17.7, p. 626 

1 . Since M = yy j, determine 

the ratio a 3 IT 2 for any particular 
planet (for instance, the earth) and 
proceed with the arithmetic. 


Additional Problems, p. 627 

1. 3a 2 /2. 

3. (a) 57r 2 a 3 ; (b) y-7 to 2 . 

5. Si, + 

a 

1 1. Smallest radius = 9/(7V28), at 
t = 

13. Approximately 5.94 X 10 27 g. 


CHAPTER 18 

Section 18.1, p. 635 

I. Faces: x = 1, y = 4, z = 5. Edges: 
*= l,y = 4;y = 4, z = 5;* = 1, 

z =5. 

3. 64. 

5 . (a) The yz-plane and the xz-plane 
taken together; (b) all three coordinate 
planes taken together. 

7. (0, 4, 0). 

9. x 2 + y 2 + (z — l) 2 = 49 or ;t 2 + 
y 2 + z 2 = 14z. 

II. (a) The sphere with center 

( —1, 3, 5) and radius 3; (b) the point 
(5, - 1, 3); (c) the empty set; (d) the 
point (—1, 7, 3); (e) the sphere with 
center (2, —3, 0) and radius y. 

15. f. 

17. y(7i + 5j + 4k). 

19. {(A + B + C + D). 

Section 18.2, p. 639 

3. (a) 60°; (b) 45°; (c) 90°. 

7. No. 13. 2i — k; 

19. Two; 45° and 135°. 

21. c(zt - Z 2 )• 

Section 18.3, p. 646 

I . (a) 14i + 7j; (b) 3i - 3j; (c) 2i - 
14j - 22k; (d) k. 

3. 2V6. 

5. Assuming that their tails coincide, 
all three vect ors l ie in a plane. 

II. (b) 11/VT07. 


Section 18.4, p. 651 

1. (a) T; (b) F; (c) T; (d) T; (e) F; 

(f) T; (g) F; (h) F. 

3. They are parallel. 

5. (a) x = 2 + t, y = - 1 + 4r, z = 

-3 - 2v, (b) x = 2 + 3t, y = -1 - t, 
z = ~3 + 6r; (c) ;t = 2 + 2 t, y = 

-1 - 2r, z = -3 + 5 t. 

7. (a) * = 2 + 3r, y = -3r, z = 3 — 2r; 

(b) * = 4 + 4r, y = 2, z = - 1 . 

9 . (i, i,i). 11 . 6 . 

13. ~2x - y - z + 2 = 0. 

17. 8/V21. 19. (9, 0,0). 

x 2 — y+1 — z 
' 17 -2 -7' 

23. The second plane. 

25. 4x + 3;y + 4z + 2 = 0. 

29. (a) h (b) 0. 

Section 18.5, p. 656 

I. Parabolic cylinder. 

5. Plane. 

9. x 2 + (z - a) 2 = a 2 . 

II. (a) z = (b) x 2 + z 2 = 

e“ 2 > 2 . 

13. (a) y = x 2 + z 2 ; (b) 9(;t 2 + z 2 ) + 
4y 2 = 36; (c) z = 4 — x 2 — y 2 \ (d) x = 
y 2 + z 2 

15. (x - 2z) 2 + (y - 3z) 2 - 
6( x - 2 z) = 0. 

Section 18.6, p. 660 

I. Ellipsoid. 

3. Circular paraboloid. 

5. Hyperboloid of two sheets. 

7. Hyperbolic paraboloid. 

9. Hyperboloid of two sheets. 

II. Ellipsoid. 

13. Hyperbolic paraboloid. 

15. (6, -2, 2), (3, 4, -2). 

17. (a) A{k ) = 7 Tab ^1 - j; 

(b) \irabc. 

23. Hyperbolic paraboloid. 

Section 18.7, p. 663 

I. (a) (2V2, tt/4, 1); 

(b) (2, -tt/ 3, 7); (c) (2V3, tt/6, 2); 

(d) (3V5, tan -1 2, 5). 

3. (a) (2V2, 77 / 6 , 77 / 4 ); (b) ( 2 V 2 , 

5 77 / 6 , — 77-/4); (c) (2, 77 / 4 , 7t/4); 

(d) (2, 77 / 6 , — 7 t/ 2 ). 

5. r 2 + z 2 = 16. 7. r 2 = z 2 . 

9. r = 2 sin 9. I I. r = 3. 


13. p = 4. 15. p = 6 cos (f). 

17. p 2 sin 2 (p + p cos (p = 4. 

CHAPTER 19 

Section 19.1, p. 669 

I. The entire plane except the line 
y = 2x. 

3. The first and third quadrants, 
including the axes. 

5. The part of the plane above the line 

y = 3x. 

7. All of xyz-space except the origin. 

9. The solid sphere x 2 + y 2 + z 2 < 16. 

II. All of ryz-space for which z > 0 
except the planes z= it, 3tt, ... . 

25. Away from the origin. 

27. In the positive x, negative y, 
positive z direction. 

Section 19.2, p. 674 

I. 2, 3. 

3. — 6y 2 /(3;c + l) 2 , 4y/(3jc + 1). 

5. 2x sin y, x 2 cos y. 

7. tan 2y + 3y sec 2 3;c, 2x sec 2 2y + 
tan 3x. 

9. -3 sin (3* - y), sin (3* - y). 

II . e x sin y, e x cos y. 

13. 2eyix, ey\nx 2 . 

15. 2 xy 5 z 7 , 5 x 2 y 4 z 7 , lx 2 y 5 z 6 . 

17. In 

z y z 

19. (a) y = 3, z = 8x + 1; (b) ^ = 2, 
z = 6y — 1. 

31. f(x, y) = 3xy 2 — x 2 cos y + 2y. 

Section 19.3, p. 678 

I. z - 25 = 20(;c — 1) + 40(y - 2). 

3. z = 4x + 5y. 

5. z — 1 = 6(x — 3) — 8(y — 2). 

7. z - 1 = y. 

9. lOx + 13y + 13z = 75. 

13. x&c/a 2 + y 0 y/b 2 + zoz/c 2 = 1. 

15. 60°. 17. The origin. 

19. h( 1 + a)/Va. 

Section 19.5, p. 685 

I. (a) 8i + 4j + 2k; (b) 2i; (c) f(i + 

2j - 2k); (d) i(-i + 2j + fk). 

3. (a) 3, - j; (b) V^, i + j + k; 

(c) 2V19, i + 3j + 3k; (d) 3e 2 , 
i + 2j + 2k. 

5. ±(i + j - 2k)/V6 . 

7. 56/V2I; i + j + 2k; 4/V6. 
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ANSWERS TO ODD-NUMBERED PROBLEMS 


Section 19.6, p. 691 

1. 0. 

3. -9(r 2 + 9)/(r 2 - 9) 2 . 

5. 4 t 3 + 4rw 2 , 4 m 3 + 4r 2 w. 

Section 19.7, p. 695 

I. (3, 5), a minimum. 

3. (1, 2), a minimum; (-1, 2), a 
saddle point. 

5. (-1,-2) and (—2, 8), both saddle 
points. 

7. (0, 0), a saddle point; (-1, -1), a 
maximum. 

13. 2,4,6. 15. \l(21abc). 

17. i 19. Vl4. 

21 . ;c/6 + y/6 + zJ 3 = 1. 

23. 3V3/2. 

27. Base of rectangle = (2 — V3)P; 
height of rectangle = |(3 — V3)P; 
height of triangle = |(2V7 — 3 )P. 

Section 19.8, p. 701 

I. i 

3. Comer in first quadrant is 
(V2, yV2). 

5. 2r = h. 7. y, y. 

9. x 2 /3 +y 2 /\2 + z 2 m = 1. 

II. 77- 13. a 3 . 

17. (a) |d|/Va 2 + 6 2 + c 2 . 

Section 19.9, p. 707 

7. W = Ci* + C2. 

Section 19.10, p. 713 

I. 3*/y. 

3. (1 — sin y)/(;t cos y — 1). 

5. (ye-^ — 2y 2 )/(4xy - 
7. 3z/(z ~ 1), 2z/(l - z). 

;ryz cos xz + y sin ;rz 
1 - x 2 y cos ;cz 
;r sin xz 

I - * 2 y cos xz ’ 

II 

ii. v 1Q . 

CHAPTER 20 

Section 20.1, p. 717 

1. The triangle bounded by x = 0, 
y = ly = x. 

3. ft 5. 98. 

7. tt/ 8. 9. f' 

13. i In 2. 


15 


\' o \lf(x,y)dydx. 

n ln x 

, /(•*. y) dy dx. 

Io £ 2x3 d y dx = i 
I' J o 2 (5-2x-y)dxdy = 5. 

f2 [4x-x* 

23 -LL-2 “y*- 


17 


19 


21 


25. \abc. 
29. jsTTa 3 , 


27. 4. 


Section 20.2, p. 722 

Q 


9 

1 . 2 - 
5. 1 - e 

Q 625 
V. 12 . 

13. f. 

i(* 3 


3. a 2 . 


25_ 
2 ■ 


15. |tt. 


17. 


- 


a 3 ). 


Section 20.3, p. 726 

I. M = a 3 \x = y = 

3. M = J f;3c = f^y = 0. 


5. M = ya 3 ; a: = -fy7ra, y 


= 0. 


7. M — tt\ x— {it 2 — 4)/7t, y 
9. y Ma 2 . 11 . yMfl 2 . 

13. y Ma 2 . 


V = 77 


Section 20.4, p. 730 

I. 7TO 2 . 3. y(47T - 3V3)fl 2 . 

5. a 2 . 7. y(3V3 — 7r)a 2 . 

9. y(8 + 7r)a 2 . 

II. y(9V3-2 t r)a 2 . 

13. iy(9V3 4- Sir). 

15. jTTa 4 . 

Ctt/ 2 r 3 

17. zr dr dQ. 

J - tt /2 Jo 

f tt/4 f tan 8 sec 8 

19 Jo Jo zrdrdd - 

r tt/ 4 ri 

21. zr dr dd. 

Jo Jo 

23. (a) y7r[a 3 - (a 2 - b 2 ) 3/2 ]\ 

(b) y7r(a 2 — b 2 ) 312 = y7r/z 3 . 

25. x = y = 0. 


27. 5c = 0, y 



29. y7ra 3 . 

31 . \ira A b\ x = y = 0, z = ya 2 b. 
33. | Ma 2 . 


35. Ma 2 



Section 20.5, p. 735 

1. 4 3. labchr. 

5. 24. 7. 4rr. 

9. ft 

"■m; f{x, y, z) dx dy dz. 
13. a 4 /8. 

15. ,, r-/m rv^_ 

Jo J - V 1 - 2 J-Vl ->2-2 
/O, >7 z) dr dy dz. 

17. 4. 19. Hit. 

21. y nabc. 23. yabc. 

27. yMa 2 . 

Section 20.6, p. 738 

I. 77/2. 

3. r = y = 0, z = \h. 

5. yA/(2a 2 + 3b 2 ). 

7. fttf(a 2 + Ah 2 ). 

9. x = y = 0, z = ja. 

II. y(8 - 3V3 )to 3 . 

13. tt/ 32. 15. a 3 /3. 

17. fa 3 (377 - 4). 

19. f Ma 2 . 21. }jtfa 2 . 

23. y77a 3 (l — cos a). 


Section 20.7, p. 743 

1.5- y7Tfl 5 (y — cos a + y cos 3 a) = 


yMa 2 [y - y cos a (1 + cos a)]. 


3. 27T 2 fl 3 . 

7. y ota 3 . 
13. y7 TO 3 . 


5. ■ 

11 . 


3 ( a*-b* \ 
8 \a 3 — b 3 ) 
cS7ra n+5 
3{n + 5)’ 


15 ^ o) 


p 2 sin (f) dO dtf) dp\ 

(b) M = 477 p 2 f(p) dp. 


19. irGmSa sin 2 a. 
21. 2irGm 6. 


Section 20.8, p. 747 

I. 3VT4. 3. 77 a 2 V3. 

5. 277V6. 7. a 2 {ir - 2). 

9. >(5V5 - 1). 

II. y77a 2 (5V5 - 1). 

15. -^a[3VT0 + In (3 + VlO)]. 
17.2a 2 . I9.|(20-3 t7). 


I I. 77. 
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CHAPTER 21 

Section 21.1, p. 757 

I. (a) —2; (b) -f; (c) -4. 
3. 0. 

7. (a) f; (b) f; (c) (d) §. 

9. (a) 0; (b) 0; (c) -f. 

II. | for all paths. 

13. 77 for both paths. 

15. 

19. 0 along all paths. 

21 . 0 . 

Section 21.2, p. 763 

9. 0. 11- 6. 

13. e. 15. 12. 

Section 21.3, p. 769 


1. 

i 

12- 

3. 

2 In 4 

5. 

3 

2- 

7. 

1. 

9. 

1 

6- 

11. 

3 

' 10- 

13. 

1 

2- 

15. 

3 TTQ 2 . 

17. 

\ira 2 . 

19. 

. 2. 

21. 

\a 2 . 

23. 

xy 3 . 

25. 

— x 2 4- y 2 . 



27. 

x sin y + y cos 

x. 


31. 

277. 




Section 21.4, p. 778 

I. (a) 0; (b) 0; (c) 2; (d) ^ sin jc; 
(e) 2/r. 

3. 4irabc. 5. y-77 a 5 . 

7. 2^- + f(r). 
r 

II. (a) 36; (b) 18. 

13. 20V. 15. 4ira 5 . 

17. y f° r both integrals. 

19. -1277. 

Section 21.5, p. 783 

5. 77. 7. -1. 

9. 477. 11. 1877. 

13. -877. 15. 0. 

APPENDIX A 

Section A.9, p. 815 

5. iVx - 2 In (1 + V*). 

7. l\Tx - 3'Vx + 6^x - 
61n(< / x + 1). 

9. 2\fx - 2 tan -1 Vx. 

11. j.r 3/4 — 4^* + 4 tan -1 'C / x. 
13. iVx + 2 - 2 tan -1 V* + 2. 

Section A. 12, p. 824 

3. (a) k > 3; (b) all k\ (c) all k. 


Section A.14, p. 832 

3. C if JC ^ I&77, D if X = y/:77. 


APPENDIX B 

Section B.l, p. 848 

9 1 22’ 52’ 

>• (a) 4! ;(b) T6[ ;(c) W 
3. 720; 120. 5. 2880. 

7. 9 • 9! = 3,265,920. 

9. 140,400,000. 11. 120; 60; 325. 

13. 20,160. 

17. 210. 

21. 4200. 

25. 211,680. 


15. 2(3 !3!) = 72. 
19. 378. 

23. 8820. 


27. 41 

3744. 

29. 286. 
/n\ 


= 5148; 


31. 84. 


33. 

35. (a) 349,440.x 11 ; (b) -489,888.x 4 ; 

(c) —2002 a w b 21 . 

Section B.2, p. 854 

1. (a) n(n + 1); (b) n(4n + 1); (c) 4 n 2 \ 

(d) 3 n 2 \ (e) \n(5n + 1). 

(a) n ~ 1 , n > 2; (b) 1 - x 2n+ \ 


7. 
n> 0. 


2 n 











INDEX 


Abel, N. H., 106, 456, 457, 565 
partial summation formula, 831 
series, 457 
test, 833 
Abscissa, 6 
Absolute: 
maximum, 116 
value, 4 

Absolute convergence, 825 
Absolutely convergent series, 467 
Absorption, Lambert’s law of, 282 
Acceleration, 65, 182, 608 
angular, 394 
due to gravity, 183 
Accurate to n decimal places, 505 
Adams, J. C., 626 
Addition formulas, 42, 295 
Adiabatic gas law, 250, 21 ~ 

Age of rocks, 280 
Agnesi, M., 592 
Air resistance, 285 

Algebra, fundamental theorem of, 353, 522 
Algebraic area, 214 
Algebraic function, 27 
Algebraic volume, 720 
Alternating series, 465 
Alternating series test, 465 
Amplitude, 319 
Analysis: 
complex, 523 
vector, 605 

Analytic geometry, 15 
Analytic trigonometry, 39 
Angle of inclination, 12 
Angle between two surfaces, 678 
Angles, direction, 640 
Angular acceleration, 394 
Angular velocity, 394 
Antiderivative, 171 
Aphelion, 542 
Apogee, 542 


Apollonius, 520 

Approximation, tangent line, 166 
Arago, F., 274 
Arc, 236 
centroid of, 390 
length, 236 
element of, 238 

Archimedes, 66, 190, 192, 193, 206, 225, 
228, 243, 257, 384, 530, 534, 572, 738 
principle in hydrostatics, 250, 324 
principle of the lever, 259, 384 
spiral of, 572, 579 
on surface area of sphere, 228, 243 
on volume of sphere, 257 
Area, 191 
algebraic, 214 
of circle, 191 
of curved surface, 744 
geometric, 214 
of parabolic segment, 192 
Area: 

element of, 221 
problem of, 51 
Aristotle, 218, 257 
on Hippocrates’ misfortune, 218 
Arithmetic mean, 702 
Arrow diagram, 24 
Arterial branching, 312 
Astroid (see Hypocycloid, of four cusps) 
Astronomical unit, 626 
Asymptote of a curve, 33 
Atmospheric pressure, 282 
Average cost, 148 
Average velocity, 63 
Axiom, least upper bound, 789 
Axis(es): 
of a cone, 529 
coordinate, 632 
of a hyperbola, 544 
of a parabola, 18, 533 
polar, 560 


Axis(es) (Cont.y. 
rotation of, 554 

Badeer, H. S., 234 
Barrow, Isaac, 212 
Bartlett's Familiar Quotations, 30 
Beckmann, P., 500 

Bernoulli, James, 67, 169, 275, 379, 495, 
565, 598 

Bernoulli, John, 67, 169, 274, 379, 401, 
495, 596, 598 
on Euler’s feats, 275 
joy on solving the catenary problem, 
379-380 

solution of brachistochrone problem, 
596-598, 629 
Bernoulli numbers, 525 
Bessel equation, 494, 512-513 
Bessel, F. W., 275, 513 
Bessel function, 489, 494, 513 
Big Bang, 189 
Big Crunch, 189 
Binomial coefficients, 847 
Binomial series, 503 
Binomial theorem, 84, 110, 845 
Black holes, 187 
Blood flow, 233 
Bound: 
lower, 432 
upper, 433 

Bounded sequence, 432 
Bounded variation, function of, 809 
Boundedness theorem, 796 
Boyle’s law, 250 

Brachistochrone, differential equation of, 
630 

Brachistochrone problem, 596-598, 629 

Bracket symbol, 209 

Brahe, Tycho, 620 

Branches of hyperbola, 543, 544 

Brouncker’s continued fraction, 382 
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INDEX 


Brouwer’s fixed point theorem, 800 

Brown, P. L., 541 

Buck, R. C, 694 

Buff on’s needle problem, 307 

Biihler, W. K., 423 

Burton, A. C, 234 

Cajori, F., 275 
Calculus: 

differential, 51 

fundamental theorem of, 52, 190, 
212, 758, 808 
for line integrals, 758 
infinitesimal, 169 
integral, 51 

Leibnizian spirit in, 578 
Calder, N., 541 
Canceling, 35 
Cardano, G„ 106 
Cardioid, 564, 567, 575, 577 
Cartesian: 

coordinates, 15 
equation of a plane, 648 
plane, 15 

Cassegrain telescope, 548 
Cassini, G. D., 571 
ovals of, 571 

Catenary, 327, 328, 347, 378-380 
Cauchy, A. L„ 456, 464 
Cauchy condensation test, 474 
Cauchy integral test, 456 
Cauchy product of two series, 828 
Cavalieri, Bonaventura, 205, 230 
Cavalieri’s principle, 230 
Cavendish, Henry, 626 
Center: 

of circle, 532 
of curvature, 614 
of ellipse, 536 
of gravity, 244, 385 
of a hyperbola, 544 
of mass, 385, 386, 388, 724 
of population, 386 
Central force, 622 
Central gravitational force, 623 
Centripetal force, 609 
Centroid, 244, 388, 724 
of an arc, 390 
Certainty, Ultimate, 119 
Certitude, 787 
Chain rule, 93, 686, 688 
Champollion, J., 707 
Chebyshev, P., 816 
Church, wrathful, 620 
Cicero, Marcus Tullius, 259 
Circle, 16, 532 
area of, 191 
center of, 532 
of curvature, 614 


Circle (Cont.)\ 
equation of, 16, 532 
involute of, 592 
parametric equations for, 588 
polar equation of, 569 
radius of, 532 
Circular error, 322, 598 
Circular functions, 325 
Circular motion, uniform, 608 
Circulation, 779 

208, Circulation density, 779 

Cissoid of Diodes, 568, 583 
Clarke, Arthur C., 540, 619 
Classical physics, 251 
Clepsydra, 230 
Closed curve, 756 
Closed interval, 4 
CNN, 269 

Cobb-Douglas production function, 701 
Coefficient, binomial, 847 
Coefficient of friction, 331 
Coefficients of fix), Taylor, 495 
Combination, 847 
Comets, 549 
Common logarithm, 263 
Communications satellite, 618 
Comparison test, 451 
for improper integrals, 414 
limit, 452 

Complete elliptic integral, 542 
Complete ordered field, 790 
Completing the square, 17, 348 
Complex analysis, 484, 523 
Complex number, 521 
Composite function, 25, 93 
Computation of it, 498 
Concave down, 120 
Concave up, 120 
Conchoid: 
general, 584 

of Nicomedes, 568, 583, 584 
Condensation test, Cauchy’s, 474 
Conditional convergence, 825 
Conditionally convergent series, 468 
Cone: 

axis of, 529 
elliptic, 659 
generator of, 529 
right circular, 529 
vertex of, 529 
vertex angle of, 529 
Conic section, 529 
degenerate, 530 
Conics, 529 

Conjugate axis of hyperbola, 544 
Conservation of energy, law of, 248, 762 
Conservative field, 761, 781 
Constant, Euler’s, 275, 458, 460 
Constant of integration, 172 


Constraint, 696 
Construction problems, 566 
Continued fraction: 

Brouncker’s, 382 
expansion of V2, 438 
Laplace’s, 422 
Continuity, uniform, 839 
theorem on, 806 

Continuous function, 75, 666, 795 
Continuous at a point, 75 
Continuous vector, 606 
Continuously compounded interest, 268 
Contour map, 667 
Convergence: 
absolute, 825 
conditional, 825 
interval of, 486, 488 
radius of, 486, 488 
of series, 439 
unconditional, 830 
uniform, 490, 834 
for power series, 834 
Convergent improper integral, 409, 411, 
455 

Convergent sequence, 433 
Convergent series, 439 
Coolidge, J. L., 382 
Cooling, Newton’s law of, 282 
Coordinate axes, 632 
Coordinate plane, 6, 633 
Coordinate system, rectangular, 5 
Coordinates: 

Cartesian, 15, 633 
cylindrical, 661 
polar, 560 
rectangular, 633 
spherical, 662 
Copernicus, Nicolaus, 620 
Corners, 118 

Corridor problem, 130, 312 
Cosecant, 294 
Cosine, 293 

of an acute angle, 38 
of an arbitrary angle, 40 
direction, 640 
graphs of, 42, 298 
hyperbolic, 324 

power series expansion of, 431, 483 
series, 431,450 
Taylor series f or, 503 
values of, 42 
Cosines, law of, 299, 637 
Cost: 

average, 148 
function, 146 
marginal, 147 
Cotangent, 294 

Euler’s partial fractions expansion of, 
526 


INDEX 
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Coulomb’s law, 250 
Courant, R., 522 
Critical point, 116, 693 
Critical value, 116 
Cross product, 640 
Curl, 772 
Curvature, 611 
center of, 614 
circle of, 614 
radius of, 613 
Curve: 

asymptote of, 33 
closed, 756 
level, 667 
nonrectifiable, 808 
normal (Gaussian), 418 
piecewise smooth, 755 
rectifiable, 808 
simple, 765 
simple closed, 765 
positively oriented, 765 
smooth, 239, 610 
Curvilinear motion, 181 
Cusps, 118 
Cycloid, 592, 593 
parametric equations for, 592 
Cylinder, 653 
generator of, 653 
right circular, 653 
Cylindrical coordinates, 661 
element of volume in, 736, 739 
Cylindrical shells, 231 

Decay: 

exponential, 280 
radioactive, 278 
Decimal, 450, 790 
infinite, 428 
repeating, 443 

Decomposition into partial fractions, 352 
Decreasing function, 77, 115 
Definite integral, 192, 201, 804 
Leibniz notation for, 200 
Definition of e, 265 
Degenerate conic section, 530 
Degree of polynomial, 27 
Del operator, 685 
Delta notation, 55 
Deltoid curve, 600 
Demand curve, 152 
Demand, elasticity of, 153 
Demand function, 152 
Democritus, 243, 257, 815 
Dependent variable, 24 
Derivative, 58 
directional, 681 
left, 62 

mixed second partial, 673 
partial, 670 


Derivative (Cont.)\ 
second-order, 672 
right, 62 
second, 107 
of vector, 606 

Descartes Ren6, 15, 20, 136, 594, 600 
dismay of, 592 

folium of, 106, 107, 592, 627, 770 
refuted by Fermat, 136 
on Snell’s law, 136 
Determinants, 643 
Difference quotient, 58 
Differentiable function, 58, 680 
Differentiable at a point, 58 
Differential, 164, 680 
partial, 688 
total, 680, 688 
Differential calculus, 51 
Differential equations, 178 
general solution of, 179 
order of, 178 

particular solution of, 179 
Differential, partial, 688 
Differential triangle, 238 
Differentiation, 58, 83 
implicit, 103, 708 
logarithmic, 277 
Diffusion equation, 704 
Dirac, P. A. M., 821 
Direction angles, 640 
Direction cosines, 640 
Directional derivative, 681 
Directrix, 17 
of ellipse, 538 
of hyperbola, 546 
of parabola, 531, 532 
Dirichlet, P. G. L., 707, 821, 833 
Dirichlet’s test, 831 
Dirichlet’s theorem on primes, 833 
Discontinuity: 
of a function, 36 
infinite, 34 
Discriminant, 557 
Disk method, 227 
Distance formula, 7, 634, 652 
Distance from a point to a line, 48, 160 
Distribution: 
normal, 420 
standard normal, 420 
Distribution function, 419 
normal, 419 
Divergence, 771 
of series, 439 
of vector field, 771, 773 
Divergence theorem, 775 
Divergent improper integral, 409, 411, 455 
Divergent series, 439 
Divine intervention, 706 
Division of series, 516 


Double-angle formulas, 42 
Domain, 23, 665 
Doomsday equation, 291 
Dot product, 636 
Double integral, 718 
Double-angle formulas, 296 
Doubly ruled surface, 660 
Dummy variable, 215 
Dyne, 245 

e, 269 

definition of, 265 
irrationality of, 448 
series for, 447 
transcendence of, 815 
e x , power series expansion of, 430, 491 
Eccentricity: 
of ellipse, 536 
of hyperbola, 545 
Eddington, Arthur, 461 
Edwards, H. M., 815 
Egypt, Great Pyramid of, 251 
Einstein, Albert, 183, 202, 697 
on elegance, 697 
equation E = Me 2 , 251-252 
law of motion, 251 
Elastic, 153, 154 
Elasticity of demand, 153 
Electric circuit, 283 
Electrostatic potential, 703 
Element: 

of arc length, 238 
of area, 221 

in polar coordinates, 727 
of force, 253 
of volume, 225 

in cylindrical coordinates, 736, 739 
of work, 246 
Elementary function, 334 
Ellipse, 217, 529, 535 
area of, 217, 541 
center of, 536 
directrix of, 538 
eccentricity of, 536 
equation of, 537 
focal radii of, 537 
foci of, 536 
latus rectum of, 558 
major axis of, 536 
minor axis of, 536 
parametric equations for, 588 
reflection property of, 539 
semimajor axis of, 536 
semiminor axis of, 536 
string property of, 536 
vertices of, 536 
Ellipsoid, 657 
Elliptic cone, 659 
Elliptic cylinder, 654 
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Elliptic integral, 348, 579, 813 
complete, 542 
of the first kind, 628, 816 
Elliptic paraboloid, 659 
Endpoints, 4, 118 
Energy: 

kinetic, 247, 395, 761 
law of conservation of, 248, 762 
potential, 247, 761, 769 
total, 248 
Epicycloid, 594 
Epsilon-delta definition, 70 
Equation(s): 

Bessel, 494, 512-513 
of a circle, 16, 532 

differential (see Differential equations) 

diffusion, 704 

doomsday, 291 

general linear, 13 

of a graph, 15 

heat, 702 

intercept, 15 

Laplace’s, 674, 702, 703 
of a line, 11 
Maxwell’s, 784 
one-dimensional heat, 704 
parametric, 586 
for circle, 588 
for cycloid, 592 
for ellipse, 588 
of line, 647 
of plane, 648 
Cartesian, 648 
vector, 648 
point-slope, 12 
slope-intercept, 13 
of a sphere, 635 
of state, 692 
symmetric, of line, 647 
vector, of line, 647 
Volterra’s prey-predator, 290 
wave, 675, 702 
one-dimensional, 705 
Equiangular spiral, 578 
Equipotential surfaces, 668 
Erg, 245 

Escape velocity, 186 
Euclid, 47, 480 
on primes, 437 

Euler, L., 169, 274, 446, 448, 456, 474, 
480, 525, 750, 815, 830 
constant, 275, 458, 460 
discovery of e, 273 
elegant truths of, 525 
expansion of cotangent, 527 
formula, 521 
for 2 \/n 2 , 476, 748 
for 2 \/n 2k , 528 
identity, 480 
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Euler (Cont.): 

infinite product for sine, 477 
irrationality of e, 449 
quoted: on fabric of world, 136 
series 2 1 /m 2 , 748 

theorem: on homogeneous functions, 688 
Even function, 50, 216, 518 
Evolute, 614 

Exhaustion, method of, 190 
Expansion, power series, 428, 483 
of cos x, 431, 483 
of e x , 430, 491 
of /(*), 489 
of sin x, 431, 483 
of tan -1 x, 491 
Exponential decay, 280 
Exponential function: 
general, 261 
the, 265 

Exponential growth, 268, 278 
Exponential spiral, 578 
Exponents, fractional, 104, 261 
Extreme value theorem, 77, 197, 797 
Eynden, C. V., 482 

Factorial, 109, 430, 845 
Factorial notation, 109, 430 
Fechner-Weber law, 290 
Fermat, P., 15, 21, 52, 53, 136, 190, 212, 
592 

concept of tangent, 53 
and Descartes, 592 
last theorem, 21, 591, 821 
principle of least time, 136, 629 
theorem, 78 
Fermi, Enrico, 691 
Feynman, Richard, 494, 784-785 
Fibonacci, 471 
Fibonacci sequence, 471 
Field (algebraic), 790 
ordered, 790 
complete, 790 
Field (in vector analysis): 
conservative, 761, 781 
force, 752 
gradient, 752 
of scalar, 758 
irrotational, 781 
scalar, 752 
vector, 752 

First law of motion, Newton’s, 182, 611 
First Mean Value Theorem of Integral 
Calculus, 208 
First-order reaction, 283 
Fixed point theorem, Brouwer’s, 800 
Flux: 

through surface, 774 
of vector field, 772 
Focal chord of a parabola, 49 


Focal radii of ellipse, 537 
Focus, 17 
of ellipse, 536 
of hyperbola, 543 
of parabola, 532 

Focus-directrix-eccentricity definition, 550 
Folium of Descartes, 106, 107, 592, 627, 
770 
Force: 

central, 622 

gravitational, 623 
centripetal, 609 
of gravity, 183 
restoring, 320 
Force field, 752 
Formula(s): 

Abel’s partial summation, 831 
addition, 42, 295 
distance, 7, 634, 652 
double-angle, 42, 296 
Euler’s 476, 521 
for 2 1/m 2 , 476, 748 
for 2 1/m 2 *, 528 
half-angle, 42, 46, 296, 341 
Heron’s, 695 
Legendre’s, 747 
Leibniz, 317, 361, 692 
midpoint, 8 
quadratic, 22 
recursion, 511 
reduction, 344, 356, 360 
Stirling’s, 472 
subtraction, 45, 296 
Four squares theorem, 21 
Four vertex theorem, 615 
Four-leaved rose, 566 
Fourier, J., 275, 704, 707 
Fractional exponents, 104, 261 
Franklin, Benjamin, 269 
Franklin, P., 719 
Frequency, 320 
Frequency density, 415 
Function, 23 
algebraic, 27 
Bessel, 489, 494, 513 
of bounded variation, 809 
circular, 325 
composite, 25, 93 
continuous, 75, 666, 795 
cost, 146 

decreasing, 77, 115 
demand, 152 
differentiable, 58, 680 
discontinuity of, 36 
distribution, 419 
domain of, 23 
elementary, 334 
even, 216, 518 
and odd, 50, 518 
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Function (Cont.y. 
exponential, 265 
of a function, 26 
gamma, 413 
general exponential, 261 
general logarithm, 262 
graph of, 24 
homogeneous, 688 
hyperbolic, 325 
implicit, 102, 710 
improper rational, 351 
increasing, 77, 115 
integrable, 202, 806 
inverse, 317 
inverse hyperbolic, 327 
limit of, 68 
linear, 29 

normal distribution, 419 
odd, 217, 518 
period of, 43 
periodic, 43, 294 
potential, 769 
price, 152 

probability density, 415 
normal (Gaussian), 418 
profit, 151 
proper rational, 351 
quadratic, 29 
rational, 27, 351 
improper, 351 
proper, 351 
revenue, 151 
transcendental, 27 
trigonometric, 292 
inverse, 313 
zero of, 31 
zeta, 474, 480 
Fundamental lemma, 680 
Fundamental theorem of algebra, 353, 522 
Fundamental theorem of calculus, 52, 190, 
208, 212, 758, 808 
for line integrals, 758 

Galileo, 1,230, 379,448, 530, 594 
Gamma function, 413 
Gas law: 

adiabatic, 161, 250, 283 
ideal, 671 

Gauss, C. F„ 202, 274, 423, 456, 513, 565, 
579, 821, 833 
on lawyers, 787 
Gauss’s test, 823 

Gauss’s theorem on divergence, 755 
Gelfond, A. 0., 815 
General concoid, 584 
General exponential function, 261 
General linear equation, 13 
General logarithm function, 262 
General solution, 179 


Generalized mean value theorem, 401, 803 
Generalized Stokes theorem, 783 
Generator: 
of a cone, 529 
of a cylinder, 653 
Geometric area, 214 
Geometric mean, 702 
Geometric series, 440 
Geometry, analytic, 15 
Grade inflation, 421 
Gradient, 682, 771 
Gradient: 
field, 752 

of a scalar field, 758 
Graph: 

of an equation, 15 
of a function, 24 
Graph(s): 

of cosine, 42, 298 
of polar equation, 561 
of sine, 42, 298 
of tangent, 298 
Gravitation: 
constant of, 246 

Newton’s law of, 185, 246, 621, 706 
Gravitational attraction of spherical shell, 
741 

Gravitational force, central, 623 
Gravitational potential, 703 
Gravity: 

acceleration due to, 183 
center of, 244, 385 
force of, 183 

Great Plague of London, 66 
Great Pyramid of Egypt, 251 
Green, George, 765 
Green’s theorem, 765, 780 
Growth: 

exponential, 268, 278 
population, 277 
inhibited, 284 

Growth curve, inhibited or sigmoid, 285 
Gyration, radius of, 396 

Half-angle formulas, 42, 46, 296, 341 
Half-life, 279 
Hall, Tord, 423 
Halley, Edmund, 540 
Halley’s Comet, 540 
Halmos, P. R„ 445 
Hamilton, Sir W., 501 
Hardy, G. H„ 595 
Harmonic series, 441 
Heart, working, 248 
Heat equation, 702 
one-dimensional, 704 
Heath, T. L., 572 
Heilbroner, R. L., 1 54 
Heliocentric system, 620 


Henderson, J. M., 689 
Hermite, C„ 479, 815 
Heron of Alexandria, 135 
Heron’s formula, 695 
Hille, E„ 790 

Hippocrates of Chios, 193, 218, 583 
lune of, 193 
Histogram, 414 
Hobbes, Thomas, 382 
Holder’s inequality, 702 
Holmes, Oliver Wendell: 
on certitude, 787 
on Descartes, 20 

Homogeneous function, Euler’s theorem 
688 

Hooke’s law, 245 
Horizontally simple, 720 
Hubble’s law, 189 
Hungarian proverb, 118 
Hutchinson, G. Evelyn, 778 
Huygens, Christiaan, 379, 598 
and tautochrone, 598 
Hydrostatics, 252 
Hyperbola, 529, 543 
asymptote of, 545 
axes of, 544 
branch of, 543, 544 
center of, 544 
conjugate axis of, 544 
directrix of, 546 
eccentricity of, 545 
focus of, 543 
latus rectum of, 559 
principal axis of, 544 
rectangular, 546 
reflection property of, 548 
vertices of, 544 
Hyperbolic functions, 325 
inverse, 327 

Hyperbolic paraboloid, 659 
Hyperbolic spiral, 572 
Hyperboloid: 
of one sheet, 658 
of two sheets, 658 
Hypergeometric series, 489, 824 
Hypocycloid, 594 
of four cusps, 114, 240, 596 

i, 521 

i-, j-, k-component, 634 
I-beam, 396 
Ideal gas law, 671 
Identities, trigonometric, 41-42 
Imaginary unit, 521 
Implicit differentiation, 103,708 
Implicit function, 102, 710 
Implicit function theorem, 710, 840 
Improper integral, 409, 455 
comparison test for, 414 
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Improper integral (Cont.)\ 
convergent, 409, 411, 455 
divergent, 409, 411, 455 
Inclination, 12 
Increasing function, 77, 115 
Increasing sequence, 435 
Increment, 55 

Indefinite integral, 171-172, 192 
Independent variable, 24 
Indeterminate form, 398, 404, 405 
Index of refraction, 136 
Induction, mathematical, 471 
principle of, 850 
Inelastic, 153, 154 
Inequalities, 3 
Inequality: 

Holder’s, 702 
Schwarz’s, 160, 702 
Inertia: 

moment of, 393, 394, 724 
polar moment of, 725 
Infinite decimal, 428 
Infinite discontinuity, 34 
Infinite product for the sine, Euler’s, 477 
Infinite product, Wallis’s, 275 
Infinite series (see Series) 

Infinitesimal, 168 
Infinitesimal calculus, 169 
Infinity, 5 

Inflection, point of, 120 
Inhibited growth curve, 285 
Inhibited population growth, 284 
Initial condition, 180 
Instantaneous velocity, 64 
Integers, 2 

Integrable function, 202, 806 
Integral, 171, 804, 805 
complete elliptic, 542 
definite, 192, 201, 804 
double, 718 
elliptic, 348, 579, 813 
of the first kind, 628, 816 
improper, 409, 411, 455 
comparison test for, 414 
convergent, 409, 411, 455 
divergent, 409, 411, 455 
indefinite, 171-172, 192 
iterated, 715 
line, 752 

fundamental theorem of calculus for, 
758 
lower, 805 
Riemann, 203, 707 
sign, 171, 201 
surface, 774 
test, 456 
Cauchy, 456 
triple, 731 
upper, 805 
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Integral calculus, 51 

first mean value theorem of, 208 
Integrand, 171, 201 
Integration, 171 
constant of, 172 
Leibniz approach to, 723 
limits of, 201 
numerical, 369 
by parts, 357 
by substitution, 174, 176 
variable of, 201 
Intercept equation, 15 
Interest, continuously compounded, 268 
Interior point, 5 

Intermediate value theorem, 78, 143, 799 
Interval, 4 

of convergence, 486, 488 
Inverse function, 317 
Inverse hyperbolic function, 327 
Inverse sine, 313 

Inverse square law of universal gravitation, 
Newton’s, 621 
Inverse substitution, 817 
Inverse tangent, 314 
Inverse tangent series, 498 
Inverse trigonometric functions, 313 
Involute of a circle, 592 
Irrational number, 2, 444 
Irrotational field, 781 
Isothermal surfaces, 668 
Iterated integral, 715 

Jacobi, C. G. J., 842 
Jacobian, 842 
James, D. E., 689 
Jesus, horoscope of, 106 
Jointly proportional, 130 
Joule, 245 

Kelvin, Lord (W. Thomson), 729 
Kepler, Johannes, 530, 540, 571, 605, 620, 
621 

first law, 625 

laws of planetary motion, 620 
problem of, 159 
second law, 623 
of planetary motion, 581 
third law, 625 
Keynes, J. M., 154, 445 
Kinetic energy, 247, 395, 761 
Knopp, K., 517,525 
Kummer, E. E., 821 
Kummer’s theorem, 821 

Lagrange, J. L., 169, 501 
Lagrange multiplier, 698 
Lagrange’s remainder formula, 501 
Lambert, J. H., 815 
Lambert’s law of absorption, 282 


Landau, E., 790 
Laplace, P.S., 275, 706, 765 
Laplace transform, 413 
Laplace’s continued fraction, 422 
Laplace’s equation, 674, 702, 703 
Latus rectum: 
of ellipse, 558 
of hyperbola, 559 
of parabola, 558 
Law: 

of absorption, 282 
adiabatic gas, 250, 283 
Boyle’s, 250 

of conservation of energy, 248, 762 
of cooling, Newton’s, 282 
of cosines, 299, 637 
Coulomb’s, 250 
Fechner-Weber, 290 
of gravitation, Newton’s, 185, 706 
Hooke’s, 245 
ideal gas, 671 
Kepler’s: 
first, 625 
second, 623 

of planetary motion, 581 
third, 625 

of the lever, Archimedes’, 384 
of mass action, 283 
of motion: 

Einstein’s, 251 
Newton’s first, 182 

second, 182, 251,608, 761 
Ohm’s, 312 
Parallelogram, 639 
Poiseuille’s, 234, 312 
of reflection, 134-135 
of refraction, Snell’s, 135, 629, 702 
of sines, 50 
Torricelli’s, 230 
Lawyers, Gauss on, 787 
Least squares, 159 
method of, 138, 696 

Least time, Fermat’s principle of, 136, 629 
Least upper bound, 789 
Least upper bound axiom, 789 
Legendre, A. M., 747 
Legendre’s formula, 747 
Legendre polynomials, 524 
Leibniz, G. W., 21, 52, 67, 169, 171, 190, 
207, 211, 274, 361, 382, 429, 441, 
465, 466, 495, 598 
alternating series test, 465 
approach to calculus, 168-169, 222 
approach to integration, 723 
formula, 317, 361,499, 692 
for differentiating integrals, 692 
for 7t/4, 317, 361,499 
notation, 60, 61 

notation for definite integral, 200 
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Leibniz (Cont.): 

notation for derivatives, 60-61 
series, 317, 441, 499 
use of differentials, 168 
Leibnizian myths, 168 
Leibnizian spirit in Calculus, 222, 578 
Lemniscate, 48, 565, 576, 580 
Length of vector, 634 
Leonardo of Pisa, 471 
Level curve, 667 
Level surface, 668 

Lever, Archimedes’ principle of, 259, 384 

Leverrier, U. J., 626 

Lewis, J. H., 599 

L’Hospital, G.F.A. de, 401 

L’Hospital’s rule, 401 

Libby, W., 281 

Lima^on, 564, 567 

Limit: 

of a function, 68 
of a sequence, 433 
Limit comparison test, 452 
Limit laws, 72 

Limit-of-sums process, 199-201 
Limits of integration, 201 
Lindemann, F., 815 
Line: 

equation of, 11 
parametric equations of, 647 
slope of, 11 

symmetric equations of, 647 
vector equations of, 647 
Line integral, 752 

fundamental theorem of calculus for, 758 
Linear approximation, 166 
Linear equation, general, 13 
Linear function, 29 
Liouville, J., 813, 815 
Liouville’s theorem, 815 
Lithotripsy, 540 
Little old man, 504 
Little-oh notation, 458 
Lobachevsky, N., 347 
Locus, 20 
Logarithm, 262 
common, 263 
natural, 269 

Logarithm function, general, 262 

Logarithm series, 490^91, 508 

Logarithmic differentiation, 277 

Long division, 351 

Lord Brouncker, 382 

Lower bound, 432 

Lower limits of integration, 201 

Lower sum, 199 

Lune of Hippocrates, 193 

Machin, J., 500 
Machine diagram, 24 


Maclaurin’s series, 500 
Magic, in mathematics, 176 
Major axis of ellipse, 536 
Marginal analysis, 151 
Marginal cost, 147 
Marginal profit, 151 
Marginal Q, 66 
Marginal revenue, 151 
Mass, 724 

center of, 384, 385, 386, 388 
Mass action, law of, 283 
Mathematical induction, 471 
principle of, 850 
Maximum, 116 
absolute, 116 

at endpoints, cusps, and corners, 118 
relative, 692 

Maxwell, James Clerk, 784 
Maxwell’s equations, 784 
Mead, D. G., 813 
Mean, 416 

arithmetic and geometric, 702 
proportional, 584 

Mean value theorem, 76, 171, 237, 399, 
802 

generalized, 803 
Mechanics, Newtonian, 182 
Median, 10 

Mersenne, Marin, 584, 594 
Mertens, F., 830 
Method: 

of cylindrical shells, 231 
disk, 227 

of exhaustion, 190 
of least squares, 138, 696 
of moving slices, 227 
of partial fractions, 351 
of substitution, 337 
washer, 228 
Microeconomics, 146 
Midpoint formulas, 8 
Millikan, R. A., 283 
Minimum, 116 

at endpoints, cusps, and comers, 118 
Minor axis of ellipse, 536 
Mixed second partial derivative, 673 
Mixing, 286 

Moment, 244,259, 384, 724 
of inertia, 393, 394, 724 
polar, 725 
Momentum, 189 
Monte, Guidobaldo del, 448 
Morgan, J. P., 431 
Morse, P. M., 691 
Motion: 

curvilinear, 181 
Einstein’s law of, 251 
Newton’s: 

first law of, 182, 611 


Motion (Cont.): 

second law of, 182, 251, 608, 761 
rectilinear, 181 
simple harmonic, 3 19 
uniform circular, 608 
Mountain building, 251 
Moving slices, method of, 227 
Multiplier effect, 445 

Napoleon, 706, 707 
Nappes, 529 
Natural logarithm, 269 
Natural number, 2 
Needle problem, Buff on’s, 307 
Neighborhood, 666 
Nephroid curve, 600 
Neumann, J. von, 445 
Neutron stars, 187 
Newman, J. R., 626, 784 
newton (unit), 245 

Newton, Isaac, 21, 52, 66, 190, 207, 211, 
234, 274, 472, 495, 530, 540, 571, 
574, 598, 605, 619, 626, 627, 743 
area under quadratrix, 574, 584 
and Bernoulli, John, 598 
and Fermat, 21 
first law of motion, 182, 611 
and Halley, 540 

inverse square law of universal 
gravitation, 621 
law of cooling, 282 
law of gravitation, 185, 246, 706 
mechanics, 182 

method of approximate solution, 144 
second law of motion, 182, 251, 608, 
761 

theorem on attraction of spheres, 741 
Niven, I., 479, 482, 595 
Norm of partition, 804 
Normal, 88 

Normal curve (Gaussian), 418 
Normal distribution, 420 
function, 420 

Normal probability density function, 418 

Notation, factorial, 109, 430 

nth term test, 442 

nth-degree Taylor polynomial, 501 

Number(s), 

Bernoulli, 525 
complex, 521 
irrational, 2, 444 
natural, 2 
prime, 437 
rational, 2, 444 
real, 787 

transcendental, 815 
Numerical integration, 369 

Oblate spheroid, 229, 541, 657 
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Odd function, 50, 217, 518 
One-dimensional heat equation, 704 
One-dimensional wave equation, 705 
Open interval, 4 
Orbit, synchronous, 619 
Orbital speed, 618 
Ordered field, 790 
complete, 790 
Ordinate, 6 
Origin, 5, 632 

Oscillation of a function, 806 
Ovals of Cassini, 571 

/^-series, 453, 456 
Paddle wheel, 779 
Paint paradox, 414 
Pappus of Alexandria, 391, 574 
Pappus’ theorems, 391-392 
Parabola, 17, 529 
axis of, 18, 533 
directrix of, 531, 532 
equation of, 18, 533 
focal chord of, 49 
focus of, 532 
latus rectum of, 558 
reflection property of, 88, 534 
vertex of, 18, 533 
Parabolic cylinder, 654 
Paraboloid: 
elliptic, 659 
hyperbolic, 659 
Parallel axis theorem, 397 
Parallelogram law, 639 
Parallelogram rule, 602 
Parameter, 586 
Parametric equations, 586 
for circle, 588 
for cycloid, 592 
for ellipse, 588 
of line, 647 
Parthenon, 220 
Partial derivative, 670 
mixed second, 673 
second-order, 672 
Partial differential, 688 
Partial fractions, 351 
decomposition into, 352 
expansion of cotangent, 526 
method of, 351 
theorem, 354, 818 
Partial sums, 439 
Particle, 181 
Particular solution, 179 
Partition, 804 
norm of, 804 
Parts, integration by, 357 
Pascal, Blaise, 21, 190, 212, 584, 854 
Pascal, Etienne, 584 
Path, 755 


Pendulum, 322, 627 
period of, 322 
Perfect competitor, 155 
Pericles, 219 
Perigee, 542 
Perihelion, 542 
Period, 319, 294 
of a function, 43 
Periodic function, 43 
Permutation, 846 
pH, 264 
pi (t r): 

computation of, 498 
definition of, 191 

experimental method of calculating, 309 
irrationality of, 479 
Leibniz’s formula for n/4, 317, 499 
value of, 500 

Wallis’s product for 77 / 2 , 361, 380 
Piecewise smooth curve, 755 
Piecewise smooth surface, 775 
Plane: 

Cartesian, 15 
equation of, 648 
coordinate, 633 
equation of, 648 
tangent, 676 
vector equation of, 648 
Planetary motion, Kepler’s laws of, 581, 
620 

Plato, hatred of Democritus, 257 
Point: 

critical, 693 
of inflection, 120 
interior, 5 
surface, 660 
Point-slope equation, 12 
Poiseuille, Jean, 234 
Poiseuille’s law, 234, 312 
Poisson, S., 283 
Poisson’s gas equation, 283 
Poker hands, 848 
Polar axis, 560 
Polar coordinates, 560 
element of area in, 727 
Polar equation: 
of circle, 569 
of conic section, 571 
graph of, 561 

Polar moment of inertia, 725 
Polya, G., 129, 276, 854 
Polynomial, 26 
degree of, 27 
Legendre, 524 
Pope, Alexander, 67 
Population, center of, 386 
Population growth, 277 
inhibited, 284 
Population, world, 289 


Positive integer, 2 
Potential, 246, 769 
Potential, electrostatic, 703 
Potential energy, 247, 761, 769 
Potential function, 769 
Potential, gravitational, 703 
Power rule, 95 
Power series, 484 
expansion, 428, 483 
of cos x, 431, 483 
of e x , 430 
of f{x), 489 
of sin x, 431, 483 
of tan -1 jc, 491 
interval of convergence, 486 
properties of, 490 
radius of convergence, 486 
for tanx, 526 
uniform convergence, 834 
in x - a, 488 
Powers of belief, 181 
Pressure, 252 
atmospheric, 283 

Prey-predator equations, Volterra’s, 290 
Price function, 152 
Prime number, 437 
Primes, series of reciprocals of, 480 
Principle, Archimedes, 324 
Principal axis of a hyperbola, 544 
Principle of mathematical induction, 850 
Probability, 307 
Probability density, 416 
standard normal, 419 
Probability density function, 415 
normal (Gaussian), 418 
Problem(s): 
of areas, 51 
brachistochrone, 596 
construction, 566 
of tangents, 51 
Product: 
cross, 640 
dot, 636 

of numerical series, 827-829 
of power series, 514-516 
of series, 515 
Product rule, 88, 112 
Product, Wallis’s, 275, 361, 380, 472, 477 
Profit function, 151 
Profit, marginal, 151 
Projection: 
scalar, 636 
vector, 636 

Prolate spheroid, 229, 541, 657 
Proper rational function, 351 
Properties of power series, 490 
Proportional: 
mean, 584 
third, 584 
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Pseudosphere, 347 
Ptolemaic system, 620 
Ptolemy, 620 
Pythagoras, 9 

Pythagorean theorem, 7, 47, 578 

Quadrant, 6 
Quadratic formula, 22 
Quadratic function, 29 
Quadratrix, 574 
Quadric surface, 656 
Quandt, R. E., 689 
Quotient rule, 90 

Raabe, J. L., 821 
Raabe’s test, 821 
Radially simple, 727 
Radian, 39, 292 
Radioactive decay, 278 
Radiocarbon dating, 281 
Radius: 

of circle, 532 
of convergence, 486, 488 
of curvature, 614 
of gyration, 396 
Ramanujan, Srinivasa, 500 
Range, 23 

Rate of change, 65, 139 
Ratio test, 461 
Rational function, 27, 351 
improper, 351 
proper, 351 

Rational number, 2, 444 
Rational point, 479 
Rationalizing substitution, 816 
Reaction: 

first-order, 283 
second-order, 283 
Real line, 3 
Real numbers, 787 
Reciprocals of primes, series of, 480 
Rectangular (or Cartesian) coordinates, 633 
Rectangular coordinate system, 5 
Rectangular coordinates in space, 632 
Rectangular hyperbola, 546 
Rectilinear motion, 181 
Recursion formula, 511 
Reduction formula, 344, 356, 360 
Reflection, law of, 134-135 
Reflection property: 
of ellipse, 539 
of hyperbola, 548 
of parabola, 88, 534 
Refraction: 
index of, 136 

Snell’s law of, 135, 629, 702 
Region, simply connected, 781 
Relative maximum, 692 
Remainder formula, 496, 500, 501 


Repeated integral (see Iterated integral) 
Repeating decimal, 443 
Rest mass, 251 
Restoring force, 320 
Revenue function, 151 
Revenue, marginal, 151 
Revolution: 
solid of, 225 
surface of, 240, 655 
Rhetoric and invective, 382 
Richter, Charles, 264 
Richter scale, 263 

Riemann, B„ 202, 274, 707, 827, 833 
integral, 203 

rearrangement theorem, 827 
Riemann sums, 203 
Riemannian Geometry, 202 
Right circular cone, 529 
Right circular cylinder, 653 
Right triangle trigonometry, 39 
Right-hand thumb rule, 640 
Ritt, J. F., 790, 813, 816 
Robbins, H., 522 
Roberval, G., 584 
Robinson, A., 169 
Rolle, Michel, 800 
Rolle’s theorem, 800 
Roller, D„ 283 
Root test, 463 
Rose, four-leaved, 566 
Rosen, M., 565 
Rotation of axes, 554 
Routine research, 202 
Rule: 

parallelogram, 602 
trapezoidal, 370 
Ruled surface, 660 

Ruler-and-compass constructions, 566-567 
Rulings on surface, 654 
Rumor, spread of, 287 

Saddle point, 660 
Saddle surface, 660 
Scalar, 600 
Scalar field, 752 
Scalar projection, 636 
Schwarz, H. A., 747 
Schwarz’s inequality, 160, 702 
Sears, F. W., 691 
Secant, 294 
Second derivative, 107 
Second derivative test, 122, 694 
Second law of motion, Newton’s, 182, 251, 
608, 761 

Second partial derivative, mixed, 673 
Second-order partial derivatives, 672 
Second-order reaction, 283 
Segment of a sphere, volume of, 226 
Semimajor axis of ellipse, 536 


Semiminor axis of ellipse, 536 
Separation of variables, 179, 272 
Sequence, 432 
bounded, 432 
convergent, 433 
decreasing, 438 
Fibonacci, 471 
increasing, 435 
limit, 433 

lower bound of, 432 
of partial sums, 439 
unbounded, 433 
upper bound of, 433 
Series, 427, 439 
Abel’s, 457 

absolutely convergent, 467 
alternating, 465 
binomial, 503 

conditionally convergent, 468 

convergent, 439 

cosine, 431,450 

divergent, 439 

division of, 516, 836 

for e, 447 

geometric, 440 

harmonic, 441 

hypergeometric, 489, 824 

infinite, 427, 439 

interval of convergence of, 486 

inverse tangent, 498 

Leibniz’s, 317, 441,499 

In 2, 459 

logarithm, 490-491 
Maclaurin’s, 500 
multiplication of, 514, 827-829 
/?-, 453, 456 
partial sums of, 439 
power, 484 
in x — a, 488 
product of, 515 
sine, 497 
sum of, 439 
for tangent, 526 
Taylor, 495 
for cosjc, 498 
for e x , 496 

for even and odd functions, 518 
of/(*), 495 

in powers of x — a, 500 
for sin x, 497 
telescopic, 441 
Shakespeare, William, 276 
Shell method, 231 
Sigma notation, 194 
Sigmoid growth curve, 285 
Simmons, G. F., 2, 47, 205, 413, 423, 437, 
501, 599, 614, 689, 701, 815, 824 
Simple closed curve, 765 
positively oriented, 765 
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Simple curve, 765 
Simple harmonic motion, 319 
Simply connected region, 781 
Simpson, T., 371 
Simpson’s rule, 372 
Sine, 293 

of an acute angle, 38 
of an arbitrary angle, 40 
Euler’s infinite product for, 477 
graphs of, 42, 298 
hyperbolic, 324 
inverse, 313 

power series expansion of, 431, 483 
values of, 42 
Sines, law of, 50 
Skepticism, 94 
Slope of a line, 11 
Slope-intercept equation, 13 
Smith, D. E., 598 
Smooth curve, 239, 610 
Snell, W., 135 

Snell’s law of refraction, 135, 629, 702 
Solid of revolution, 225 
Speed, 64, 182, 607 
orbital, 618 
Sphere: 

equation of, 635 
surface area of, 228, 243 
volume of, 257, 728, 734, 737, 740 
“Sphere”: 

five-dimensional, 736 
four-dimensional, 736 
Spherical coordinates, 662 
Spherical shell, gravitational attraction of, 
741 

Spherical triangle, 747 
Spherical zone, 244 
Spheroid: 

oblate, 229, 541, 657 
prolate, 229, 541, 657 
Spiral: 

of Archimedes, 572, 579 
equiangular, 578 
exponential, 578 
hyperbolic, 572 

Spread of infectious disease, 357 
Spring constant, 245 
Square, completing the, 348 
Squeeze theorem, 794 
Standard deviation, 416 
Standard normal distribution, 420 
Standard normal probability density, 

419 

Stirling, J., 472 
Stirling’s formula, 472 
Stokes, Sir George, 778 
Stokes’ theorem, 780 
String property of ellipse, 536 
Subnormal, 558 


Substitution: 
inverse, 817 
method of, 337 
rationalizing, 816 
trigonometric, 344 
Subtangent, 558 
Subtraction formulas, 45, 296 
Sum(s): 
lower, 199 
partial, 439 
Riemann, 203 
of a series, 439 
upper, 200 
Surface: 

area of a sphere, 243 
doubly ruled, 660 
equipotential, 668 
flux through, 774 
isothermal, 668 
level, 668 

piecewise smooth, 775 
quadric, 656 
of revolution, 240, 655 
ruled, 660 
rulings on, 654 
saddle, 660 
Surface integral, 774 
Symmetric equations of line, 647 
Synchronous orbit, 619 
System: 

heiiocentic, 620 
Ptolemaic, 620 

Tan x, power series for, 526 
Tan -1 x, power series for, 498 
Tangent, 294 
graphs of, 298 
inverse, 314 
power series for, 526 
problem of, 51 
Tangent line, 5 3 
approximation, 166 
Tangent plane, 676 
Taste of victory, 380 
Tautochrone property, 598, 628 
Taylor, A. E., 719 
Taylor, Brook, 495 
Taylor: 

coefficients, of /(x), 495 
formula with derivative remainder, 496, 
500 

polynomial, nth-degree, 501 
series: 

for cos x, 498 
for e x , 496 

for even and odd functions, 518 
of f (x), 495 

in powers of x — a, 500 
for sin x, 497 


Telescopes of Cassegrain type, 548 
Telescopic series, 441 
Terminal velocity, 286 
Test: 

Abel’s, 833 
alternating series, 465 
Cauchy: 

condensation, 474 
integral, 456 
comparison, 451 
limit, 452 
Dirichlet’s, 831 
Gauss’s, 823 
integral, 456 
limit comparison, 452 
nth term, 442 
Raabe’s, 821 
ratio, 461 
root, 463 

second derivative, 122, 694 
Thales, 9, 16 
Theorem: 

of Algebra, fundamental, 353, 522 
binomial, 84, 110, 845 
boundedness, 796 
Brouwer’s, 800 

of Calculus, fundamental, 190, 208, 212, 
758 

divergence, 775 
extreme value, 77, 197, 797 
Fermat’s, 78 
last, 21, 591, 821 
four squares, 21 
four vertex, 615 
Gauss’s, 775 
Green’s, 765, 780 
implicit function, 710, 840 
intermediate value, 78, 143, 799 
Rummer’s, 821 

mean value, 76, 171, 237, 399, 802 
generalized, 401, 803 
parallel axis, 397 
partial fractions, 354, 818 
Pythagorean, 7, 47, 578 
Riemann’s rearrangement, 827 
Rolle’s, 800 
squeeze, 794 
Stokes’, 780 
Torricelli’s, 593 
uniform continuity, 806 
Wren’s, 593, 627 
Thinking out loud, 365 
Third proportional, 584 
Three-step rule, 58 
Throsby, C. D., 689 
Tietze, H., 106, 566 
Titchmarsh, E. C., 523 
Torque, 394 
vector, 645 
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Torricelli, Evangelista, 594 
Torricelli’s law, 230 
Torricelli’s theorem, 593 
Torus, 229, 392 
Total differential, 680 
Total energy, 248 
Tractrix, 347 
Transcendence of e, 815 
Transcendental function, 27 
Transcendental number, 815 
Trapezoidal rule, 370 
Triangle, spherical, 747 
Trigonometric functions, 292 
inverse, 313 

Trigonometric substitution, 344 
Trigonometry: 
analytic, 39 
right triangle, 39 
Triple integral, 731 
Tunnel through center of earth, 321 

Ultimate Certainty, 119 
Unbounded, 433 
Unconditional convergence, 830 
Uniform circular motion, 608 
Uniform continuity, 839 
theorem on, 806 
Uniform convergence, 490, 834 
for power series, 834 
Universal gravitation, Newton’s inverse 
square law of, 621 
Upper bound, 433, 789 
Upper bound, least, 789 
axiom, 789 

Upper limits of integration, 201 
Upper sum, 200 

Value, 23 
critical, 116 
maximum, 116 
minimum, 116 
Of 77 , 500 
Values: 
of cosine, 42 


Values (Cont.)\ 
of sine, 42 
Variable(s): 
dependent, 24 
dummy, 214 
independent, 24 
of integration, 201 
separation of, 272 
Vascular resistance, 312 
Vector, 600 
continuous, 606 
derivative of, 606 
field, 752 
flux of, 772 
length of, 634 
torque, 645 
Vector analysis, 605 
Vector equation: 
of line, 647 
of plane, 548 
Vector projection, 636 
Velocity, 64, 182, 607 
angular, 394 
average, 63 
escape, 186 
instantaneous, 64 
terminal, 286 
Vertex: 

angle of a cone, 529 
of a cone, 529 
of a parabola, 18, 533 
Vertically simple, 720 
Vertices: 
of ellipse, 536 
of a hyperbola, 544 
Victory, taste of, 380 
Volcanic ash problem, 362 
Volterra’s prey-predator equations, 290 
Volume, 719 
algebraic, 720 
element of, 255 
geometric, 720 

of a segment of a sphere, 226 
of a sphere, 728, 734, 737, 740 


Wall, H. S., 423 
Wallis, John, 382 

Wallis’s product, 275, 361, 380, 472, 477 
Washer method, 228 
Watson, E. C., 283 
Watson, Fletcher G., 540 
Wave equation, 675, 702 
one-dimensional, 705 
Weber, E. H., 290 
Weight, 183, 245 
Whispering gallery, 540 
White dwarfs, 187 
Whitehead, A. N., 105 
Whitmore, R. L., 233 
Whittaker, Sir Edmund, 461 
Widder, D. V., 747 
Wiles, Andrew, 21 
Witch, 591 
Work, 244, 638, 751 
element of, 246 
Working heart, 248 
World population, 289 
Wrathful Church, 620 
Wren, Christopher, 594 
Wren’s theorem, 593, 627 
Wright, E. M., 595 
Wundt, W., 290 

j:-axis, 5 
^-coordinate, 6 
^-intercept, 15 
jcy-plane, 6 

y-axis, 5 
y-coordinate, 6 
y-intercept, 13 

Zero of a function, 31 
Zeta function, 474, 480 
Zeuner, F. E., 280 
Zone, spherical, 244 
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